Video data are transmitted to a receiver. A language menu is displayed
from which a user selects a language that can be different from the
original language broadcast with the video data. Video data
identification information and language identification information
corresponding to the language selected from the menu is derived and
transmitted to e.g. an Internet server. The identification information is
used to select an audio translation data set from several audio
translation data sets stored in said server, wherein each of said several
audio translation data sets includes a language translation of original
audio data related to said video data. The selected audio translation
data set is sent to the receiver and reproduced synchronously together
with said video data.