When a pause request occurs, an audio delay time constituted by a delay
time of frames of audio data based on the video data frames is found.
During the pause, a frame offset time constituted by the offset of the
frame start time of the video data and the audio data is monitored. When
a pause release request is issued, based on the audio delay time and the
frame offset time, the audio correction time to be corrected in the pause
request is calculated. Then, when it is judged that the audio data is
advanced with respect to the video data based on the audio correction
time cumulatively added for each pause request, the video data is delayed
by one frame with respect to the audio data, while when it is judged that
the audio data is delayed with respect to the video data, the audio data
is delayed by one frame with respect to the video data.