In a moving image playback apparatus, periods A, which represent human
utterance periods, and other periods B are determined on the basis of
sub-information contained in moving image data. Based on the moving image
data, periods A undergo high-speed moving image playback with playback
voice within the speed range from a normal speed to a predetermined speed
(e.g., 1.5 to 2 times the normal speed) at which the user can
recognize playback contents, while periods B undergo high-speed moving
image playback with at least the playback voice at a low volume, or
silent high-speed moving image playback, at a speed (e.g., 5 to 10 times
the normal speed) higher than the predetermined speed. During the
playback, the moving image playback speeds can be adjusted in accordance
with user attribute information registered in a user profile (14).
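
The speed assignment described above can be illustrated with a minimal
sketch. It assumes the periods have already been classified from the
sub-information (that step is not shown), and it uses the example ranges
from the text (1 to 2 times for periods A, 5 to 10 times for periods B).
The names Period, UserProfile, plan_playback and playback_duration, the
profile fields, and the 0.3 volume cap are hypothetical and chosen only for
illustration; they are not taken from the apparatus itself.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Example limits taken from the ranges quoted in the text.
NORMAL_SPEED = 1.0
MAX_SPEECH_SPEED = 2.0               # the "predetermined speed" for periods A
MIN_SKIP_SPEED, MAX_SKIP_SPEED = 5.0, 10.0
MAX_SKIP_VOLUME = 0.3                # assumed cap for "low volume" in periods B


@dataclass
class Period:
    start: float      # seconds
    end: float        # seconds
    is_speech: bool   # True for periods A (human utterance), False for periods B


@dataclass
class UserProfile:
    # Hypothetical per-user preferences standing in for the
    # "user attribute information" held in the user profile.
    speech_speed: float = 1.5
    skip_speed: float = 5.0
    skip_volume: float = 0.0         # 0.0 means silent playback of periods B


def clamp(value: float, low: float, high: float) -> float:
    return max(low, min(high, value))


def plan_playback(periods: List[Period],
                  profile: UserProfile) -> List[Tuple[float, float, float, float]]:
    """Return (start, end, speed, volume) for each period."""
    plan = []
    for p in periods:
        if p.is_speech:
            # Periods A: audible playback, kept at or below the predetermined speed.
            speed = clamp(profile.speech_speed, NORMAL_SPEED, MAX_SPEECH_SPEED)
            volume = 1.0
        else:
            # Periods B: faster than the predetermined speed, quiet or silent.
            speed = clamp(profile.skip_speed, MIN_SKIP_SPEED, MAX_SKIP_SPEED)
            volume = clamp(profile.skip_volume, 0.0, MAX_SKIP_VOLUME)
        plan.append((p.start, p.end, speed, volume))
    return plan


def playback_duration(plan: List[Tuple[float, float, float, float]]) -> float:
    """Wall-clock time needed to play the whole plan."""
    return sum((end - start) / speed for start, end, speed, _ in plan)
```

For instance, with a 30-second period A followed by a 100-second period B,
a profile requesting 1.5 times and 10 times respectively would play the
130 seconds of material in 30/1.5 + 100/10 = 30 seconds.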