A summary reproducing apparatus, capable of reproducing a summary
accurately for each type of video information and of reducing the burden of
generating digest information, is provided with an audio feature amount
extraction unit for obtaining an audio feature amount from entered
audio/video information on the basis of a preset parameter, a genre
information obtaining unit for obtaining genre information from
additional information added to the entered audio/video information, a
decision parameter setting unit for setting, on the basis of the genre
information, an optimum parameter for extracting the audio feature amount, and
a control unit for deciding digest segments to be extracted from stored
audio/video information on the basis of the audio feature amount obtained
with the set parameter and for controlling a reproduction unit on the
basis of the digest segments, wherein a summary is reproduced by using a
parameter optimized for each genre.
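
The flow described above (genre information selects a parameter, the parameter drives audio feature extraction, and the extracted feature decides the digest segments) might be sketched as follows. This is only an illustration under stated assumptions, not the apparatus itself: the abstract does not say which audio feature is used, so silent sections detected by a level threshold are assumed here, and the names `Params`, `GENRE_PARAMS`, `set_parameter`, `extract_silent_sections`, and `decide_digest_segments`, together with every numeric value, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Params:
    """Hypothetical per-genre extraction parameters (all values are assumptions)."""
    silence_level: float      # a frame below this audio level counts as silent
    min_silence_frames: int   # shortest silent run that counts as a feature
    digest_frames: int        # length of the segment kept after a silent run


# Assumed genre table; the abstract only says the parameter is optimized per genre.
GENRE_PARAMS = {
    "news": Params(silence_level=0.05, min_silence_frames=3, digest_frames=4),
    "sports": Params(silence_level=0.20, min_silence_frames=2, digest_frames=3),
}
DEFAULT_PARAMS = Params(silence_level=0.10, min_silence_frames=3, digest_frames=3)


def set_parameter(genre):
    """Decision parameter setting: pick the parameter set for a genre."""
    return GENRE_PARAMS.get(genre, DEFAULT_PARAMS)


def extract_silent_sections(levels, params):
    """Audio feature extraction: return (start, end) silent runs, end exclusive."""
    sections = []
    start = None
    for i, level in enumerate(levels):
        if level < params.silence_level:
            if start is None:
                start = i
        else:
            if start is not None and i - start >= params.min_silence_frames:
                sections.append((start, i))
            start = None
    # Close a silent run that reaches the end of the input.
    if start is not None and len(levels) - start >= params.min_silence_frames:
        sections.append((start, len(levels)))
    return sections


def decide_digest_segments(levels, genre):
    """Control step: keep a fixed-length segment after each silent section."""
    params = set_parameter(genre)
    segments = []
    for _, end in extract_silent_sections(levels, params):
        if end < len(levels):
            stop = min(end + params.digest_frames, len(levels))
            segments.append((end, stop))
    return segments
```

Calling `decide_digest_segments(levels, "news")` and `decide_digest_segments(levels, "sports")` on the same list of per-frame audio levels can yield different segments, which is the point of selecting the parameter by genre; a reproduction unit would then play back only the returned frame ranges.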