A method of extracting audio excerpts comprises: segmenting audio data
into a plurality of audio data segments; setting a fitness criteria for
the plurality of audio data segments; analyzing the plurality of audio
data segments based on the fitness criteria; and selecting one of the
plurality of audio data segments that satisfies the fitness criteria. In
various exemplary embodiments, the method of extracting audio excerpts
further comprises associating the selected one of the plurality of audio
data segments with video data. In such embodiments, associating the
selected one of the plurality of audio data segments with video data may
comprise associating the selected one of the plurality of audio data
segments with a keyframe.