A system and method for merging scenes in a video sequence and constructing a
keyframe
to represent the underlying merged video content includes decomposing a video sequence
into a series of component scenes, merging component scene pairs until a predetermined
number of scene sets remain, extracting a keyframe from each scene set containing
a single component scene, and constructing a new keyframe for each scene set containing
a plurality of component scenes.