Music videos are automatically produced from source audio and video signals.
Each music video contains edited portions of the video signal synchronized with
the audio signal. An embodiment detects transition points in the audio signal and
the video signal. The transition points are used to align the video and audio
signals in time. The video signal is edited according to its alignment with the audio
signal. The resulting edited video signal is merged with the audio signal to form
a music video.
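
The steps above can be illustrated with a minimal sketch. The abstract does not say how transition points are detected or how the alignment is computed, so everything specific here is an assumption made for illustration: transitions are taken to be frame-to-frame jumps above a threshold in a per-frame feature (for example audio energy or mean image brightness), and alignment is a simple in-order greedy match that trims each video clip to the length of the audio segment it accompanies. The function names `detect_transitions`, `to_segments`, and `align` are hypothetical.

```python
from typing import List, Tuple


def detect_transitions(values: List[float], threshold: float) -> List[int]:
    """Return frame indices where the feature jumps by more than `threshold`.

    Assumption: a transition point is a large change between adjacent frames.
    """
    return [i for i in range(1, len(values))
            if abs(values[i] - values[i - 1]) > threshold]


def to_segments(points: List[int], length: int) -> List[Tuple[int, int]]:
    """Split the range [0, length) into segments bounded by transition points."""
    bounds = [0] + sorted(p for p in points if 0 < p < length) + [length]
    return [(a, b) for a, b in zip(bounds, bounds[1:]) if b > a]


def align(audio_points: List[int], audio_len: int,
          video_points: List[int], video_len: int) -> List[Tuple[int, int, int]]:
    """Build an edit list of (audio_start, video_start, video_end) triples.

    Assumption: each audio segment takes the next video clip, in order, that
    is at least as long as the segment; the clip is trimmed to that length so
    every cut in the edited video falls on an audio transition point.
    """
    clips = to_segments(video_points, video_len)
    edits = []
    next_clip = 0
    for a_start, a_end in to_segments(audio_points, audio_len):
        need = a_end - a_start
        # Skip clips that are too short to cover this audio segment.
        while (next_clip < len(clips)
               and clips[next_clip][1] - clips[next_clip][0] < need):
            next_clip += 1
        if next_clip == len(clips):
            break  # ran out of usable video
        v_start = clips[next_clip][0]
        edits.append((a_start, v_start, v_start + need))
        next_clip += 1
    return edits


# Toy data: per-frame audio energy, and video cut positions in frames.
audio_energy = [0, 0, 0, 5, 5, 5, 5, 0, 0, 0]
audio_points = detect_transitions(audio_energy, threshold=2)
edit_list = align(audio_points, len(audio_energy), [4, 9, 12], 20)
print(edit_list)
```

Each triple in the edit list says which span of the source video plays starting at a given audio position. Merging the trimmed clips with the unmodified audio track would then yield the final music video; that step depends on the media tooling used and is left out of the sketch.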