A method for detecting music in a speech signal having a plurality of
frames. The method comprises defining a music threshold value for a first
parameter extracted from a frame of the speech signal, defining a
background noise threshold value for the first parameter, and defining an
unsure threshold value for the first parameter. The unsure threshold
value falls between the music threshold value and the background noise
threshold value. If the first parameter falls between the music threshold
value and the background noise threshold value, the speech signal is
classified as music or background noise based on analyzing a plurality of
first parameters extracted from the plurality of frames.