A method is provided for detecting music in a speech signal having a
plurality of frames. The method comprises obtaining one or more first
pitch correlation candidates from a first frame of the plurality of
frames; obtaining one or more second pitch correlation candidates from a
second frame of the plurality of frames; selecting a pitch correlation
(Rp) from the one or more first pitch correlation candidates and the one
or more second pitch correlation candidates; and distinguishing music
from background noise based on analyzing the pitch correlation (Rp). The
method may further comprise filtering the speech signal using a one-order
low-pass filter prior to the obtaining the one or more first pitch
correlation candidates, and down sampling the speech signal by four prior
to the obtaining the one or more first pitch correlation candidates