A method of speech recognition is provided that determines a
production-related value, vocal-tract resonance frequencies in
particular, for a state at a particular frame based on the
production-related values associated with two preceding frames using a
recursion. The production-related value is used to determine a
probability distribution of the observed feature vector for the state. A
probability for an observed value received for the frame is then
determined from the probability distribution. Under one embodiment, the
production-related value is determined using a noise-free recursive
definition for the value. Use of the recursion substantially improves the
decoding speed. When the decoding algorithm is applied to training data
with known phonetic transcripts, forced alignment is created which
improves the phone segmentation obtained from the prior art.