A method and apparatus for generating a pronunciation score by receiving a
user phrase intended to conform to a reference phrase and processing the
user phrase with at least one of an articulation-scoring engine, a
duration-scoring engine, and an intonation-scoring engine to derive the
pronunciation score. The scores provided by the
various scoring engines are adapted to provide visual and/or numerical
feedback on the correctness or incorrectness of one or more speech
features, such as intonation, articulation, voicing, phoneme error, and
relative word duration. This interactive feedback allows a user to
quickly identify the problem area and take remedial action when reciting
"tutor" sentences or phrases.
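
As an illustration only, the combination of engine scores into a single pronunciation score and a feedback message might be sketched as follows. The abstract does not say how the scores are combined, so the weighted average, the 0 to 100 scale, and the names `pronunciation_score`, `weakest_feature`, and `feedback` are all assumptions made for this sketch, not the claimed method.

```python
from typing import Dict, Optional

# Engine names taken from the abstract; everything else is hypothetical.
ENGINES = ("articulation", "duration", "intonation")


def pronunciation_score(
    scores: Dict[str, float],
    weights: Optional[Dict[str, float]] = None,
) -> float:
    """Combine whichever engine scores are present into one score.

    Assumption: a weighted average on a 0-100 scale. Engines with no
    entry in `scores` are skipped, matching "at least one of".
    """
    used = {name: value for name, value in scores.items() if name in ENGINES}
    if not used:
        raise ValueError("at least one engine score is required")
    weights = weights or {}
    # Engines without an explicit weight default to a weight of 1.0.
    total_weight = sum(weights.get(name, 1.0) for name in used)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    weighted_sum = sum(value * weights.get(name, 1.0) for name, value in used.items())
    return weighted_sum / total_weight


def weakest_feature(scores: Dict[str, float]) -> str:
    """Return the speech feature with the lowest score (the 'problem area')."""
    return min(scores, key=lambda name: scores[name])


def feedback(scores: Dict[str, float]) -> str:
    """Build a short numerical feedback message for the user."""
    overall = pronunciation_score(scores)
    return f"Overall score: {overall:.1f}. Focus on: {weakest_feature(scores)}."


if __name__ == "__main__":
    print(feedback({"articulation": 80.0, "duration": 60.0, "intonation": 70.0}))
```

Skipping absent engines keeps the sketch consistent with a system that runs only one or two of the three scoring engines; a real system could equally use a different combination rule.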