A method and device for correcting mispronunciations of a user, the method
comprising the following steps: providing a database comprising a
plurality of records, each of which comprises at least a textual and a
vocal representation of a specific word; training a speech recognition
module to recognize spoken utterances of said user comprising the words
represented by said records; generating word models for each recognized
spoken word; associating each word model with a respective database
record; after training said speech recognition module with sufficient
words, receiving a spoken utterance from said user; extracting a sequence of
words from said spoken utterance and generating a word model for each
extracted word; comparing said word models to the word models associated
with said database records; and constructing an audible output comprising
vocal representations obtained from the records whose word models match
the word models generated for said extracted words, wherein said word
models comprise features extracted from data of the words spoken by said
user.
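
By way of non-limiting illustration, the steps above may be sketched as follows. The sketch assumes that each word model is a fixed-length feature vector already extracted from the user's speech, and that matching is performed by nearest Euclidean distance under a threshold; neither assumption is stated in the source. The names Record, train, correct and max_distance are hypothetical and chosen for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Record:
    text: str    # textual representation of the word
    vocal: str   # stand-in for the reference audio, e.g. a clip identifier
    # Word model learned from this user's own pronunciation (assumed to be
    # a feature vector; feature extraction itself is not shown here).
    user_model: Optional[Tuple[float, ...]] = None


def train(records, training_models):
    """Associate each database record with the user's word model for it."""
    for record in records:
        record.user_model = training_models[record.text]


def correct(records, spoken_models, max_distance=1.0):
    """Build the output from records whose models match the spoken words.

    Words with no record within max_distance are skipped (an assumption).
    """
    output = []
    for model in spoken_models:
        best = min(records, key=lambda r: math.dist(r.user_model, model))
        if math.dist(best.user_model, model) <= max_distance:
            output.append(best.vocal)
    return output


# Usage with made-up two-dimensional feature vectors.
records = [Record("rabbit", "rabbit.wav"), Record("water", "water.wav")]
train(records, {"rabbit": (1.0, 0.0), "water": (0.0, 1.0)})
playlist = correct(records, [(0.9, 0.1), (0.1, 1.1)])
print(playlist)  # ['rabbit.wav', 'water.wav']
```

Because the stored word models are derived from the user's own utterances, a mispronounced word may still match its record, and the reference vocal representation is returned in its place.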