A method and computer-readable medium convert the text of a word and a user's
pronunciation
of the word into a phonetic description to be added to a speech recognition lexicon.
Initially, two possible phonetic descriptions are generated. One phonetic description
is formed from the text of the word. The other phonetic description is formed by
decoding a speech signal representing the user's pronunciation of the word. Both
phonetic descriptions are scored based on their correspondence to the user's pronunciation.
The phonetic description with the highest score is then selected for entry in the
speech recognition lexicon.