Parsing routines extract an entry from a conventional pronunciation
dictionary. The entry includes a dictionary word and dictionary phonemes
representing the pronunciation of that word. A correspondence
table is used to compress the pronunciation dictionary. The
correspondence table includes correspondence sets for a particular
language, each set having a correspondence text entry, a correspondence
phoneme entry representing the pronunciation of that text entry, and a
unique symbol identifying the correspondence set. A matching
system compares a dictionary entry with the correspondence sets, and
replaces the dictionary entry with the symbols representing the best
matches. In the absence of a match, symbols representing silent text or
unmatched phonemes can be used. The correspondence symbols representing
the best matches provide compressed pronunciation dictionary entries. The
matching system also generates decoder code sets for subsequently
translating the symbol sets. A decoder system uses the decoder code sets
for translating symbol sets in the compressed pronunciation dictionary to
generate phonemes corresponding to selected text.
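
The compression and decoding steps described above can be illustrated with a minimal Python sketch. It is not the claimed implementation: the table contents, the names `CorrespondenceSet`, `compress`, `build_decoder`, and `decode`, and the `_` and `+` prefixes for silent letters and unmatched phonemes are all hypothetical. The "best match" is assumed here to be the longest text match found by a greedy left-to-right scan.

```python
from typing import NamedTuple, Tuple


class CorrespondenceSet(NamedTuple):
    text: str                   # correspondence text entry (assumed non-empty)
    phonemes: Tuple[str, ...]   # correspondence phoneme entry
    symbol: str                 # unique identifying symbol


# Hypothetical table; a real one would be built for a particular language.
TABLE = [
    CorrespondenceSet("ph", ("f",), "A"),
    CorrespondenceSet("o", ("ow",), "B"),
    CorrespondenceSet("n", ("n",), "C"),
]

SILENT = "_"     # assumed prefix marking a silent letter
UNMATCHED = "+"  # assumed prefix marking an unmatched phoneme


def compress(word, phonemes, table=TABLE):
    """Replace a (word, phonemes) entry with a list of symbols."""
    phonemes = tuple(phonemes)
    symbols, i, j = [], 0, 0
    while i < len(word):
        # Sets whose text and phonemes both match at the current positions.
        candidates = [
            cs for cs in table
            if word.startswith(cs.text, i)
            and phonemes[j:j + len(cs.phonemes)] == cs.phonemes
        ]
        if candidates:
            # Assumption: "best" means the longest text match.
            best = max(candidates,
                       key=lambda cs: (len(cs.text), len(cs.phonemes)))
            symbols.append(best.symbol)
            i += len(best.text)
            j += len(best.phonemes)
        else:
            # No set matches: record the letter as silent text.
            symbols.append(SILENT + word[i])
            i += 1
    # Phonemes left over after the text is consumed are unmatched.
    symbols.extend(UNMATCHED + p for p in phonemes[j:])
    return symbols


def build_decoder(table=TABLE):
    """Decoder code sets: symbol -> (text, phonemes)."""
    return {cs.symbol: (cs.text, cs.phonemes) for cs in table}


def decode(symbols, decoder):
    """Translate a symbol list back into (word, phonemes)."""
    text, phonemes = [], []
    for s in symbols:
        if s in decoder:
            t, p = decoder[s]
            text.append(t)
            phonemes.extend(p)
        elif s.startswith(SILENT):
            text.append(s[len(SILENT):])
        elif s.startswith(UNMATCHED):
            phonemes.append(s[len(UNMATCHED):])
        else:
            raise ValueError(f"unknown symbol: {s!r}")
    return "".join(text), phonemes


compressed = compress("phone", ["f", "ow", "n"])
print(compressed)                           # ['A', 'B', 'C', '_e']
print(decode(compressed, build_decoder()))  # ('phone', ['f', 'ow', 'n'])
```

In this sketch the final "e" of "phone" has no matching correspondence set, so it is stored as silent text. Because unmatched phonemes are only appended after the text is consumed, the decoded phoneme order matches the original entry.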