A method for converting text to concatenated voice by utilizing a digital
voice library and a set of playback rules is provided. The method
includes receiving and expanding text data to form a sequence of text and
pseudo words. The sequence of text and pseudo words is converted into a
sequence of speech items, and the sequence of speech items is converted
into a sequence of voice recordings. The method includes generating voice
data on the sequence of voice recordings by concatenating adjacent
recordings in the sequence of voice recordings.