A method of making a digital voice library utilized for converting text to concatenated
voice in accordance with a set of playback rules includes generating a complex
tone that reflects a particular inflection required for a particular voice recording
of a particular speech item. The complex tone is composed of portions of a recording
of a voice talent uttering a vocal sequence. The voice talent is recorded reciting
the particular speech item to make the particular voice recording. The voice talent
uses the complex tone as a guide to allow the voice talent to recite the particular
speech item in accordance with the particular inflection.