An arrangement is provided for generating a reduced unit database of a desired
size for use in text-to-speech operations. The reduced unit database is generated
from a full unit database. The reduction is carried out with
respect to a text database containing a plurality of sentences. Units from the
full database are pruned so as to minimize the overall cost associated with using
alternative units in place of the pruned units.
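
The pruning described above can be illustrated with a minimal sketch. The abstract does not specify the search strategy, the unit representation, or the cost function, so everything below is an assumption made for illustration: each unit is reduced to a hypothetical (type, feature) pair, the cost of using a unit for a target is the absolute feature difference, and a simple greedy loop removes, one at a time, the unit whose removal raises the overall cost over the text database the least.

```python
from typing import Dict, List, Tuple

# Hypothetical representations (not from the source):
Unit = Tuple[str, float]    # (unit type, a single acoustic feature value)
Target = Tuple[str, float]  # (unit type, desired feature value)


def overall_cost(units: Dict[int, Unit], sentences: List[List[Target]]) -> float:
    """Sum, over every target in the text database, the cost of the best
    available unit of the same type. Cost is an assumed absolute difference."""
    total = 0.0
    for sentence in sentences:
        for unit_type, wanted in sentence:
            costs = [abs(feat - wanted) for t, feat in units.values() if t == unit_type]
            if not costs:
                return float("inf")  # a target can no longer be synthesized
            total += min(costs)
    return total


def prune(full_db: Dict[int, Unit], sentences: List[List[Target]],
          desired_size: int) -> Dict[int, Unit]:
    """Greedy sketch: repeatedly drop the unit whose removal gives the
    lowest overall cost, until the database has the desired size."""
    reduced = dict(full_db)
    while len(reduced) > desired_size:
        best_id, best_cost = None, float("inf")
        for uid in reduced:
            trial = {k: v for k, v in reduced.items() if k != uid}
            cost = overall_cost(trial, sentences)
            if cost < best_cost:
                best_id, best_cost = uid, cost
        if best_id is None:
            break  # every further removal would leave some target uncovered
        del reduced[best_id]
    return reduced


# Toy data: five units of two types, and a two-sentence text database.
full_db = {0: ("a", 1.0), 1: ("a", 1.1), 2: ("a", 5.0), 3: ("b", 2.0), 4: ("b", 2.05)}
sentences = [[("a", 1.0), ("b", 2.0)], [("a", 5.0), ("a", 1.1)]]
reduced = prune(full_db, sentences, desired_size=3)
```

On this toy data the near-duplicate units are the ones removed, while the only unit close to the target value 5.0 and the last remaining unit of type "b" are kept. A greedy search is only one possible strategy and is not guaranteed to find the minimum-cost reduced database.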