Noise-superimposed speech data is grouped according to acoustic
similarity, and sufficient statistics are prepared using the speech data
in each of the groups. A group acoustically similar to the voice data of a
user of the speech recognition system is selected, and sufficient statistics
acoustically similar to the user's voice data are selected from the
sufficient statistics in the selected group. Using the selected
sufficient statistics, an acoustic model is prepared.
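
The selection and model preparation steps described above can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the method as claimed: it assumes the sufficient statistics are the frame count, feature sum, and squared feature sum of a single diagonal Gaussian, that acoustic similarity is measured by squared Euclidean distance between mean vectors, and that the grouping of noise-superimposed speech data has already been done. The names `SuffStats`, `accumulate`, `select_group`, `select_stats`, and `build_model`, and the parameter `k`, are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class SuffStats:
    """Assumed form of the sufficient statistics for one diagonal Gaussian."""
    count: float
    total: List[float]      # per-dimension sum of feature values
    total_sq: List[float]   # per-dimension sum of squared feature values

    @property
    def mean(self) -> List[float]:
        return [t / self.count for t in self.total]


def accumulate(frames: Sequence[Sequence[float]]) -> SuffStats:
    """Accumulate statistics from (already noise-superimposed) feature frames."""
    dim = len(frames[0])
    total = [0.0] * dim
    total_sq = [0.0] * dim
    for frame in frames:
        for d, x in enumerate(frame):
            total[d] += x
            total_sq[d] += x * x
    return SuffStats(float(len(frames)), total, total_sq)


def sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance, used here as an assumed similarity measure."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def pooled_mean(stats: Sequence[SuffStats]) -> List[float]:
    """Mean obtained by pooling several sets of statistics."""
    n = sum(s.count for s in stats)
    dim = len(stats[0].total)
    return [sum(s.total[d] for s in stats) / n for d in range(dim)]


def select_group(groups: Sequence[Sequence[SuffStats]],
                 user_mean: Sequence[float]) -> Sequence[SuffStats]:
    """Pick the group whose pooled mean is closest to the user's voice data."""
    return min(groups, key=lambda g: sq_dist(pooled_mean(g), user_mean))


def select_stats(group: Sequence[SuffStats],
                 user_mean: Sequence[float], k: int) -> List[SuffStats]:
    """Within the selected group, keep the k statistics closest to the user."""
    return sorted(group, key=lambda s: sq_dist(s.mean, user_mean))[:k]


def build_model(stats: Sequence[SuffStats]) -> Tuple[List[float], List[float]]:
    """Prepare a Gaussian (mean, variance) directly from the pooled statistics."""
    n = sum(s.count for s in stats)
    mean = pooled_mean(stats)
    var = [sum(s.total_sq[d] for s in stats) / n - mean[d] ** 2
           for d in range(len(mean))]
    return mean, var
```

Because the model is computed from precomputed statistics rather than from the raw speech data, the final step in this sketch reduces to a few sums and divisions; a practical system would presumably hold such statistics for every state and mixture component of the acoustic model.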