A method and system for evaluating telephone services provided by speech recognition
interfaces an evaluation engine with a voice recognition service over a telephone
system to submit speech utterance samples to the voice recognition service, receive
the response of the voice recognition service to the sample utterances, and determine
error and recognition of the sample utterances by the voice recognition service
by comparing actual voice recognition service responses to expected responses.
The evaluation engine permits evaluation of a voice recognition service for plural
glossaries in different contexts, such as through predetermined nodes of a voice
recognition service menu having plural glossaries.