An apparatus comprising a session file, session file editor, annotation
window, concatenation software and training software. The session file
includes one or more audio files and text associated with each audio file
segment. The session file editor displays text and provides text
selection capability and plays back audio. The annotation window operably
associated with the session file editor supports user modification of the
selected text, the annotation window saves modified text corresponding to
the selected text from the session file editor and audio associated with
the modified text. The concatenation software concatenates modified text
and audio associated therewith for two or more instances of the selected
text. The training software trains a speech user profile using a
concatenated file formed by the concatenating software. The session file
may have original audio associated with the selected text, wherein the
apparatus further comprises software for substituting the modified text
for the selected text. In some embodiments, the concatenation software
concatenates modified text and audio associated therewith for two or more
instances of the selected text. In some embodiments, the training
software trains a speech user profile using a concatenated file formed by
the concatenating software.