Displays a character string representing content of speech in
synchronization with reproduction of the speech. An apparatus includes: a
unit for obtaining scenario data representing the speech; a unit for
dividing textual data resulting from recognition of the speech to
generate pieces of recognition pieces of recognition data; a unit for
detecting in the scenario data a character matching each character
contained in each piece of recognition data for which no matching
character string has been detected to detect in the scenario data a
character string that matches the piece of recognition data; and a unit
for setting the display timing of displaying each of character strings
contained in the scenario data to the timing at which speech recognized
as the piece of recognition data that matches the character string is
reproduced.