A method for generating an index of the text of a video image sequence is provided.
The method includes the steps of determining the image text objects in each of
a plurality of frames of the video image sequence; comparing the image text objects
in each of the plurality of frames of the video image sequence to obtain a record
of frame sequences having matching image text objects; extracting the content for
each of the similar image text objects in text string format; and storing the text
string for each image text object as a video text object in a retrievable medium.