A method of generating an electronic text file from a paper-based document
that includes a plurality of characters includes capturing a plurality of
partially overlapping digital images of the document. Optical character
recognition is performed on each one of the plurality of captured digital
images, thereby generating a corresponding plurality of electronic text
files. Each one of the electronic text files includes a portion of the
plurality of characters in the document. The plurality of electronic text
files are compared with one another to identify characters that are in
common between the electronic text files. The plurality of electronic
text files are combined into a combined text file based on the
comparison. The combined text file includes the plurality of characters
in the document.