The invention is a method of and system for identifying a target form for increased
efficiency in an automated data capture process. Forms are scanned and stored as
digitized images. Regions are defined on the form relative to corresponding reference
points between the form and the digitized image. The regions are defined in areas
that contain anticipated digitized data from data fields of the form. Digitized
data is recognized through such means as optical character recognition (OCR) and
the resulting string variable is compared in form to a plurality of formats expected
for that data. Scoring systems are used to attain a resultant score for a number
of string variables which is compared to a predetermined confidence number. If
said confidence number is reached, the form is flagged as a target form and used
in the data capture process. A first step identification of certain graphical features
can be added as an initial determination as to the source of the form.