A method for generating, classifying, searching, and analyzing
standardized text templates drawn from a plurality of text documents and
for identifying standardized text deviations from standardized text
templates. Semi-standardized documents may be represented as standardized
templates and deviations from standardized templates, with such templates
themselves automatically generated by a computer-implemented method from
a plurality of similar text documents. The method enables enhanced
analysis of semi-standardized documents and automatic extraction of
information from standardized text templates.