A method and system for interesting relationships in text documents
includes generating a dictionary of keywords in the text documents,
forming categories of the text documents using the dictionary and an
automated algorithm, counting occurrences of the structured variables,
categories and structured variable/category combinations in the text
documents, and calculating probabilities of occurrences of the structured
variable/category combinations.