Accordingly, the invention is a method for automatic deduction of rules
for matching document content to a category within a strange taxonomy, which allows
the document to be automatically classified into a proper category for storage
in that strange taxonomy. The method includes the steps of spidering the taxonomy
to determine its structure and contents, extracting keywords from documents within
the strange taxonomy, formulating rules for determining the category from the extracted
keywords, and applying the rules to classify a new document whose keywords have
been extracted. The taxonomy is strange because the user has no knowledge of its
internal structure and needs no such knowledge. The taxonomy may be flat or may
be hierarchal, the later having rules formulated at each level for proceeding to
the next level. Variations for creating new and refurbishing old document management
systems are disclosed.