A system and method for processing information in unstructured or
structured form, comprising a computer running in a distributed network
with one or more data agents. Associations among natural language
artifacts may be learned from unstructured data sources, and semantic
and syntactic relationships may be learned from structured data
sources, using grouping based on criteria of shared features that are
determined dynamically, without the use of a priori classifications, by
employing conditional probability constraints.
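
As an illustration only, the grouping described above might be sketched as
follows. This is a minimal sketch under stated assumptions, not the claimed
method: the function names, the sample documents, the use of document-level
co-occurrence as the shared feature, and the 0.6 threshold are all
hypothetical. Terms are grouped only when the conditional probability of each
given the other meets the threshold, so no predefined categories are needed.

```python
from collections import Counter
from itertools import combinations


def conditional_probabilities(docs):
    """Estimate P(b | a) for every ordered term pair from document co-occurrence."""
    term_counts = Counter()
    pair_counts = Counter()
    for doc in docs:
        terms = set(doc)
        term_counts.update(terms)
        for a, b in combinations(sorted(terms), 2):
            pair_counts[(a, b)] += 1
            pair_counts[(b, a)] += 1
    # P(b | a) = (documents containing both a and b) / (documents containing a)
    return {(a, b): n / term_counts[a] for (a, b), n in pair_counts.items()}


def group_terms(docs, min_prob=0.6):
    """Group terms whose conditional probabilities in BOTH directions meet min_prob.

    min_prob is an assumed, illustrative constraint; no categories are supplied.
    """
    probs = conditional_probabilities(docs)
    parent = {}

    def find(x):
        # Union-find lookup with path halving.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Register every term so that unlinked terms form singleton groups.
    for doc in docs:
        for term in doc:
            find(term)

    # Merge two terms only when the constraint holds in both directions.
    for (a, b), p in probs.items():
        if p >= min_prob and probs[(b, a)] >= min_prob:
            parent[find(a)] = find(b)

    groups = {}
    for term in list(parent):
        groups.setdefault(find(term), set()).add(term)
    return sorted((sorted(g) for g in groups.values()), key=lambda g: g[0])


if __name__ == "__main__":
    # Hypothetical sample data: each inner list is the term set of one document.
    sample_docs = [
        ["engine", "piston", "fuel"],
        ["engine", "piston"],
        ["engine", "piston", "valve"],
        ["loan", "interest", "bank"],
        ["loan", "interest"],
        ["bank", "fuel"],
    ]
    for group in group_terms(sample_docs):
        print(group)
```

With this sample data, "engine" and "piston" fall into one group and "interest"
and "loan" into another, while "bank", "fuel" and "valve" remain ungrouped
because no pairing involving them satisfies the constraint in both directions.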