Systems and methods for automatically and generically matching models are
provided, such as may be provided in a matching application or matching
component, or provided in a general purpose system for managing models.
The methods are generic since the methods apply to hierarchical data sets
outside of any particular data model or application. Similarity
coefficients are calculated for, and mappings are discovered between,
schema elements based on their names, data types, constraints, and schema
structure, using a broad set of techniques. Some of these techniques
include the integrated use of linguistic and structural matching, context
dependent matching of shared types, and a bias toward subtree, or leaf,
structure where much of the schema content resides.