A method for processing source data from a plurality of diverse sources in
a selected data domain includes specifying a unified schema that lists
markup tags in the selected data domain that can exist in a document in
the markup language and defining correspondences of data fields from the
sources to the markup tags listed by the schema. The source data are
mapped in accordance with the correspondences to generate unified data in
the markup language.