A method and apparatus for extracting a model vector representation from
multimedia documents. A model vector provides a multidimensional
representation of the confidence with which multimedia documents belong
to a set of categories or with which a set of semantic concepts relate to
the documents. A model vector can be associated with multimedia documents
to provide an index of their content or categorization and can be used for
comparing, searching, classifying, or clustering multimedia documents. A
model vector can be used for purposes of information discovery,
personalizing multimedia content, and querying a multimedia information
repository.
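
As an illustration only, the following minimal sketch shows one way a model vector could be formed and used for searching. It assumes that some set of concept detectors has already produced a confidence score between 0 and 1 for each concept. The concept names, the function names `model_vector`, `cosine_similarity`, and `search`, the example documents and scores, and the choice of cosine similarity as the comparison measure are all hypothetical and are not taken from the description above.

```python
import math

# Hypothetical concept lexicon; the abstract does not name specific concepts.
CONCEPTS = ["outdoors", "face", "text_overlay", "music"]


def model_vector(scores, concepts=CONCEPTS):
    """Arrange per-concept confidence scores into a fixed-order vector.

    Concepts with no reported score are assumed to have confidence 0.0.
    """
    return [float(scores.get(concept, 0.0)) for concept in concepts]


def cosine_similarity(u, v):
    """Compare two model vectors; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query_vector, index, top_k=2):
    """Return the ids of the top_k indexed documents most similar to the query."""
    ranked = sorted(
        index.items(),
        key=lambda item: cosine_similarity(query_vector, item[1]),
        reverse=True,
    )
    return [doc_id for doc_id, _ in ranked[:top_k]]


# Illustrative index: document id -> model vector (all scores are made up).
index = {
    "beach_clip": model_vector({"outdoors": 0.9, "face": 0.2, "music": 0.1}),
    "news_clip": model_vector({"face": 0.8, "text_overlay": 0.9}),
    "concert_clip": model_vector({"outdoors": 0.4, "face": 0.5, "music": 0.95}),
}

# Query for documents that strongly relate to the "outdoors" concept.
results = search(model_vector({"outdoors": 1.0}), index, top_k=2)
```

Because every document is mapped to a vector over the same ordered set of concepts, the same vectors could in principle also be passed to a classifier or a clustering routine; the ranking function here is just one possible use.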