Systems and methods for word sense disambiguation, including discerning one
or more senses or occurrences, distinguishing between senses or occurrences, and
determining a meaning for a sense or occurrence of a subject term. In a collection
of documents containing terms and a reference collection containing at least one
meaning associated with a term, the method includes forming a vector space representation
of terms and documents. In some embodiments, the vector space is a latent semantic
index vector space. In some embodiments, occurrences are clustered to discern or
distinguish a sense of a term. In preferred embodiments, meaning of a sense or
occurrence is assigned based on either correlation with an external reference source,
or proximity to a reference source that has been indexed into the space.