A text processing method is provided that includes the following steps.
First, an abstract mathematical vector space is generated based on a
collection of documents. Respective documents in the collection of
documents have a representation in the abstract mathematical vector space
and respective terms contained in the collection of documents have a
representation in the abstract mathematical vector space. Then, the
abstract mathematical vector space is perturbed to produce a perturbed
abstract mathematical vector space that is stored in an electronic format
accessible to a user. Perturbing the abstract mathematical vector space
may include modifying the representation of a document with a newly
computed representation for that document, or modifying the
representation of a term with a newly computed representation for that
term.