A computerized method for automatic document classification based on a combined
use of the projection and the distance of the differential document vectors to
the differential latent semantics index (DLSI) spaces. The method includes the
setting up of a DLSI space-based classifier to be stored in computer storage and
the use of such classifier by a computer to evaluate the possibility of a document
belonging to a given cluster using a posteriori probability function and to classify
the document in the cluster. The classifier is effective in operating on very large
numbers of documents such as with document retrieval systems over a distributed
computer network.