A method for building a classification model for classifying unclassified
documents based on the classification of a plurality of documents which
respectively have been classified as belonging to one of a plurality of
classes, said documents being digitally represented in a computer, said
documents respectively comprising a plurality of terms which respectively
comprise one or more symbols of a finite set of symbols, and said method
comprising the following steps: representing each of said plurality of
documents by a vector of n dimensions, said n dimensions forming a vector
space, whereas the value of each dimension of said vector corresponds to
the frequency of occurrence of a certain term in the document
corresponding to said vector, so that said n dimensions span up a vector
space; representing the classification of said already classified
documents into classes by separating said vector space into a plurality
of subspaces by one or more hyperplanes, such that each subspace
comprises one or more documents as represented by their corresponding
vectors in said vector space, so that said each subspace corresponds to a
class.