Information that individual elements (characteristic character strings)
indicative of characteristics of a registered document appear in the
registered document is stored in advance. When calculating similarity of
the registered document, a query designated by a searcher is analyzed.
The query is represented by a characteristic vector having the individual
elements which take the relation between a plurality of words into
consideration. Pieces of appearance information of the individual words
contained in the query are counted. The counted appearance information is
compared with a searching index to calculate similarity between
documents.