Overlapping subdocuments in a vector space search process

The present invention is a method and apparatus for retrieving information from a database. Initially, the documents within the database are divided into mutually exclusive subdocuments that generally correspond to paragraphs of text. The invention then creates a second set of subdocuments, each of which overlaps adjacent paragraphs of text. In particular, the location of each overlapping subdocument depends on the size of the initial paragraphs. This second set of overlapping subdocuments is scored in the same way as the mutually exclusive subdocuments. The scores from both the mutually exclusive and the overlapping subdocuments are used in ranking the relevance of documents to a query. Using both sets of subdocument scores improves the effectiveness of the scoring algorithm.
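
The process described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact rule for placing the overlapping subdocuments, the term weighting, or the way the two sets of scores are combined, so the following choices are assumptions made purely for illustration: each overlapping subdocument runs from the midpoint of one paragraph to the midpoint of the next, subdocuments are scored by cosine similarity over raw term counts, and a document's score is the best score among all of its subdocuments. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase alphanumeric tokens; a simple stand-in for real term extraction."""
    return re.findall(r"[a-z0-9]+", text.lower())


def exclusive_subdocuments(document):
    """Split a document into mutually exclusive subdocuments, one per paragraph."""
    return [tokenize(p) for p in re.split(r"\n\s*\n", document) if p.strip()]


def overlapping_subdocuments(paragraphs):
    """Build one subdocument for each pair of adjacent paragraphs.

    Assumption: each overlap runs from the midpoint of one paragraph to the
    midpoint of the next, so its location depends on the paragraph sizes.
    """
    return [
        left[len(left) // 2:] + right[:len(right) // 2]
        for left, right in zip(paragraphs, paragraphs[1:])
    ]


def cosine(query_terms, subdoc_terms):
    """Cosine similarity between raw term-count vectors (assumed weighting)."""
    q, d = Counter(query_terms), Counter(subdoc_terms)
    dot = sum(q[t] * d[t] for t in q)
    norm = math.sqrt(sum(v * v for v in q.values())) * math.sqrt(
        sum(v * v for v in d.values())
    )
    return dot / norm if norm else 0.0


def score_document(query, document):
    """Score both sets of subdocuments the same way and keep the best score.

    Assumption: the document score is the maximum subdocument score.
    """
    paragraphs = exclusive_subdocuments(document)
    candidates = paragraphs + overlapping_subdocuments(paragraphs)
    q = tokenize(query)
    return max((cosine(q, s) for s in candidates), default=0.0)
```

Under these assumptions, a query whose terms straddle a paragraph boundary can score higher against the overlapping subdocument than against either paragraph alone, which is the situation the second set of subdocuments is meant to address.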
