An apparatus and method is provided for pruning an index of a corpus of
text documents by creating an inverted index of terms appearing in the
documents, wherein the index includes postings of the terms in the
documents, ranking the postings in the index, and pruning from the index
the postings below a given level in the ranking.