A method, system, and computer program product are provided for extracting
information from a plurality of articles in a distributed manner and for
storing the extracted information in an information store. The invention
identifies a plurality of articles from which information is to be
extracted and a plurality of information extractors for extracting the
information from the articles. Each article is assigned a priority score,
and the articles are ranked from highest to lowest priority, thereby
generating a queue, wherein the priority score for each article is
calculated using a user-configurable priority calculation algorithm. The
plurality of articles is assigned to the plurality of information
extractors based on order in the queue, wherein an article with a higher
rank is presented for information extraction before an article with a
lower rank. Information extracted by the information extractors from the
articles is stored in the information store.
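
The flow described above can be illustrated with a minimal sketch. It is not the claimed implementation: the names build_queue, run_extraction, recency_priority, word_count_extractor, and char_count_extractor are hypothetical, the information store is modeled as an in-memory dictionary, and articles are handed to extractors in simple round-robin order within a single process, whereas a distributed embodiment would dispatch them to separate workers. The priority function is passed in as a parameter to stand in for the user-configurable priority calculation algorithm.

```python
import heapq
from typing import Any, Callable, Dict, List, Tuple

Article = Dict[str, Any]
PriorityFn = Callable[[Article], float]          # user-configurable scoring
Extractor = Callable[[Article], Dict[str, Any]]  # returns extracted fields


def build_queue(articles: List[Article],
                priority_fn: PriorityFn) -> List[Tuple[float, int, Article]]:
    """Rank articles from highest to lowest priority as a heap-backed queue."""
    # heapq is a min-heap, so the score is negated to pop the highest first.
    # The index breaks ties, so the article dicts are never compared.
    heap = [(-priority_fn(a), i, a) for i, a in enumerate(articles)]
    heapq.heapify(heap)
    return heap


def run_extraction(articles: List[Article],
                   extractors: List[Extractor],
                   priority_fn: PriorityFn,
                   store: Dict[str, Dict[str, Any]]) -> List[str]:
    """Assign queued articles to extractors and store the results.

    Returns the article ids in the order they were processed.
    """
    heap = build_queue(articles, priority_fn)
    processed: List[str] = []
    turn = 0
    while heap:
        _, _, article = heapq.heappop(heap)
        # Assumption: round-robin assignment stands in for distribution.
        extractor = extractors[turn % len(extractors)]
        store[article["id"]] = extractor(article)
        processed.append(article["id"])
        turn += 1
    return processed


# Hypothetical priority algorithm: newer articles score higher.
def recency_priority(article: Article) -> float:
    return -float(article["age_days"])


# Hypothetical extractors, each producing one field.
def word_count_extractor(article: Article) -> Dict[str, Any]:
    return {"words": len(article["text"].split())}


def char_count_extractor(article: Article) -> Dict[str, Any]:
    return {"chars": len(article["text"])}


articles = [
    {"id": "a1", "text": "old news item", "age_days": 30},
    {"id": "a2", "text": "fresh breaking story today", "age_days": 1},
    {"id": "a3", "text": "recent update", "age_days": 7},
]
store: Dict[str, Dict[str, Any]] = {}
order = run_extraction(
    articles, [word_count_extractor, char_count_extractor],
    recency_priority, store,
)
```

Swapping recency_priority for another function changes the ranking without touching the queue or the assignment logic, which is the sense in which the priority calculation is configurable in this sketch.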