The invention provides systems, methods, and computer programs to improve
the accuracy and efficiency with which data analysts can use news
stories, press releases, and other sources of information to maintain
databases that contain information about individuals and businesses and
other organizations. Documents containing material information are
acquired in computer-readable form and optionally may then be reduced to
raw text. One or more computerized systems process the text and tag
important terms such as proper nouns, job titles, awards, and other terms
indicating professional, educational, corporate, or other developments.
The invention provides a user interface with which a data analyst can
review, confirm, remove, modify, introduce, and link the tags, ultimately
adding the information and links to a database and storing the source
document in an electronic warehouse for future retrieval.