An information abstracting method and apparatus for extracting and
displaying keywords as an information abstract. Given a large number of
character string data sets divided into prescribed units, the extracted
keywords are significant and effective in describing a topic common to the
plurality of units. The information abstracting apparatus comprises an
input section for accepting an input of character string data divided into
prescribed units, with each individual character represented by a
character code, and an output section for displaying the result of
information abstracting. Keywords contained in each of the prescribed
units are extracted by a keyword extracting section from the character
string input data from the input section. A score is calculated for each
keyword by a score calculating section, so that a higher score is given to
a keyword extracted from a larger number of units. On the basis of the
calculated scores, keywords are selected by an abstracting section and are
outputted as an information abstract by the output section.