The present invention relates to an apparatus and method for recognizing
biological named entities in biological literature based on the Unified
Medical Language System (UMLS). The apparatus and the method receive the
Metathesaurus from the UMLS and construct a concept name database, a
single name database, and a category keyterm database, which are the
language resources used to recognize named entities. They then receive
each concept name stored in the concept name database, extract features
of each concept name by using the data stored in the single name database
and the category keyterm database, and construct a rule database by
creating rules for recognizing named entities and filtering those rules
by using the extracted features. Finally, they receive biological
literature, extract the nouns and noun phrases that are candidate named
entities, apply the rules stored in the rule database to them, and recognize
the named entities. According to the present invention, biological named
entities, which can serve as important units of information in the input
literature, can be extracted effectively.
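
The data flow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the claimed implementation: the concept list is a hypothetical stand-in for names and categories loaded from the UMLS Metathesaurus, the "feature" is assumed to be the last word (head) of each concept name, the rule filter is an assumed minimum-support check, and candidate noun phrases are passed in directly rather than produced by a tagger or chunker. All function and variable names are invented for this example.

```python
from collections import Counter, defaultdict

# Hypothetical stand-in for (concept name, category) pairs from the Metathesaurus.
CONCEPTS = [
    ("insulin receptor", "protein"),
    ("androgen receptor", "protein"),
    ("tumor necrosis factor", "protein"),
    ("breast cancer", "disease"),
    ("lung cancer", "disease"),
    ("cancer cell", "cell"),
]


def build_resources(concepts):
    """Build the three language resources named in the abstract."""
    concept_db = dict(concepts)
    # Assumption: the single name database holds every individual word
    # that occurs in some concept name.
    single_db = {word for name in concept_db for word in name.split()}
    # Assumption: a category keyterm is the head (last word) of a concept
    # name, counted per category.
    keyterm_db = defaultdict(Counter)
    for name, category in concept_db.items():
        keyterm_db[name.split()[-1]][category] += 1
    return concept_db, single_db, keyterm_db


def build_rules(concept_db, keyterm_db, min_support=2):
    """Create head-word rules and keep only well-supported, unambiguous ones."""
    rules = {}
    for name, category in concept_db.items():
        head = name.split()[-1]
        counts = keyterm_db[head]
        # Filter: the head must occur often enough and map to one category.
        if counts[category] >= min_support and len(counts) == 1:
            rules[head] = category
    return rules


def recognize(candidates, concept_db, single_db, rules):
    """Apply exact lookup, then head-word rules, to candidate noun phrases."""
    found = []
    for phrase in candidates:
        phrase = phrase.lower()
        words = phrase.split()
        # Cheap prefilter: skip phrases sharing no word with any concept name.
        if not words or not any(w in single_db for w in words):
            continue
        if phrase in concept_db:
            found.append((phrase, concept_db[phrase]))
        elif words[-1] in rules:
            found.append((phrase, rules[words[-1]]))
    return found


if __name__ == "__main__":
    concept_db, single_db, keyterm_db = build_resources(CONCEPTS)
    rules = build_rules(concept_db, keyterm_db)
    phrases = ["estrogen receptor", "prostate cancer", "the experiment"]
    print(recognize(phrases, concept_db, single_db, rules))
```

In this sketch, a phrase that is absent from the concept name database, such as "estrogen receptor", can still be labeled because its head word matches a rule learned from the known concept names, which is the benefit of deriving rules rather than relying on dictionary lookup alone.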