The present invention relates to an example-based concept-orietned data
extraction method. In an example labeling phase, the exemplary data
string is converted into an exemplary token sequence, in which the target
concepts and filler concepts are labeled to be tuples for use as an
example, and thus an exemplary concept graph is constructed. In the data
extraction phase, the untested data string is converted into an untested
token sequence to be processed, and, based on the associated concept
recognizers defined by the tuples in the example labeling phase, it is
able to detect the concept candidates and establish the composite
concepts and aggregate concepts, thereby constructing a hypothetical
concept graph. After comparing the exemplary concept graph with the
hypothetical concept graph, the optimal hypothetical concept sequence in
the hypothetical graph is determined, so as to extract the targeted data
from the matched target concepts.