The present invention relates generally to the classification of items
into categories, and more particularly, to the automatic selection of
different classifiers at different places within a hierarchy of
categories. An exemplary hierarchical categorization method uses a hybrid
of different classification technologies, with training-data-based
machine-learning classifiers preferably being used in those portions of
the hierarchy that lie above a dynamically defined boundary and for which
adequate training data is available, and with a priori classification
rules not requiring any such training data being used below that boundary, thereby
providing a novel hybrid categorization technology that is capable of
leveraging the strengths of its components. In particular, it enables the
use of human-authored rules in those finely divided portions towards the
bottom of the hierarchy involving relatively close decisions for which it
is not practical to create in advance sufficient training data to ensure
accurate classification by known machine-learning algorithms, while still
facilitating an eventual changeover within the hierarchy to
machine-learning algorithms as sufficient training data becomes available to
ensure acceptable performance in a particular sub-portion of the
hierarchy.
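
By way of non-limiting illustration, the hybrid selection described above
might be sketched as follows. The sketch is an assumption-laden example and
not the claimed implementation: the names Node, MIN_EXAMPLES, and categorize
are hypothetical, the word-overlap scorer merely stands in for any known
machine-learning classifier, and the test for "adequate" training data (a
fixed minimum number of examples per child category) is an assumed criterion
that the text above does not specify.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

# Assumed criterion for "adequate" training data (hypothetical value).
MIN_EXAMPLES = 3


@dataclass
class Node:
    name: str
    children: List["Node"] = field(default_factory=list)
    # Hypothetical human-authored rule: maps an item to a child name or None.
    rule: Optional[Callable[[str], Optional[str]]] = None
    # Labeled training items gathered so far, keyed by child category name.
    examples: Dict[str, List[str]] = field(default_factory=dict)

    def has_adequate_data(self) -> bool:
        # True when every child category has at least MIN_EXAMPLES examples.
        return bool(self.children) and all(
            len(self.examples.get(c.name, [])) >= MIN_EXAMPLES
            for c in self.children
        )

    def learned_choice(self, item: str) -> str:
        # Stand-in learner: choose the child whose examples share the most
        # words with the item. Any trained classifier could be used instead.
        words = set(item.lower().split())

        def score(child_name: str) -> int:
            return sum(
                len(words & set(e.lower().split()))
                for e in self.examples[child_name]
            )

        return max((c.name for c in self.children), key=score)

    def choose(self, item: str) -> Tuple[str, Optional[str]]:
        # Prefer the learned classifier where data is adequate; otherwise
        # fall back to the a priori rule for this node, if one exists.
        if self.has_adequate_data():
            return "learned", self.learned_choice(item)
        if self.rule is not None:
            return "rule", self.rule(item)
        return "none", None


def categorize(root: Node, item: str) -> List[Tuple[str, str]]:
    # Walk down the hierarchy, recording which kind of classifier made
    # each decision. The boundary emerges from per-node data availability.
    path: List[Tuple[str, str]] = []
    node = root
    while node.children:
        method, child_name = node.choose(item)
        child = next((c for c in node.children if c.name == child_name), None)
        if child is None:
            break
        path.append((child.name, method))
        node = child
    return path


# Hypothetical two-level hierarchy: ample data at the top, none lower down.
printers = Node(
    "printers",
    children=[Node("inkjet"), Node("laser")],
    rule=lambda item: "laser" if "laser" in item.lower() else "inkjet",
)
root = Node(
    "root",
    children=[printers, Node("cameras")],
    examples={
        "printers": ["office printer paper", "printer ink cartridge",
                     "network printer setup"],
        "cameras": ["digital camera lens", "camera tripod mount",
                    "zoom lens camera"],
    },
)

item = "color laser printer toner"
before = categorize(root, item)

# Training data later becomes available for the lower-level decision.
printers.examples = {
    "inkjet": ["inkjet ink cartridge", "photo inkjet printer",
               "inkjet nozzle cleaning"],
    "laser": ["laser toner drum", "mono laser printer",
              "laser printer toner"],
}
after = categorize(root, item)
```

In this sketch the first call decides the top-level category with the learned
scorer and the lower-level category with the human-authored rule; once
examples are added for the lower node, the same call changes over to the
learned scorer at that node without any change to the rule itself.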