The present invention provides mathematical model-based incremental
clustering methods for classifying sets of data and predicting new data
values, based upon the concepts of similarity and cohesion. In order to
increase processing efficiency, these methods employ weighted attribute
relevance in building unbiased classification trees and sum pairing to
reduce the number of nodes visited when performing classification or
prediction operations. In order to increase prediction accuracy, these
methods employ weighted voting over each value of target attributes to
calculate a prediction profile. The present invention also allows an
operator to determine the importance of attributes and reconstitute
classification trees without those attributes deemed unimportant to
further increase classification structure node processing efficiency.