A first distribution of cases over a first group of categories is
received. A categorizer trained using a search-and-confirm technique
classifies the cases into a second group of categories. A second
distribution of the cases over the second group of categories is
generated using results of the classifying. The first and second
distributions are compared to identify differences between them.
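
The comparison step can be illustrated with a minimal sketch. Everything
named here is an assumption made for illustration and is not taken from
the source: the helper names (distribution, compare_distributions,
keyword_categorizer), the 0.05 threshold, and the sample categories. A
simple keyword rule stands in for the trained categorizer, and the sketch
assumes the two groups of categories share names so that their shares can
be compared directly.

```python
from collections import Counter


def distribution(labels):
    """Return each category's share of the cases as a fraction of the total."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: n / total for category, n in counts.items()}


def compare_distributions(first, second, threshold=0.05):
    """Return categories whose share changed by more than `threshold`.

    The value for each category is (second share - first share). A category
    missing from one distribution is treated as having a share of 0.
    """
    differences = {}
    for category in sorted(set(first) | set(second)):
        delta = second.get(category, 0.0) - first.get(category, 0.0)
        if abs(delta) > threshold:
            differences[category] = delta
    return differences


def keyword_categorizer(case_text):
    """Hypothetical stand-in for the trained categorizer (not the real one)."""
    text = case_text.lower()
    if "battery" in text:
        return "battery"
    if "screen" in text:
        return "screen"
    return "other"


# First distribution: received from elsewhere (illustrative values).
first = {"battery": 0.5, "screen": 0.5}

# Second distribution: generated by classifying the cases.
cases = [
    "Battery dies fast",
    "Screen cracked",
    "Battery swollen",
    "Battery will not charge",
]
second = distribution(keyword_categorizer(c) for c in cases)

differences = compare_distributions(first, second)
```

In this sketch a positive value means the categorizer assigned a larger
share of cases to the category than the received distribution reported,
and a negative value means a smaller share.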