A computerized method of representing a dataset with a taxonomy includes
augmenting a dataset containing a plurality of records with a plurality
of predetermined exemplars; representing the plurality of records and
predetermined exemplars within the augmented dataset as a plurality of
clusters in an initial taxonomy layer; generating a truncated hierarchy
of cluster sets based on clusters within the initial taxonomy layer,
wherein clusters within the truncated hierarchy contain no more than a
predetermined number of exemplars; and labeling clusters within the
truncated hierarchy.