A method for representing a dataset comprises clustering the dataset using
an unsupervised, non-parametric clustering method to generate a set of
clusters each comprising a set of data points in an image; clustering the
data points of each cluster using a supervised, partitional clustering
method to partition each cluster into a specified number of sub-clusters;
generating, for each sub-cluster, a density estimate value at each grid
point of a set of grid points sampled from the image at a specified
resolution, using a kernel density function; identifying, for each grid
point, a maximum density estimate value and the sub-cluster associated
with that maximum density estimate value; adding each grid point for
which the maximum density estimate value exceeds a specified threshold to
the sub-cluster associated with the maximum density estimate value; and,
for each cluster, merging the sub-clusters of the cluster into a
corresponding cluster region in the image.
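
The grid-point assignment and merging steps described above can be sketched roughly as follows. This is an illustrative sketch only, not the claimed implementation: it assumes the two clustering stages have already produced the sub-clusters, and it assumes an isotropic Gaussian kernel with a fixed bandwidth, since the abstract does not name a kernel. The names `gaussian_kde`, `build_cluster_regions`, `bandwidth`, `step`, and the `(cluster_id, sub_id)` keying are hypothetical.

```python
import math
from collections import defaultdict


def gaussian_kde(grid_point, points, bandwidth):
    """Density at grid_point: mean of isotropic 2-D Gaussian kernels on points.

    The Gaussian kernel and the fixed bandwidth are assumptions of this sketch.
    """
    gx, gy = grid_point
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2)
    total = 0.0
    for px, py in points:
        d2 = (gx - px) ** 2 + (gy - py) ** 2
        total += math.exp(-d2 / (2.0 * bandwidth ** 2))
    return norm * total / len(points)


def build_cluster_regions(sub_clusters, width, height, step, bandwidth, threshold):
    """Assign grid points to sub-clusters, then merge sub-clusters per cluster.

    sub_clusters maps (cluster_id, sub_id) -> non-empty list of (x, y) points.
    Returns a dict mapping cluster_id -> set of (x, y) grid points.
    """
    sub_regions = {key: set() for key in sub_clusters}
    # Sample grid points from the image at the given resolution (step).
    for gx in range(0, width, step):
        for gy in range(0, height, step):
            # Find the sub-cluster with the maximum density at this grid point.
            best_key, best_density = None, -1.0
            for key, points in sub_clusters.items():
                density = gaussian_kde((gx, gy), points, bandwidth)
                if density > best_density:
                    best_key, best_density = key, density
            # Keep the grid point only if the maximum exceeds the threshold.
            if best_density > threshold:
                sub_regions[best_key].add((gx, gy))
    # Merge the sub-clusters of each cluster into one cluster region.
    regions = defaultdict(set)
    for (cluster_id, _sub_id), grid_points in sub_regions.items():
        regions[cluster_id] |= grid_points
    return dict(regions)


if __name__ == "__main__":
    # Hypothetical toy input: cluster 0 has two sub-clusters, cluster 1 has one.
    toy = {
        (0, 0): [(2.0, 2.0), (3.0, 2.0)],
        (0, 1): [(5.0, 2.0), (6.0, 3.0)],
        (1, 0): [(15.0, 15.0), (16.0, 16.0)],
    }
    regions = build_cluster_regions(toy, 20, 20, step=1, bandwidth=1.0, threshold=0.01)
    for cluster_id, grid_points in sorted(regions.items()):
        print(cluster_id, len(grid_points))
```

Because each grid point is added to at most one sub-cluster, the merged cluster regions in this sketch cannot overlap, and grid points far from every sub-cluster fall below the threshold and are left unassigned.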