A set of data is received containing values associated with respective
data points, the values associated with each of the data points being
characterized by a distribution. The values for each of the data points
are expressed in a form that includes information about the distribution of
those values. The distribution information is
used in clustering the set of data with at least one other set of data
containing values associated with data points.
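
The approach described above can be illustrated with a minimal sketch. It is not the claimed method: it assumes each data point's values can be summarized as a mean and a variance (a Gaussian assumption the text does not state), and it compares sets of data with a symmetric Kullback-Leibler divergence, which is one possible distribution-aware measure among many. The names `summarize`, `sym_kl`, `set_distance`, `cluster`, and `threshold`, as well as the sample values, are hypothetical and chosen only for illustration.

```python
from math import log
from statistics import mean, pvariance


def summarize(raw_set):
    """Express each data point's values as (mean, variance).

    Assumption: a mean/variance pair is an adequate form of the
    distribution information for each data point.
    """
    return [(mean(values), pvariance(values)) for values in raw_set]


def sym_kl(p, q, eps=1e-9):
    """Symmetric KL divergence between two 1-D Gaussians given as (mean, var)."""
    (m1, v1), (m2, v2) = p, q
    v1 += eps  # guard against zero variance
    v2 += eps
    d2 = (m1 - m2) ** 2
    kl_pq = 0.5 * (log(v2 / v1) + (v1 + d2) / v2 - 1)
    kl_qp = 0.5 * (log(v1 / v2) + (v2 + d2) / v1 - 1)
    return kl_pq + kl_qp


def set_distance(a, b):
    """Sum the per-data-point divergences between two summarized sets."""
    return sum(sym_kl(p, q) for p, q in zip(a, b))


def cluster(summaries, threshold):
    """Group sets whose pairwise distance is below a (hypothetical) threshold."""
    names = list(summaries)
    parent = {n: n for n in names}

    def find(n):
        while parent[n] != n:
            n = parent[n]
        return n

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if set_distance(summaries[a], summaries[b]) < threshold:
                parent[find(b)] = find(a)

    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return list(groups.values())


# Illustrative data: three sets, each with two data points,
# each data point carrying several observed values.
raw_sets = {
    "A": [[1.0, 1.2, 0.8], [5.0, 5.1, 4.9]],
    "B": [[1.1, 0.9, 1.0], [5.2, 4.8, 5.0]],
    "C": [[9.0, 9.5, 8.5], [1.0, 1.1, 0.9]],
}
summaries = {name: summarize(raw) for name, raw in raw_sets.items()}
clusters = cluster(summaries, threshold=10.0)
print(clusters)
```

In this sketch the variance enters the distance directly, so two sets with similar means but very different spreads are pushed apart, while a difference in means counts for less when the values are widely spread. Any other representation of the distribution (for example quantiles or a histogram) could be substituted in `summarize` with a matching distance function.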