Methods and systems for clustering document collections are disclosed. A
system for clustering observations may include a processor and a
processor-readable storage medium. The processor-readable storage medium
may contain one or more programming instructions for performing a method
of clustering observations. A plurality of parameter vectors and a
plurality of observations may be received. A distribution may also be
determined. An optimal partitioning of the observations may then be
selected based on the distribution, the parameter vectors and a
likelihood function.