A method detects events in multimedia. Features are extracted from the
multimedia. The features are sampled using a sliding window to obtain
samples. A context model is constructed for each sample. An affinity
matrix is determined from the models and a commutative distance metric
between each pair of context models. A second generation eigenvector is
determined for the affinity matrix, and the samples are then clustered
into events according to the second generation eigenvector.