The present invention is directed to a computer automated method of
selectively identifying a user specified behavior of a crowd. The method
comprises receiving video data but can also include audio data and sensor
data. The video data contains images a crowd. The video data is processed
to extract hierarchical human and crowd features. The detected crowd
features are processed to detect a selectable crowd behavior. The
selected crowd behavior detected is specified by a configurable behavior
rule. Human detection is provided by a hybrid human detector algorithm
which can include Adaboost or convolutional neural network. Crowd
features are detected using textual analysis techniques. The configurable
crowd behavior for detection can be defined by crowd behavioral language.