Input video data is processed to detect a change in a posture of a person
shown in the video data. The change of posture may be the result of an
event, for example, the person falling or getting up. The input video
data may include a plurality of frames. Objects in the frames are tracked
and then classified, for example, as human and non-human targets. At
least one of the position or location of a human target in the frames is
identified. Changes in the location or position of the human target
between the frames is determined. When the change in at least of the
position or location exceeds a predetermined threshold, a falling down
event or a getting up event is detected. The changes in the position or
location of the human target can be determined based on a number of
different factors.