A method and/or system for tracking objects, such as humans, over a wide
area (that is, over an area that is delineated by a large spatial domain
and/or a long-duration temporal domain) is provided. Such tracking is
facilitated by processing, in real-time, near real-time or otherwise
contemporaneous with receiving, images captured by each of a plurality or
network of slightly overlapping stereo sensors, such as stereo cameras.
The method includes and the apparatus is adapted for obtaining a
plurality of local-track segments, wherein the plurality of local-track
segments correspond to an object captured in images taken by a respective
plurality of stereo sensors; and combining the local-track segments to
form a global track.