A method, article of manufacture, and apparatus for monitoring a location
having a plurality of audio sensors and video sensors are disclosed. In
an embodiment, the method comprises receiving auditory data, comparing a
portion of the auditory data to a lexicon comprising a plurality of
keywords to determine whether the portion matches a keyword, and, if a
match is found, selecting at least one video sensor to monitor an area
to be monitored. Video data from the video sensor is archived
with the auditory data and metadata. The video sensor is selected by
determining which video sensors are associated with the area to be
monitored, using a lookup table that records the association. Cartesian coordinates
may be used to determine positions of components and their areas of
coverage.
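
The selection flow described above can be illustrated with a minimal sketch. It is not the disclosed implementation: it assumes the auditory data has already been converted to text, and the keyword set, sensor names, both lookup tables, and the circular coverage model are hypothetical values chosen only for illustration.

```python
from dataclasses import dataclass
from math import hypot

# Hypothetical lexicon of keywords (illustrative values only).
LEXICON = {"help", "fire", "intruder"}

# Hypothetical lookup tables: audio sensor -> area, and area -> video sensors.
MIC_TO_AREA = {"mic-A": "lobby", "mic-B": "dock"}
AREA_TO_CAMERAS = {"lobby": ["cam-1", "cam-2"], "dock": ["cam-3"]}


@dataclass(frozen=True)
class Camera:
    """A video sensor at Cartesian position (x, y).

    A circular coverage area of the given radius is an assumption of
    this sketch, not something the text specifies.
    """
    name: str
    x: float
    y: float
    radius: float


def match_keyword(transcript, lexicon=LEXICON):
    """Return the first word of the transcript found in the lexicon, or None."""
    for word in transcript.lower().split():
        token = word.strip(".,!?;:")
        if token in lexicon:
            return token
    return None


def select_cameras(mic_id, transcript):
    """Return (keyword, video sensors) for the area tied to the audio sensor.

    If no keyword matches, no video sensor is selected.
    """
    keyword = match_keyword(transcript)
    if keyword is None:
        return None, []
    area = MIC_TO_AREA[mic_id]
    return keyword, AREA_TO_CAMERAS.get(area, [])


def cameras_covering(point, cameras):
    """Return names of cameras whose coverage circle contains the point."""
    px, py = point
    return [c.name for c in cameras if hypot(c.x - px, c.y - py) <= c.radius]


def archive_record(mic_id, transcript, keyword, camera_ids):
    """Bundle the references to be archived together with simple metadata."""
    return {
        "audio_sensor": mic_id,
        "auditory_data": transcript,
        "video_sensors": list(camera_ids),
        "metadata": {"keyword": keyword, "area": MIC_TO_AREA[mic_id]},
    }
```

For example, `select_cameras("mic-A", "Somebody HELP, please!")` matches the keyword `help` and returns the two video sensors that the hypothetical table associates with the lobby. `cameras_covering` shows the alternative in which positions and coverage are derived from Cartesian coordinates rather than from a fixed table.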