The present invention is directed to a system and method for on-road
vehicle detection. A video sequence comprising a plurality of image
frames is received. A potential vehicle appearance is identified
in an image frame. Known vehicle appearance information and scene
geometry information are used to formulate initial hypotheses about
vehicle appearance. The potential vehicle appearance is tracked over
multiple successive image frames. Potential motion trajectories for the
potential vehicle appearance are identified over the multiple image
frames. Knowledge fusion of appearance, scene geometry and motion
information models is applied to each image frame containing the
trajectories. A confidence score is calculated for each trajectory. A
trajectory with a high confidence score is determined to represent a
vehicle appearance.
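
By way of a minimal illustrative sketch, and not as the claimed implementation, the fusion and scoring steps may be pictured as follows. The sketch assumes that each tracked candidate carries three per-frame scores in (0, 1], one each for appearance, scene geometry and motion, and that the fused confidence of a trajectory is the geometric mean of their per-frame products. The names Observation, trajectory_confidence and select_vehicles, the fusion rule itself, and the 0.5 threshold are all hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Observation:
    """Per-frame scores for one tracked candidate (all hypothetical, in (0, 1])."""
    appearance: float  # agreement with known vehicle appearance
    geometry: float    # agreement with scene geometry (e.g. plausible size/position)
    motion: float      # agreement with an assumed motion model


def trajectory_confidence(track: List[Observation], eps: float = 1e-9) -> float:
    """Fuse the three cues over all frames of a trajectory.

    Assumed rule: geometric mean over frames of the product of the three
    scores, computed in the log domain. Scores are clamped to eps so that
    a zero score does not raise a math domain error.
    """
    if not track:
        return 0.0
    log_sum = 0.0
    for obs in track:
        for score in (obs.appearance, obs.geometry, obs.motion):
            log_sum += math.log(max(score, eps))
    return math.exp(log_sum / len(track))


def select_vehicles(tracks: Dict[str, List[Observation]],
                    threshold: float = 0.5) -> List[str]:
    """Return the names of trajectories whose confidence meets the threshold."""
    return [name for name, track in tracks.items()
            if trajectory_confidence(track) >= threshold]


# Usage: one consistent candidate and one inconsistent candidate.
tracks = {
    "car": [Observation(0.9, 0.9, 0.9)] * 3,
    "clutter": [Observation(0.5, 0.4, 0.3)] * 3,
}
accepted = select_vehicles(tracks)
```

Under these assumptions, a candidate that scores consistently well on all three cues across successive frames retains a high confidence, whereas a candidate that is weak on any cue is penalized multiplicatively and falls below the threshold.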