A method for detecting and recognizing at least one traffic sign is
disclosed. A video sequence having a plurality of image frames is
received. One or more filters are used to measure features in at least
one image frame indicative of an object of interest. The measured
features are combined and aggregated into a score indicating possible
presence of an object. The scores are fused over multiple image frames
for a robust detection. If a score indicates possible presence of an
object in an area of the image frame, the area is aligned with a model. A
determination is then made as to whether the area indicates a traffic
sign. If the area indicates a traffic sign, the area is classified into a
particular type of traffic sign. The present invention is also directed
to training a system to detect and recognize traffic signs.