A face recognition system and process for identifying a person depicted in an
input
image and their face pose. This system and process entails locating and extracting
face regions belonging to known people from a set of model images, and determining
the face pose for each of the face regions extracted. All the extracted face regions
are preprocessed by normalizing, cropping, categorizing and finally abstracting
them. More specifically, the images are normalized and cropped to show only a person's
face, categorized according to the face pose of the depicted person's face by assigning
them to one of a series of face pose ranges, and abstracted preferably via an eigenface
approach. The preprocessed face images are preferably used to train a neural network
ensemble having a first stage made up of a bank of face recognition neural networks
each of which is dedicated to a particular pose range, and a second stage constituting
a single fusing neural network that is used to combine the outputs from each of
the first stage neural networks. Once trained, the input of a face region which
has been extracted from an input image and preprocessed (i.e., normalized, cropped
and abstracted) will cause just one of the output units of the fusing portion of
the neural network ensemble to become active. The active output unit indicates
either the identify of the person whose face was extracted from the input image
and the associated face pose, or that the identity of the person is unknown to
the system.