We use a skin color model based on the Muhulanobis metric and a shape analysis based on invariant moments to automatically detect and locate human faces in two-dimensional natural scene images. First, color segmentation of an input image is performed by thresholding in a perceptually plausible hue-saturation color space where the effects of the variability of human skin color and the dependency of chrominance on changes in illumination are reduced. We then group regions of the resulting binary image which have been classified as candidates into clusters of connected pixels. Performing median filtering on the image and discarding the smallest remaining clusters ensures that only a small number of clusters will be used for further analysis. Fully translation-, scale- anti in-plane rotation invariant moments are calculated for each remaining cluster. Finally, in order to distinguish faces from distractors, a multilayer perceptron neural network is used with the invariant moments as the input vector. Supervised learning of the network is implemented with the backpropagation algorithm, at first for frontal views of faces. Preliminary results show the efficiency of the combination of color segmentation and of invariant moments in detecting faces with a large variety of poses and against relatively complex backgrounds.
[1]
F. Luschan,et al.
Völker, Rassen, Sprachen : anthropologische Betrachtungen
,
1927
.
[2]
J. Davenport.
Editor
,
1960
.
[3]
Ming-Kuei Hu,et al.
Visual pattern recognition by moment invariants
,
1962,
IRE Trans. Inf. Theory.
[4]
Yunlong Sheng,et al.
Invariant pattern recognition using Fourier-Mellin transforms and neural networks
,
1991
.
[5]
Yajun Li,et al.
Reforming the theory of invariant moments for pattern recognition
,
1992,
Pattern Recognit..
[6]
Miroslaw Pawlak,et al.
On Image Analysis by Moments
,
1996,
IEEE Trans. Pattern Anal. Mach. Intell..
[7]
A. Murat Tekalp,et al.
Automatic Image Annotation Using Adaptive Color Classification
,
1996,
CVGIP Graph. Model. Image Process..