Neural Network-Based Face Detection

We present a neural network-based upright frontal face detection system. A retinally connected neural network examines small windows of an image and decides whether each window contains a face. The system arbitrates between multiple networks to improve performance over a single network. We present a straightforward procedure for aligning positive face examples for training. To collect negative examples, we use a bootstrap algorithm, which adds false detections into the training set as training progresses. This eliminates the difficult task of manually selecting nonface training examples, which must be chosen to span the entire space of nonface images. Simple heuristics, such as using the fact that faces rarely overlap in images, can further improve the accuracy. Comparisons with several other state-of-the-art face detection systems are presented, showing that our system has comparable performance in terms of detection and false-positive rates.

[1]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[2]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[3]  Takeo Kanade,et al.  Picture Processing System by Computer Complex and Recognition of Human Faces , 1974 .

[4]  Ramesh C. Jain,et al.  Three-dimensional object recognition , 1985, CSUR.

[5]  Editors , 1986, Brain Research Bulletin.

[6]  Charles R. Dyer,et al.  Model-based recognition in robot vision , 1986, CSUR.

[7]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[8]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[9]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[10]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[11]  S. Rose Selective attention , 1992, Nature.

[12]  Michael Gleicher,et al.  Through-the-lens camera control , 1992, SIGGRAPH.

[13]  Pascal Fua,et al.  Computational strategies for object recognition , 1992, CSUR.

[14]  William H. Press,et al.  Numerical recipes in C++: the art of scientific computing, 2nd Edition (C++ ed., print. is corrected to software version 2.10) , 1994 .

[15]  T. Poggio,et al.  3D Object Recognition: Symmetry and Virtual Views , 1992 .

[16]  Harris Drucker,et al.  Boosting Performance in Neural Networks , 1993, Int. J. Pattern Recognit. Artif. Intell..

[17]  Dean A. Pomerleau,et al.  Neural Network Perception for Mobile Robot Guidance , 1993 .

[18]  R. Vaillant,et al.  An original approach for the localization of objects in images , 1993 .

[19]  H. Martin Hunke,et al.  Locating and Tracking of Human Faces with Neural Networks , 1994 .

[20]  R. Vaillant,et al.  Original approach for the localisation of objects in images , 1994 .

[21]  Gilles Burel,et al.  Detection and localization of faces on digital images , 1994, Pattern Recognit. Lett..

[22]  Thomas S. Huang,et al.  Human face detection in a complex background , 1994, Pattern Recognit..

[23]  R. Palmer,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[24]  Shumeet Baluja,et al.  A Method for Integrating Genetic Search Based Function Optimization and Competitive Learning , 1994 .

[25]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Michael C. Burl,et al.  Finding faces in cluttered scenes using random labeled graph matching , 1995, Proceedings of IEEE International Conference on Computer Vision.

[27]  S. Nayar,et al.  A Theory of Pattern Rejection , 1995 .

[28]  M. Smith,et al.  Video Skimming for Quick Browsing based on Audio and Image Characterization , 1995 .

[29]  Alex Pentland,et al.  A subspace method for maximum likelihood target detection , 1995, Proceedings., International Conference on Image Processing.

[30]  David Casasent,et al.  Feature space trajectory neural net classifier: 8-class distortion-invariant tests , 1995, Other Conferences.

[31]  Alex Pentland,et al.  Probabilistic visual learning for object detection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[32]  Kah Kay Sung,et al.  Learning and example selection for object and pattern detection , 1995 .

[33]  Alexander H. Waibel,et al.  A real-time face tracker , 1996, Proceedings Third IEEE Workshop on Applications of Computer Vision. WACV'96.

[34]  Pietro Perona,et al.  Recognition of planar object classes , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[35]  Michael J. Swain,et al.  WebSeer: An Image Search Engine for the World Wide Web , 1996 .

[36]  Sandor Z. Der,et al.  FERET (Face Recognition Technology) Recognition Algorithm Development and Test Results. , 1996 .

[37]  Takeo Kanade,et al.  Intelligent Access to Digital Video: Informedia Project , 1996, Computer.

[38]  Roberto Cipolla,et al.  Scale and Orientation Invariance in Human Face Detection , 1996, BMVC.

[39]  A. Favero,et al.  Italy , 1996, The Lancet.

[40]  James F. Blinn,et al.  Blue screen matting , 1996, SIGGRAPH.

[41]  Ming Zhang,et al.  Face recognition using artificial neural network group-based adaptive tolerance (GAT) trees , 1996, IEEE Trans. Neural Networks.

[42]  David J. Kriegman,et al.  What is the set of images of an object under all possible lighting conditions? , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[43]  Shumeet Baluja,et al.  Expectation-based selective attention , 1996 .

[44]  Hiroshi Murase,et al.  Real-time 100 object recognition system , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[45]  KrügerNorbert,et al.  Face Recognition by Elastic Bunch Graph Matching , 1997 .

[46]  Jeremy S. De Bonet,et al.  Multiresolution sampling procedure for analysis and synthesis of texture images , 1997, SIGGRAPH.

[47]  Roberto Cipolla,et al.  Feature-based human face detection , 1997, Image Vis. Comput..

[48]  Takeo Kanade,et al.  Name-It: association of face and name in video , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[49]  Norbert Krüger,et al.  Face recognition by elastic bunch graph matching , 1997, Proceedings of International Conference on Image Processing.

[50]  Michael A. Smith,et al.  Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[51]  Hyeonjoon Moon,et al.  The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[52]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[53]  Shumeet Baluja Face Detection with In-Plane Rotation: Early Concepts and Preliminary Results , 1997 .

[54]  Tomaso A. Poggio,et al.  A bootstrapping algorithm for learning linear models of object classes , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[55]  Thomas S. Huang,et al.  Face detection with information-based maximum discrimination , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[56]  Larry S. Davis,et al.  An anthropometric shape model for estimating head orientation , 1997 .

[57]  Sun-Yuan Kung,et al.  Face recognition/detection by probabilistic decision-based neural network , 1997, IEEE Trans. Neural Networks.

[58]  Trevor Darrell,et al.  Integrated Person Tracking Using Stereo, Color, and Pattern Detection , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[59]  Thomas Vetter,et al.  Estimating Coloured 3D Face Models from Single Images: An Example Based Approach , 1998, ECCV.

[60]  Stanley T. Birchfield,et al.  Elliptical head tracking using intensity gradients and color histograms , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[61]  Tanzeem Choudhury,et al.  Multimodal person recognition using unconstrained audio and video , 1998 .

[62]  J. C. BurgesChristopher A Tutorial on Support Vector Machines for Pattern Recognition , 1998 .

[63]  Christopher Potts,et al.  Polymorphism and type inference in database programming , 1996, TODS.

[64]  Harry Wechsler,et al.  The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[65]  Rahul Sukthankar,et al.  ARGUS: An Automated Multi-Agent Visitor Identification System , 1999, AAAI/IAAI.

[66]  Amnon Shashua,et al.  The quotient image: Class based recognition and synthesis under varying illumination conditions , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[67]  Hyeonjoon Moon,et al.  The FERET Evaluation Methodology for Face-Recognition Algorithms , 2000, IEEE Trans. Pattern Anal. Mach. Intell..