GBoost : A Generative Framework for Boosting with Applications to Real-Time Eye Coding ?

Recent imaging studies show that the human brain has structures specialized for the detection of eyes and the recognition of eye-related behavior. There is some evidence that such systems may be innate and play an important role in infant social development. The development of machine perception systems that detect eyes and analyze eye behavior will also enable new approaches to human-computer interaction that emphasize natural, face-to-face communication with the user. For such systems to have an impact in everyday life it is important for them to work robustly in natural, unconstrained conditions. We formulate a probabilistic model of image generation and derive optimal inference algorithms for finding eyes within this framework. The approach models the image as a collage of patches of arbitrary size, some of which contain the object of interest and some of which are background. The approach requires development of likelihood-ratio models for object versus background generated patches. These models are learned using boosting methods. One advantage of the generative approach proposed here is that it makes explicit the conditions under which the approach is optimal, thus facilitating progress towards methods that model the image generation process in more realistic ways. The approach proposed here searches the entire image plane in each frame, making it resistant to fast, unpredictable motion. The system is robust to changes in lighting, illumination, and differences in facial structure, including facial expressions and eyeglasses. Furthermore, the system can simultaneously track the eyes and blinks of multiple individuals. We also present pilot results using this system for analysis of eye-openness in EEG studies. Finally we reflect on how the development of perceptive systems like this may help advance our understanding of the human brain.

[1]  M. Holland,et al.  Blinking and Mental Load , 1972, Psychological reports.

[2]  P. Ekman,et al.  Facial action coding system: a technique for the measurement of facial movement , 1978 .

[3]  C N Karson Physiology of normal and abnormal blinking. , 1988, Advances in neurology.

[4]  N. Logothetis,et al.  Viewer-Centered Object Recognition in Monkeys , 1994 .

[5]  Michael C. Burl,et al.  Finding faces in cluttered scenes using random labeled graph matching , 1995, Proceedings of IEEE International Conference on Computer Vision.

[6]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[7]  Ravi Kothari,et al.  Detection of eye locations in unconstrained visual images , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[8]  Norbert Krüger,et al.  Face recognition by elastic bunch graph matching , 1997, Proceedings of International Conference on Image Processing.

[9]  Tzyy-Ping Jung,et al.  Extended ICA Removes Artifacts from Electroencephalographic Recordings , 1997, NIPS.

[10]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  M. Farah,et al.  What is "special" about face perception? , 1998, Psychological review.

[12]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[13]  K. Nakamura,et al.  The human amygdala plays an important role in gaze monitoring. A PET study. , 1999, Brain : a journal of neurology.

[14]  Harry Wechsler,et al.  Eye Detection Using Optimal Wavelet Packets and Radial Basis Functions (RBFs) , 1999, Int. J. Pattern Recognit. Artif. Intell..

[15]  Irfan A. Essa,et al.  Detecting and tracking eyes by using their physiological properties, dynamics, and appearance , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[16]  Garrison W. Cottrell,et al.  Is All Face Processing Holistic? The View from UCSD , 2000 .

[17]  Robert Frischholz,et al.  BioID: A Multimodal Biometric Identification System , 2000, Computer.

[18]  J. Friedman Special Invited Paper-Additive logistic regression: A statistical view of boosting , 2000 .

[19]  Javier R. Movellan,et al.  Probabilistic functionalism: A unifying paradigm for the cognitive sciences , 2001 .

[20]  Klaus J. Kirchberg,et al.  Robust Face Detection Using the Hausdorff Distance , 2001, AVBPA.

[21]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[22]  Mark H. Johnson The development and neural basis of face recognition: comment and speculation , 2001 .

[23]  Jon Driver,et al.  Seen Gaze-Direction Modulates Fusiform Activity and Its Coupling with Other Brain Areas during Face Processing , 2001, NeuroImage.

[24]  Qiang Ji,et al.  Real Time Visual Cues Extraction for Monitoring Driver Vigilance , 2001, ICVS.

[25]  Jing Xiao,et al.  Automatic recognition of eye blinking in spontaneously occurring behavior , 2002, Object recognition supported by user interaction for service robots.

[26]  Marian Stewart Bartlett,et al.  A comparison of Gabor filter methods for automatic detection of facial landmarks , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[27]  Mark H. Johnson,et al.  Eye contact detection in humans from birth , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Gwen Littlewort,et al.  An approach to automatic recognition of spontaneous facial actions , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[29]  Paul A. Viola,et al.  A unified learning framework for real time face detection and classification , 2002, Proceedings of Fifth IEEE International Conference on Automatic Face Gesture Recognition.

[30]  Qiang Ji,et al.  Real-Time Eye, Gaze, and Face Pose Tracking for Monitoring Driver Vigilance , 2002, Real Time Imaging.