Eye Finding via Face Detection for a Foveated Active Vision System

Eye finding is the first step toward building a machine that can recognize social cues, like eye contact and gaze direction, in a natural context. In this paper, we present a real-time implementation of an eye finding algorithm for a foveated active vision system. The system uses a motion-based prefilter to identify potential face locations. These locations are analyzed for faces with a template-based algorithm developed by Sinha (1996). Detected faces are tracked in real time, and the active vision system saccades to the face using a learned sensorimotor mapping. Once gaze has been centered on the face, a high-resolution image of the eye can be captured from the foveal camera using a self-calibrated peripheral-ta-foveal mapping.We also present a performance analysis of Sinha's ratio template algorithm on a standard set of static face images. Although this algorithm performs relatively poorly on static images, this result is a poor indicator of real-time performance of the behaving system. We find that our system finds eyes in 94% of a set of behavioral trials. We suggest that alternate means of evaluating behavioral systems are necessary.

[1]  T. Nummenmaa The language of the face , 1964 .

[2]  J. Bruner,et al.  The capacity for joint visual attention in the infant , 1975, Nature.

[3]  J. Bruner,et al.  The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[4]  S. Thayer Children's Detection of On-Face and Off-Face Gazes. , 1977 .

[5]  Mohammed Ismail,et al.  Analog VLSI Implementation of Neural Systems , 2011, The Kluwer International Series in Engineering and Computer Science.

[6]  Giulio Sandini,et al.  A Foveated Retina-Like Sensor Using CCD Technology , 1989, Analog VLSI Implementation of Neural Systems.

[7]  V. Rich Personal communication , 1989, Nature.

[8]  Dana H. Ballard,et al.  Behavioural constraints on animate vision , 1989, Image Vis. Comput..

[9]  U. Frith Autism: Explaining the Enigma , 2005 .

[10]  N. Badler,et al.  Linguistic Issues in Facial Animation , 1991 .

[11]  Demetri Terzopoulos,et al.  Techniques for Realistic Facial Modeling and Animation , 1991 .

[12]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[13]  Rodney A. Brooks,et al.  Intelligence Without Reason , 1991, IJCAI.

[14]  D. Coombs Real-Time Gaze Holding in Binocular Robot Vision , 1992 .

[15]  David W. Murray,et al.  A modular head/eye platform for real-time reactive vision Mechatronics , 1993 .

[16]  Shumeet Baluja,et al.  Non-Intrusive Gaze Tracking Using Artificial Neural Networks , 1993, NIPS.

[17]  Rodney A. Brooks,et al.  Building brains for bodies , 1995, Auton. Robots.

[18]  D. Povinelli,et al.  Theory of mind: evolutionary history of a cognitive specialization , 1995, Trends in Neurosciences.

[19]  Pawan Sinha,et al.  Perceiving and recognizing three-dimensional forms , 1996 .

[20]  Takeo Kanade,et al.  Human Face Detection in Visual Scenes , 1995, NIPS.

[21]  Yasuo Kuniyoshi,et al.  A foveated wide angle lens for active vision , 1995, Proceedings of 1995 IEEE International Conference on Robotics and Automation.

[22]  Pattie Maes,et al.  Self-Taught Visually-Guided Pointing for a Humanoid Robot , 1996 .

[23]  Christoph von der Malsburg,et al.  Tracking and learning graphs and pose on image sequences of faces , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[24]  Alexander Zelinsky,et al.  Robust Real-Time Face Tracking and Gesture Recognition , 1997, IJCAI.

[25]  Jo Liska Cognitive ethology: The minds of other animals , 1997 .

[26]  Tomaso A. Poggio,et al.  Example-Based Learning for View-Based Human Face Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[27]  Brian Scassellati,et al.  Alternative Essences of Intelligence , 1998, AAAI/IAAI.

[28]  Brian Scassellati Mechanisms of Shared Attention for a Humanoid Robot , 1998 .

[29]  Brian Scassellati A Binocular, Foveated Active Vision System , 1998 .