Paper information - Rendering of Eyes for Eye-Shape Registration and Gaze Estimation

Images of the eye are key in several computer vision problems, such as shape registration and gaze estimation. Recent large-scale supervised methods for these problems require time-consuming data collection and manual annotation, which can be unreliable. We propose synthesizing perfectly labelled photo-realistic training data in a fraction of the time. We used computer graphics techniques to build a collection of dynamic eye-region models from head scan geometry. These were randomly posed to synthesize close-up eye images for a wide range of head poses, gaze directions, and illumination conditions. We used our model's controllability to verify the importance of realistic illumination and shape variations in eye-region training data. Finally, we demonstrate the benefits of our synthesized training data (SynthesEyes) by outperforming state-of-the-art methods for eye-shape registration as well as cross-dataset appearance-based gaze estimation in the wild.
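The random posing step described above could be sketched roughly as follows. This is a minimal illustration and not the authors' pipeline: the names `RenderParams`, `sample_params`, and `angles_to_vector`, the uniform sampling, the angle ranges, and the coordinate convention (zero yaw and pitch pointing along the negative z axis) are all assumptions made for this example. The point it shows is that every synthesized image comes with exact labels, because the gaze direction is chosen before rendering rather than annotated afterwards.

```python
import math
import random
from dataclasses import dataclass


@dataclass
class RenderParams:
    """Hypothetical per-image rendering parameters (all angles in degrees)."""
    head_yaw: float
    head_pitch: float
    gaze_yaw: float
    gaze_pitch: float
    light_rotation: float  # assumed: rotation of the lighting environment


def angles_to_vector(yaw_deg, pitch_deg):
    """Convert yaw/pitch to a unit 3D direction.

    Assumed convention: yaw = pitch = 0 maps to (0, 0, -1).
    """
    yaw = math.radians(yaw_deg)
    pitch = math.radians(pitch_deg)
    x = math.cos(pitch) * math.sin(yaw)
    y = math.sin(pitch)
    z = -math.cos(pitch) * math.cos(yaw)
    return (x, y, z)


def sample_params(rng, head_range=40.0, gaze_range=30.0):
    """Draw one random configuration; the ranges are illustrative assumptions."""
    return RenderParams(
        head_yaw=rng.uniform(-head_range, head_range),
        head_pitch=rng.uniform(-head_range, head_range),
        gaze_yaw=rng.uniform(-gaze_range, gaze_range),
        gaze_pitch=rng.uniform(-gaze_range, gaze_range),
        light_rotation=rng.uniform(0.0, 360.0),
    )


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sampled set is reproducible
    for _ in range(3):
        params = sample_params(rng)
        # The gaze label is known exactly because it was chosen, not annotated.
        label = angles_to_vector(params.gaze_yaw, params.gaze_pitch)
        print(params, label)
```

In a real setup, each sampled `RenderParams` would be handed to a renderer together with one of the eye-region models, and the gaze vector stored alongside the resulting image as its label.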
