Accurate and robust eye center localization via fully convolutional networks

Eye center localization is one of the most basic and crucial requirements for human-computer interaction applications such as eye gaze estimation and eye tracking. A large body of work has addressed this topic in recent years, but accuracy still needs to improve because of challenges in appearance: high variability of shapes, lighting conditions, and viewing angles, as well as possible occlusions. To address these limitations, we propose a novel approach to eye center localization based on a fully convolutional network (FCN), an end-to-end, pixels-to-pixels network that can locate the eye center accurately. The key idea is to apply the FCN from the semantic segmentation task to the eye center localization task, since eye center localization can be regarded as a special semantic segmentation problem. We adapt a contemporary FCN into a shallow structure with a large kernel convolutional block and transfer its performance from semantic segmentation to eye center localization by fine-tuning. Extensive experiments show that the proposed method outperforms state-of-the-art methods in both accuracy and reliability of eye center localization. The proposed method achieves a large performance improvement on the most challenging database and thus provides a promising solution for some challenging applications.
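
To illustrate what treating eye center localization as a special semantic segmentation problem can mean in practice, the sketch below turns a per-pixel "eye center" probability map, such as an FCN might output, into a single coordinate. The abstract does not describe how the network output is decoded, so the function name `eye_center_from_mask`, the threshold of 0.5, and the probability-weighted centroid are all assumptions made for illustration, not the authors' method.

```python
import numpy as np

def eye_center_from_mask(prob_map, threshold=0.5):
    """Return (row, col) of the eye center from a per-pixel probability map.

    Hypothetical decoding step (not taken from the paper): pixels at or
    above `threshold` count as eye-center pixels, and the result is their
    probability-weighted centroid.
    """
    prob_map = np.asarray(prob_map, dtype=float)
    mask = prob_map >= threshold
    if not mask.any():
        # No pixel passes the threshold: fall back to the most confident pixel.
        row, col = np.unravel_index(np.argmax(prob_map), prob_map.shape)
        return float(row), float(col)
    rows, cols = np.nonzero(mask)
    weights = prob_map[rows, cols]
    return (float(np.average(rows, weights=weights)),
            float(np.average(cols, weights=weights)))

# Synthetic 5x7 map with a confident 3x3 blob (rows 1-3, columns 3-5).
demo = np.zeros((5, 7))
demo[1:4, 3:6] = 0.9
center = eye_center_from_mask(demo)
```

A centroid gives sub-pixel coordinates and is less sensitive to a single noisy pixel than a plain argmax, which is why it is used here; other decodings are equally plausible.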
