Appearance-based gaze estimation using deep features and random forest regression

Conventional appearance-based gaze estimation methods employ local or global features as eye gaze appearance descriptor. But these methods don't work well under natural light with free head movement. To solve this problem, we present an appearance-based gaze estimation method using deep feature representation and feature forest regression. The deep feature is learned through hierarchical extraction of deep Convolutional Neural Network (CNN). And random forest regression with cluster-to-classify node splitting rules is used to take advantage of data distribution in sparse feature space. Experimental results demonstrate that the deep feature has a better performance than local features on calibrated gaze regression. The combination of deep features and random forest regression provides an effective solution for gaze estimation in a natural environment.

[1]  Edwige Pissaloux,et al.  Gaze estimation using local features and non-linear regression , 2012, 2012 19th IEEE International Conference on Image Processing.

[2]  Yoichi Sato,et al.  Learning-by-Synthesis for Appearance-Based 3D Gaze Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Takahiro Okabe,et al.  Adaptive Linear Regression for Appearance-Based Gaze Estimation , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Xin Geng,et al.  Head Pose Estimation Based on Multivariate Label Distribution , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Qingqi Long,et al.  A flow-based three-dimensional collaborative decision-making model for supply-chain networks , 2016, Knowl. Based Syst..

[6]  Razvan Pascanu,et al.  Theano: new features and speed improvements , 2012, ArXiv.

[7]  Dongkyung Nam,et al.  Hierarchical gaze estimation based on adaptive feature learning , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[8]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Fan Min,et al.  Three-way recommender systems based on random forests , 2016, Knowl. Based Syst..

[11]  Hongbo Liu,et al.  A Real-Time Video-based Eye Tracking Approach for Driver Attention Study , 2012, Comput. Informatics.

[12]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[13]  Jean-Marc Odobez,et al.  Person independent 3D gaze estimation from remote RGB-D cameras , 2013, 2013 IEEE International Conference on Image Processing.

[14]  Jean-Marc Odobez,et al.  Gaze estimation from multimodal Kinect data , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[15]  Timo Schneider,et al.  Manifold Alignment for Person Independent Appearance-Based Gaze Estimation , 2014, 2014 22nd International Conference on Pattern Recognition.

[16]  Yi-Ping Hung,et al.  Appearance-Based Gaze Tracking with Free Head Movement , 2014, 2014 22nd International Conference on Pattern Recognition.

[17]  Mario Fritz,et al.  Appearance-based gaze estimation in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Luc Van Gool,et al.  Real time head pose estimation with random regression forests , 2011, CVPR 2011.

[19]  Qiang Ji,et al.  In the Eye of the Beholder: A Survey of Models for Eyes and Gaze , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Sean Hughes,et al.  Clustering by Fast Search and Find of Density Peaks , 2016 .

[21]  Rama Chellappa,et al.  Growing Regression Forests by Classification: Applications to Object Pose Estimation , 2013, ECCV.

[22]  Luc Van Gool,et al.  Real-time facial feature detection using conditional regression forests , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[23]  Xiaowu Chen,et al.  Person-independent eye gaze prediction from eye images using patch-based features , 2016, Neurocomputing.

[24]  Xiaoqing Ding,et al.  Person-independent head pose estimation based on random forest regression , 2010, 2010 IEEE International Conference on Image Processing.

[25]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[26]  Fernando De la Torre,et al.  Supervised Descent Method and Its Applications to Face Alignment , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Tao Li,et al.  Cost-sensitive feature selection using random forest: Selecting low-cost subsets of informative features , 2016, Knowl. Based Syst..

[28]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[29]  Cristina Conati,et al.  Eye-tracking for user modeling in exploratory learning environments: An empirical evaluation , 2007, Knowl. Based Syst..

[30]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[31]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.