Energy Efficient Pupil Tracking Based on Rule Distillation of Cascade Regression Forest

As the demand for human-friendly computing increases, research on pupil tracking to facilitate human–machine interactions (HCIs) is being actively conducted. Several successful pupil tracking approaches have been developed using images and a deep neural network (DNN). However, common DNN-based methods not only require tremendous computing power and energy consumption for learning and prediction; they also have a demerit in that an interpretation is impossible because a black-box model with an unknown prediction process is applied. In this study, we propose a lightweight pupil tracking algorithm for on-device machine learning (ML) using a fast and accurate cascade deep regression forest (RF) instead of a DNN. Pupil estimation is applied in a coarse-to-fine manner in a layer-by-layer RF structure, and each RF is simplified using the proposed rule distillation algorithm for removing unimportant rules constituting the RF. The goal of the proposed algorithm is to produce a more transparent and adoptable model for application to on-device ML systems, while maintaining a precise pupil tracking performance. Our proposed method experimentally achieves an outstanding speed, a reduction in the number of parameters, and a better pupil tracking performance compared to several other state-of-the-art methods using only a CPU.

[1]  Siva Krishna Dasari,et al.  Predictive modelling to support sensitivity analysis for robust design in aerospace engineering , 2020, Structural and Multidisciplinary Optimization.

[2]  Jan Kautz,et al.  Few-Shot Adaptive Gaze Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  Ulrike Thomas,et al.  Real Time Eye Gaze Tracking System using CNN-based Facial Features for Human Attention Measurement , 2019, VISIGRAPP.

[4]  Francisco Herrera,et al.  Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI , 2020, Inf. Fusion.

[5]  Davis E. King,et al.  Dlib-ml: A Machine Learning Toolkit , 2009, J. Mach. Learn. Res..

[6]  David W. Aha,et al.  DARPA's Explainable Artificial Intelligence (XAI) Program , 2019, AI Mag..

[7]  Thiago Santini,et al.  PuRe: Robust pupil detection for real-time pervasive eye tracking , 2017, Comput. Vis. Image Underst..

[8]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[9]  Noha Younis Ahmed,et al.  Real-time accurate eye center localization for low-resolution grayscale images , 2020, Journal of Real-Time Image Processing.

[10]  Qiang Ji,et al.  Cascade learning from adversarial synthetic images for accurate pupil detection , 2019, Pattern Recognit..

[11]  Pingmei Xu,et al.  On-Device Few-Shot Personalization for Real-Time Gaze Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[12]  Rafael Cabeza,et al.  Hybrid method based on topography for robust detection of iris center and eye corners , 2013, TOMCCAP.

[13]  Aurobinda Routray,et al.  Fast and accurate algorithm for eye localisation for gaze tracking in low-resolution images , 2016, IET Comput. Vis..

[14]  Lalana Kagal,et al.  Explaining Explanations: An Overview of Interpretability of Machine Learning , 2018, 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA).

[15]  Bin Li,et al.  Eye centre localisation with hybrid regression framework , 2018, Electronics Letters.

[16]  Yongming Han,et al.  Novel leakage detection and water loss management of urban water supply network using multiscale neural networks , 2021 .

[17]  Faruk Bulut,et al.  A Low-Cost Pupil Center Localization Algorithm Based on Maximized Integral Voting of Circular Hollow Kernels , 2018, Comput. J..

[18]  Klaus J. Kirchberg,et al.  Robust Face Detection Using the Hausdorff Distance , 2001, AVBPA.

[19]  Fei-Yue Wang,et al.  Accurate and robust eye center localization via fully convolutional networks , 2019, IEEE/CAA Journal of Automatica Sinica.

[20]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[21]  Quoc V. Le,et al.  Searching for MobileNetV3 , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Cynthia Rudin,et al.  Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model , 2015, ArXiv.

[23]  Wenhao Zhang,et al.  Eye center localization and gaze gesture recognition for human-computer interaction. , 2016, Journal of the Optical Society of America. A, Optics, image science, and vision.

[24]  ByoungChul Ko,et al.  Interpretation and Simplification of Deep Forest , 2020, ArXiv.

[25]  Wojciech Matusik,et al.  Eye Tracking for Everyone , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Junyu Dong,et al.  Hybrid regression and isophote curvature for accurate eye center localization , 2019, Multimedia Tools and Applications.

[27]  Byung Cheol Song,et al.  Accurate Eye Pupil Localization Using Heterogeneous CNN Models , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[28]  Gang Liu,et al.  Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Margo I. Seltzer,et al.  Scalable Bayesian Rule Lists , 2016, ICML.

[31]  Parham Aarabi,et al.  Hybrid eye center localization using cascaded regression and hand-crafted model fitting , 2017, Image Vis. Comput..