Effectiveness of Data Augmentation for CNN-Based Pupil Center Point Detection

Convolutional neural network (CNN) is one of the appearance-based methods, and this method has a potential for pupil center point detection problem. However, CNN requires a very large amount of training data. To overcome this problem, data augmentation (DA) is useful solution. In this paper, we investigate the effects of three DA approaches; affine transformation-based approach, synthetic image-based approach, and generative adversarial network (GAN)-based approach, and evaluate the effectiveness of DA for pupil center point detection task using an off-the-shelf CNN model. We set eleven conditions of the training data, and evaluated the detection accuracy. The result showed that higher detection accuracy was obtained for eye image by applying affine transformation. In our pupil center point detection task, it is found that affine transformation-based approach of DA is effective.

[1]  Takeshi Saitoh,et al.  Pupil center detection for infrared irradiation eye image using CNN , 2017, 2017 56th Annual Conference of the Society of Instrument and Control Engineers of Japan (SICE).

[2]  Dongheng Li,et al.  Starburst: A hybrid algorithm for video-based eye tracking combining feature-based and model-based approaches , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[3]  Pierre Dillenbourg,et al.  3D model-based gaze estimation in natural reading: a systematic error correction procedure based on annotated texts , 2014, ETRA.

[4]  Deva Ramanan,et al.  Detecting activities of daily living in first-person camera views , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Peter Robinson,et al.  Learning an appearance-based gaze estimator from one million synthesised images , 2016, ETRA.

[6]  Minoru Ohyama,et al.  A Support System for Mouse Operations Using Eye-Gaze Input , 2009 .

[7]  James M. Rehg,et al.  Gaze-enabled egocentric video summarization via constrained submodular maximization , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Tomas Pfister,et al.  Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Takeshi Saitoh,et al.  CNN-Based Pupil Center Detection for Wearable Gaze Estimation System , 2017, Appl. Comput. Intell. Soft Comput..

[10]  Mario Fritz,et al.  Appearance-based gaze estimation in the wild , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Gjergji Kasneci,et al.  PupilNet: Convolutional Neural Networks for Robust Pupil Detection , 2016, ArXiv.

[12]  Jie Yang,et al.  A robust method for eye features extraction on color image , 2005, Pattern Recognit. Lett..