Semi-Supervised Learning for Eye Image Segmentation

Recent advances in appearance-based models have shown improved eye tracking performance in difficult scenarios like occlusion due to eyelashes, eyelids or camera placement, and environmental reflections on the cornea and glasses. The key reason for the improvement is the accurate and robust identification of eye parts (pupil, iris, and sclera regions). The improved accuracy often comes at the cost of labeling an enormous dataset, which is complex and time-consuming. This work presents two semi-supervised learning frameworks to identify eye-parts by taking advantage of unlabeled images where labeled datasets are scarce. With these frameworks, leveraging the domain-specific augmentation and novel spatially varying transformations for image segmentation, we show improved performance on various test cases with limited labeled samples. For instance, for a model trained on just 4 and 48 labeled images, these frameworks improved by at least 4.7% and 0.4% respectively, in segmentation performance over the baseline model, which is trained only with the labeled dataset.

[1]  Karel J. Zuiderveld,et al.  Contrast Limited Adaptive Histogram Equalization , 1994, Graphics Gems.

[2]  Oleg V. Komogortsev,et al.  Benefits of temporal information for appearance-based gaze estimation , 2020, ETRA Short Papers.

[3]  Reynold J. Bailey,et al.  RITnet: Real-time Semantic Segmentation of the Eye for Gaze Tracking , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[4]  David Berthelot,et al.  MixMatch: A Holistic Approach to Semi-Supervised Learning , 2019, NeurIPS.

[5]  Alexander Kolesnikov,et al.  Revisiting Self-Supervised Visual Representation Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Zhiyuan Li,et al.  Semi-Supervised Learning by Disentangling and Self-Ensembling Over Stochastic Latent Space , 2019, MICCAI.

[7]  Sandesh Ghimire,et al.  Semi-supervised Medical Image Classification with Global Latent Mixing , 2020, MICCAI.

[8]  Cordelia Schmid,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[9]  C. Hudelot,et al.  Semi-Supervised Semantic Segmentation With Cross-Consistency Training , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  C. V. Jawahar,et al.  Universal Semi-Supervised Semantic Segmentation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Christopher Kanan,et al.  Gaze-in-wild: A dataset for studying eye and head coordination in everyday activities , 2019, Scientific Reports.

[12]  Oleg V. Komogortsev,et al.  OpenEDS2020: Open Eyes Dataset , 2020, ArXiv.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Xucong Zhang,et al.  Towards End-to-end Video-based Eye-Tracking , 2020, ECCV.

[15]  Jose Dolz,et al.  Boundary loss for highly unbalanced segmentation , 2018, MIDL.

[16]  Zhengyang Wu,et al.  EyeNet: A Multi-Task Deep Network for Off-Axis Eye Gaze Estimation , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[17]  Jeff B. Pelz,et al.  EllSeg: An Ellipse Segmentation Framework for Robust Gaze Tracking , 2020, IEEE Transactions on Visualization and Computer Graphics.

[18]  Seyed-Ahmad Ahmadi,et al.  DeepVOG: Open-source pupil segmentation and gaze estimation in neuroscience using deep learning , 2019, Journal of Neuroscience Methods.

[19]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[20]  Timo Aila,et al.  Temporal Ensembling for Semi-Supervised Learning , 2016, ICLR.

[21]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[22]  Oleg Komogortsev,et al.  Domain Adaptation for Eye Segmentation , 2020, ECCV Workshops.

[23]  Gregory Hughes,et al.  OpenEDS: Open Eye Dataset , 2019, ArXiv.