Diagnosing glaucoma on imbalanced data with self-ensemble dual-curriculum learning

Glaucoma diagnosis often suffers from two types of data imbalances: 1) class imbalance, i.e., the non-glaucoma majority cases occupy most of the data; 2) rare cases, i.e., few cases present the uncommon retinopathy e.g., bayoneting or physiologic cupping. This dual-imbalances make glaucoma diagnosis model easy to be dominated by the majority cases but cannot correctly classify the minority and/or rare ones. In this paper, we propose an adaptive re-balancing strategy in the feature space, Self-Ensemble Dual-Curriculum learning (SEDC), to improve the glaucoma diagnosis on imbalanced data by augmenting feature distribution with feature distilling and feature re-weighting. Firstly, the self-ensembling (SEL) is developed to reinforce the discriminative ability of feature representations for the minority class and rare cases by distilling the features learned from the abundant majority cases. Secondly, the dual-curriculum (DCL) is designed to adaptively re-weight the imbalanced data in the feature space to learn a balanced decision function for accurate glaucoma diagnosis. Benefiting from feature distilling and re-weighting, the proposed SEDC fairly represents fundus images, regardless of the majority or rare cases, by augmenting the feature distribution to obtains the optimal decision boundary for accurate glaucoma diagnosis on the imbalanced dataset. Experimental results on three challenging glaucoma datasets show that our SEDC successfully delivers accurate glaucoma diagnosis by the adaptive re-balancing strategy, with the average mean value of Accuracy 0.9712, Sensitivity 0.9520, Specificity 0.9816, AUC 0.9928, F2-score 0.9547. Ablation and comparison studies demonstrate that our method outperforms state-of-the-art methods and traditional re-balancing strategies. The experiment also shows that the adaptive re-balancing strategy proposed in our method provides a more effective training approach with optimal convergence performance. It endows our SEDC a great advantage to handle the disease diagnosis on imbalanced data distribution.

[1]  Wei Wu,et al.  Dynamic Curriculum Learning for Imbalanced Data Classification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[2]  Lequan Yu,et al.  Semi-Supervised Medical Image Classification With Relation-Driven Self-Ensembling Model , 2020, IEEE Transactions on Medical Imaging.

[3]  Ioannis A. Kakadiaris,et al.  Deep Imbalanced Attribute Classification using Visual Attention Aggregation , 2018, ECCV.

[4]  Xiaoxiao Li,et al.  REFUGE Challenge: A Unified Framework for Evaluating Automated Methods for Glaucoma Assessment from Fundus Photographs , 2019, Medical Image Anal..

[5]  Yang Song,et al.  Class-Balanced Loss Based on Effective Number of Samples , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Francisco Fumero,et al.  RIM-ONE: An open retinal image database for optic nerve evaluation , 2011, 2011 24th International Symposium on Computer-Based Medical Systems (CBMS).

[7]  Bin Yang,et al.  Learning to Reweight Examples for Robust Deep Learning , 2018, ICML.

[8]  Xing Ji,et al.  CosFace: Large Margin Cosine Loss for Deep Face Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Evgeny Smirnov,et al.  Hard Example Mining with Auxiliary Embeddings , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[10]  Stefanos Zafeiriou,et al.  ArcFace: Additive Angular Margin Loss for Deep Face Recognition , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Beiji Zou,et al.  Clinical Interpretable Deep Learning Model for Glaucoma Diagnosis , 2020, IEEE Journal of Biomedical and Health Informatics.

[12]  Xiaochun Cao,et al.  Joint Optic Disc and Cup Segmentation Based on Multi-Label Deep Network and Polar Transformation , 2018, IEEE Transactions on Medical Imaging.

[13]  Tien Yin Wong,et al.  Glaucoma detection based on deep convolutional neural network , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[14]  Xiaofei Wang,et al.  Attention Based Glaucoma Detection: A Large-Scale Database and CNN Model , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Matthias Bethge,et al.  ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness , 2018, ICLR.

[16]  Michal Valko,et al.  Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning , 2020, NeurIPS.

[17]  Xiangyu Zhu,et al.  AdaptiveFace: Adaptive Margin and Sampling for Face Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Colin Wei,et al.  Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss , 2019, NeurIPS.

[19]  Nassir Navab,et al.  Medical-based Deep Curriculum Learning for Improved Fracture Classification , 2019, MICCAI.

[20]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Geoffrey E. Hinton,et al.  A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[22]  Shiguang Shan,et al.  Self-Paced Curriculum Learning , 2015, AAAI.

[23]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[24]  Xiu-Shen Wei,et al.  BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed Visual Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[26]  Qian Wang,et al.  A Self-ensembling Framework for Semi-supervised Knee Osteoarthritis Localization and Classification with Dual-Consistency , 2020, ArXiv.

[27]  Yi Li,et al.  REPAIR: Removing Representation Bias by Dataset Resampling , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Ajinkya More,et al.  Survey of resampling techniques for improving classification performance in unbalanced datasets , 2016, ArXiv.

[29]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[30]  M. He,et al.  Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs. , 2018, Ophthalmology.

[31]  Loïc Le Folgoc,et al.  Attention U-Net: Learning Where to Look for the Pancreas , 2018, ArXiv.

[32]  Bhiksha Raj,et al.  SphereFace: Deep Hypersphere Embedding for Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Shuo Li,et al.  Multi-index Optic Disc Quantification via MultiTask Ensemble Learning , 2019, MICCAI.

[34]  Shuo Li,et al.  Multi-indices quantification of optic nerve head in fundus image via multitask collaborative learning , 2019, Medical Image Anal..

[35]  Abhinav Gupta,et al.  Training Region-Based Object Detectors with Online Hard Example Mining , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Rongchang Zhao,et al.  EGDCL: An Adaptive Curriculum Learning Framework for Unbiased Glaucoma Diagnosis , 2020, ECCV.

[37]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[38]  Marcus Rohrbach,et al.  Decoupling Representation and Classifier for Long-Tailed Recognition , 2020, ICLR.

[39]  Shuicheng Yan,et al.  Automatic Feature Learning for Glaucoma Detection Based on Deep Learning , 2015, MICCAI.

[40]  Subhransu Maji,et al.  Bilinear CNN Models for Fine-Grained Visual Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[41]  Baihua Li,et al.  Automatic extraction of retinal features from colour retinal images for glaucoma diagnosis: A review , 2013, Comput. Medical Imaging Graph..

[42]  Harri Valpola,et al.  Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[43]  Kaiming He,et al.  Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Shuo Li,et al.  Weakly-Supervised Simultaneous Evidence Identification and Segmentation for Automated Glaucoma Diagnosis , 2019, AAAI.

[45]  Nicolas Guizard,et al.  CASED: Curriculum Adaptive Sampling for Extreme Data Imbalance , 2017, MICCAI.

[46]  Aditya Prasad,et al.  Unsupervised Hard Example Mining from Videos for Improved Object Detection , 2018, ECCV.

[47]  Shifeng Zhang,et al.  Ensemble Soft-Margin Softmax Loss for Image Classification , 2018, IJCAI.

[48]  Meng Yang,et al.  Large-Margin Softmax Loss for Convolutional Neural Networks , 2016, ICML.

[49]  Zachary C. Lipton,et al.  What is the Effect of Importance Weighting in Deep Learning? , 2018, ICML.

[50]  Xiaochun Cao,et al.  Disc-Aware Ensemble Network for Glaucoma Screening From Fundus Image , 2018, IEEE Transactions on Medical Imaging.

[51]  Fan Guo,et al.  Direct Cup-to-Disc Ratio Estimation for Glaucoma Screening via Semi-Supervised Learning , 2020, IEEE Journal of Biomedical and Health Informatics.