Learning the retinal anatomy from scarce annotated data using self-supervised multimodal reconstruction

Abstract Deep learning is becoming the reference paradigm for approaching many computer vision problems. Nevertheless, the training of deep neural networks typically requires a significantly large amount of annotated data, which is not always available. A proven approach to alleviate the scarcity of annotated data is transfer learning. However, in practice, the use of this technique typically relies on the availability of additional annotations, either from the same or natural domain. We propose a novel alternative that allows to apply transfer learning from unlabelled data of the same domain, which consists in the use of a multimodal reconstruction task. A neural network trained to generate one image modality from another must learn relevant patterns from the images to successfully solve the task. These learned patterns can then be used to solve additional tasks in the same domain, reducing the necessity of a large amount of annotated data. In this work, we apply the described idea to the localization and segmentation of the most important anatomical structures of the eye fundus in retinography. The objective is to reduce the amount of annotated data that is required to solve the different tasks using deep neural networks. For that purpose, a neural network is pre-trained using the self-supervised multimodal reconstruction of fluorescein angiography from retinography. Then, the network is fine-tuned on the different target tasks performed on the retinography. The obtained results demonstrate that the proposed self-supervised transfer learning strategy leads to state-of-the-art performance in all the studied tasks with a significant reduction of the required annotations.

[1]  Nadia K Waheed,et al.  Contemporary retinal imaging techniques in diabetic retinopathy: a review , 2016, Clinical & experimental ophthalmology.

[2]  Max A. Viergever,et al.  Ridge-based vessel segmentation in color images of the retina , 2004, IEEE Transactions on Medical Imaging.

[3]  Hossein Rabbani,et al.  Diabetic Retinopathy Grading by Digital Curvelet Transform , 2012, Comput. Math. Methods Medicine.

[4]  Marios S. Pattichis,et al.  Fast localization of optic disc and fovea in retinal images for eye disease screening , 2011, Medical Imaging.

[5]  Luc Van Gool,et al.  Deep Retinal Image Understanding , 2016, MICCAI.

[6]  András Hajdu,et al.  Combining algorithms for automatic detection of optic disc and macula in fundus images , 2012, Comput. Vis. Image Underst..

[7]  Ana Maria Mendonça,et al.  A Pixel-Wise Distance Regression Approach for Joint Retinal Optical Disc and Fovea Detection , 2018, MICCAI.

[8]  Jiong Zhang,et al.  Automatic Optic Disc and Fovea Detection in Retinal Images Using Super-Elliptical Convergence Index Filters , 2016, ICIAR.

[9]  Marcos Ortega,et al.  Multimodal Registration of Retinal Images Using Domain-Specific Landmarks and Vessel Enhancement , 2018, KES.

[10]  Bunyarit Uyyanonvara,et al.  An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation , 2012, IEEE Transactions on Biomedical Engineering.

[11]  Rangaraj M. Rangayyan,et al.  Detection of the Optic Nerve Head in Fundus Images of the Retina Using the Hough Transform for Circles , 2010, Journal of Digital Imaging.

[12]  Andrew Zisserman,et al.  Multi-task Self-Supervised Visual Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[13]  Klaus H. Maier-Hein,et al.  Exploiting the potential of unlabeled endoscopic video data with self-supervised learning , 2017, International Journal of Computer Assisted Radiology and Surgery.

[14]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Mariano Rincón,et al.  Identification of the optic nerve head with genetic algorithms , 2008, Artif. Intell. Medicine.

[16]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[17]  Amin Dehghani,et al.  Optic disc localization in retinal images using histogram matching , 2012, EURASIP Journal on Image and Video Processing.

[18]  Marcos Ortega,et al.  Retinal Image Understanding Emerges from Self-Supervised Multimodal Reconstruction , 2018, MICCAI.

[19]  José Manuel Bravo,et al.  Locating the fovea center position in digital fundus images using thresholding and feature extraction techniques , 2013, Comput. Medical Imaging Graph..

[20]  Tanveer F. Syeda-Mahmood,et al.  Building medical image classifiers with very limited data using segmentation networks , 2018, Medical Image Anal..

[21]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[22]  Guy Cazuguel,et al.  FEEDBACK ON A PUBLICLY DISTRIBUTED IMAGE DATABASE: THE MESSIDOR DATABASE , 2014 .

[23]  Andrew Zisserman,et al.  Self-supervised Learning for Spinal MRIs , 2017, DLMIA/ML-CDS@MICCAI.

[24]  András Hajdu,et al.  A review on automatic analysis techniques for color fundus photographs , 2016, Computational and structural biotechnology journal.

[25]  Gustavo Carneiro,et al.  Training Medical Image Analysis Systems like Radiologists , 2018, MICCAI.

[26]  Jorge Novo,et al.  Self-Supervised Deep Learning for Retinal Vessel Segmentation Using Automatically Generated Labels from Multimodal Data , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[27]  Tianfu Wang,et al.  A Cross-Modality Learning Approach for Vessel Segmentation in Retinal Images , 2016, IEEE Transactions on Medical Imaging.

[28]  A.D. Hoover,et al.  Locating blood vessels in retinal images by piecewise threshold probing of a matched filter response , 2000, IEEE Transactions on Medical Imaging.

[29]  Nima Tajbakhsh,et al.  Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning? , 2016, IEEE Transactions on Medical Imaging.

[30]  Lei Zhang,et al.  Multi-level deep supervised networks for retinal vessel segmentation , 2017, International Journal of Computer Assisted Radiology and Surgery.

[31]  Krzysztof Krawiec,et al.  Segmenting Retinal Blood Vessels With Deep Neural Networks , 2016, IEEE Transactions on Medical Imaging.

[32]  Baidaa Al-Bander,et al.  Multiscale sequential convolutional neural networks for simultaneous detection of fovea and optic disc , 2018, Biomed. Signal Process. Control..

[33]  Andrew Zisserman,et al.  Fully‐automated alignment of 3D fetal brain ultrasound to a canonical reference space using multi‐task learning , 2018, Medical Image Anal..

[34]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[35]  Bram van Ginneken,et al.  Off-the-shelf convolutional neural network features for pulmonary nodule detection in computed tomography scans , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[36]  Bram van Ginneken,et al.  Fast detection of the optic disc and fovea in color fundus photographs , 2009, Medical Image Anal..

[37]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[38]  José García Rodríguez,et al.  A survey on deep learning techniques for image and video semantic segmentation , 2018, Appl. Soft Comput..

[39]  José Manuel Bravo,et al.  Obtaining optic disc center and pixel region by automatic thresholding methods on morphologically processed fundus images , 2015, Comput. Methods Programs Biomed..