Calibrate the inter-observer segmentation uncertainty via diagnosis-first principle

On the medical images, many of the tissues/lesions may be ambiguous. That is why the medical segmentation is typically annotated by a group of clinical experts to mitigate the personal bias. However, this clinical routine also brings new challenges to the application of machine learning algorithms. Without a definite ground-truth, it will be difficult to train and evaluate the deep learning models. When the annotations are collected from different graders, a common choice is majority vote. However such a strategy ignores the difference between the grader expertness. In this paper, we consider the task of predicting the segmentation with the calibrated inter-observer uncertainty. We note that in clinical practice, the medical image segmentation is usually used to assist the disease diagnosis. Inspired by this observation, we propose diagnosis-first principle, which is to take disease diagnosis as the criterion to calibrate the inter-observer segmentation uncertainty. Following this idea, a framework named Diagnosis First segmentation Framework (DiFF) is proposed to estimate diagnosis-first segmentation from the raw images.Specifically, DiFF will first learn to fuse the multi-rater segmentation labels to a single ground-truth which could maximize the disease diagnosis performance. We dubbed the fused ground-truth as Diagnosis First Ground-truth (DF-GT).Then, we further propose Take and Give Modelto segment DF-GT from the raw image. We verify the effectiveness of DiFF on three different medical segmentation tasks: OD/OC segmentation on fundus images, thyroid nodule segmentation on ultrasound images, and skin lesion segmentation on dermoscopic images. Experimental results show that the proposed DiFF is able to significantly facilitate the corresponding disease diagnosis, which outperforms previous state-of-the-art multi-rater learning methods.

[1]  Yehui Yang,et al.  SeATrans: Learning Segmentation-Assisted diagnosis model via Transformer , 2022, MICCAI.

[2]  Yehui Yang,et al.  Learning self-calibrated optic disc and cup segmentation from multi-rater annotations , 2022, MICCAI.

[3]  Haidar A. Almubarak,et al.  REFUGE2 Challenge: Treasure for Multi-Domain Learning in Glaucoma Assessment , 2022, ArXiv.

[4]  Tao Lin,et al.  TNSNet: Thyroid nodule segmentation in ultrasound imaging using soft shape supervision , 2021, Comput. Methods Programs Biomed..

[5]  Qi Bi,et al.  Learning Calibrated Medical Image Segmentation via Multi-rater Agreement Modeling , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Mingzhi Mao,et al.  Multi-Task Learning For Thyroid Nodule Segmentation With Thyroid Region Prior , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[7]  Ari S. Morcos,et al.  ConViT: improving vision transformers with soft convolutional inductive biases , 2021, ICML.

[8]  Segmentation, Classification, and Registration of Multi-modality Medical Imaging Data: MICCAI 2020 Challenges, ABCs 2020, L2R 2020, TN-SCUI 2020, Held in Conjunction with MICCAI 2020, Lima, Peru, October 4–8, 2020, Proceedings , 2021, MICCAI.

[9]  Zhao Jing,et al.  Application of an attention U-Net incorporating transfer learning for optic disc and cup segmentation , 2020, Signal, Image and Video Processing.

[10]  Shuang Yu,et al.  Difficulty-aware Glaucoma Classification with Multi-Rater Consensus Modeling , 2020, MICCAI.

[11]  Shuang Yu,et al.  Leveraging Undiagnosed Data for Glaucoma Classification with Teacher-Student Learning , 2020, MICCAI.

[12]  Nicolas Usunier,et al.  End-to-End Object Detection with Transformers , 2020, ECCV.

[13]  Şaban Öztürk,et al.  Skin Lesion Segmentation with Improved Convolutional Neural Network , 2020, Journal of Digital Imaging.

[14]  Viksit Kumar,et al.  Automated Segmentation of Thyroid Nodule, Gland, and Cystic Components From Ultrasound Images Using Deep Learning , 2020, IEEE Access.

[15]  Lequan Yu,et al.  MS-Net: Multi-Site Network for Improving Prostate Segmentation With Heterogeneous MRI Data , 2020, IEEE Transactions on Medical Imaging.

[16]  Yang Song,et al.  3D APA-Net: 3D Adversarial Pyramid Anisotropic Convolutional Network for Prostate Segmentation in MR Images , 2020, IEEE Transactions on Medical Imaging.

[17]  Jiang Liu,et al.  A retrospective comparison of deep learning to manual annotations for optic disc and optic cup segmentation in fundus photos , 2020, medRxiv.

[18]  Martin Aastrup Olsen,et al.  Improving Uncertainty Estimation in Convolutional Neural Networks Using Inter-rater Agreement , 2019, MICCAI.

[19]  Hao Chen,et al.  Robust Multimodal Brain Tumor Segmentation via Feature Disentanglement and Gated Fusion , 2019, MICCAI.

[20]  H. Rolf Jäger,et al.  Let's agree to disagree: learning highly debatable multirater labelling , 2019, MICCAI.

[21]  Junde Wu,et al.  Universal, transferable and targeted adversarial attacks , 2019, ArXiv.

[22]  Shihao Zhang,et al.  Attention Guided Network for Retinal Image Segmentation , 2019, MICCAI.

[23]  Andreas Dengel,et al.  Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning , 2019, BMC Medical Informatics and Decision Making.

[24]  Enes Ayan,et al.  Skin Lesion Segmentation in Dermoscopic Images with Combination of YOLO and GrabCut Algorithm , 2019, Diagnostics.

[25]  Chi-Wing Fu,et al.  Boundary and Entropy-driven Adversarial Learning for Fundus Image Segmentation , 2019, MICCAI.

[26]  Ender Konukoglu,et al.  PHiSeg: Capturing Uncertainty in Medical Image Segmentation , 2019, MICCAI.

[27]  Ling Shao,et al.  Collaborative Learning of Semi-Supervised Segmentation and Classification for Medical Images , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Yogesan Kanagasingam,et al.  Robust optic disc and cup segmentation with deep learning for glaucoma detection , 2019, Comput. Medical Imaging Graph..

[29]  Yizhou Wang,et al.  Max-MIG: an Information Theoretic Approach for Joint Learning from Crowds , 2019, ICLR.

[30]  Chi-Chun Lee,et al.  Every Rating Matters: Joint Learning of Subjective Labels and Individual Annotators for Speech Emotion Classification , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[31]  Changxing Ding,et al.  Dual-force convolutional neural networks for accurate brain tumor segmentation , 2019, Pattern Recognit..

[32]  Xiaofei Wang,et al.  Attention Based Glaucoma Detection: A Large-Scale Database and CNN Model , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Shenghua Gao,et al.  CE-Net: Context Encoder Network for 2D Medical Image Segmentation , 2019, IEEE Transactions on Medical Imaging.

[34]  Hang Li,et al.  Dense Deconvolutional Network for Skin Lesion Segmentation , 2019, IEEE Journal of Biomedical and Health Informatics.

[35]  Chi-Wing Fu,et al.  Patch-Based Output Space Adversarial Learning for Joint Optic Disc and Cup Segmentation , 2019, IEEE Transactions on Medical Imaging.

[36]  Swami Sankaranarayanan,et al.  Learning From Noisy Labels by Regularized Estimation of Annotator Confusion , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Noel C. F. Codella,et al.  Skin Lesion Analysis Toward Melanoma Detection 2018: A Challenge Hosted by the International Skin Imaging Collaboration (ISIC) , 2019, ArXiv.

[38]  Li Cheng,et al.  Supervised Segmentation of Un-Annotated Retinal Fundus Images by Synthesis , 2019, IEEE Transactions on Medical Imaging.

[39]  Kai Liu,et al.  Thyroid Nodule Segmentation in Ultrasound Images Based on Cascaded Convolutional Neural Network , 2018, ICONIP.

[40]  Mun-Taek Choi,et al.  Skin lesion segmentation in dermoscopy images via deep full resolution convolutional networks , 2018, Comput. Methods Programs Biomed..

[41]  Ghassan Hamarneh,et al.  Star Shape Prior in Fully Convolutional Networks for Skin Lesion Segmentation , 2018, MICCAI.

[42]  Klaus H. Maier-Hein,et al.  A Probabilistic U-Net for Segmentation of Ambiguous Images , 2018, NeurIPS.

[43]  Mauricio Reyes,et al.  On the Effect of Inter-observer Variability for a Reliable Estimation of Uncertainty of Medical Image Segmentation , 2018, MICCAI.

[44]  Xiaochun Cao,et al.  Disc-Aware Ensemble Network for Glaucoma Screening From Fundus Image , 2018, IEEE Transactions on Medical Imaging.

[45]  Francisco C. Pereira,et al.  Deep learning from crowds , 2017, AAAI.

[46]  Geoffrey E. Hinton,et al.  Who Said What: Modeling Individual Labelers Improves Classification , 2017, AAAI.

[47]  Fa Wu,et al.  Ultrasound image-based thyroid nodule automatic segmentation using convolutional neural networks , 2017, International Journal of Computer Assisted Radiology and Surgery.

[48]  Sergio Guadarrama,et al.  The Devil is in the Decoder , 2017, BMVC.

[49]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[50]  Vigneswaran Thangaraj,et al.  Glaucoma diagnosis using support vector machine , 2017, 2017 International Conference on Intelligent Computing and Control Systems (ICICCS).

[51]  Yading Yuan,et al.  Automatic Skin Lesion Segmentation Using Deep Fully Convolutional Networks With Jaccard Distance , 2017, IEEE Transactions on Medical Imaging.

[52]  Kevin Gimpel,et al.  Gaussian Error Linear Units (GELUs) , 2016 .

[53]  Seyed-Ahmad Ahmadi,et al.  V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[54]  Nassir Navab,et al.  AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images , 2016, IEEE Trans. Medical Imaging.

[55]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Fabián Narváez,et al.  An open access thyroid ultrasound image database , 2015, Other Conferences.

[57]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[58]  S. Chandrika,et al.  Analysis Of Cdr Detection For Glaucoma Diagnosis , 2013 .

[59]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[60]  R. Truog,et al.  Assessment of Communication Skills and Self-Appraisal in the Simulated Environment: Feasibility of Multirater Feedback with Gap Analysis , 2009, Simulation in healthcare : journal of the Society for Simulation in Healthcare.

[61]  J. Grob,et al.  First prospective study of the recognition process of melanoma in dermatological practice. , 2005, Archives of dermatology.

[62]  William M. Wells,et al.  Simultaneous truth and performance level estimation (STAPLE): an algorithm for the validation of image segmentation , 2004, IEEE Transactions on Medical Imaging.

[63]  D. Garway-Heath,et al.  Vertical cup/disc ratio in relation to optic disc size: its value in the assessment of the glaucoma suspect , 1998, The British journal of ophthalmology.

[64]  Ron Kikinis,et al.  Markov random field segmentation of brain MR images , 1997, IEEE Transactions on Medical Imaging.

[65]  T. J. Hebert,et al.  Fast iterative segmentation of high resolution medical images , 1996, 1996 IEEE Nuclear Science Symposium. Conference Record.

[66]  A Horsman,et al.  Tumour volume determination from MR images by morphological segmentation , 1996, Physics in medicine and biology.

[67]  C. Davatzikos,et al.  Using a deformable surface model to obtain a shape representation of the cortex , 1995, Proceedings of International Symposium on Computer Vision - ISCV.

[68]  D. Louis Collins,et al.  Automatic 3‐D model‐based neuroanatomical segmentation , 1995 .