Improving automatic delineation for head and neck organs at risk by Deep Learning Contouring.

INTRODUCTION Adequate head and neck (HN) organ-at-risk (OAR) delineation is crucial for HN radiotherapy and for investigating the relationships between radiation dose to OARs and radiation-induced side effects. The automatic contouring algorithms that are currently in clinical use, such as atlas-based contouring (ABAS), leave room for improvement. The aim of this study was to use a comprehensive evaluation methodology to investigate the performance of HN OAR auto-contouring when using deep learning contouring (DLC), compared to ABAS. METHODS The DLC neural network was trained on 589 HN cancer patients. DLC was compared to ABAS by providing each method with an independent validation cohort of 104 patients, which had also been manually contoured. For each of the 22 OAR contours - glandular, upper digestive tract and central nervous system (CNS)-related structures - the dice similarity coefficient (DICE), and absolute mean and max dose differences (|Δmean-dose| and |Δmax-dose|) performance measures were obtained. For a subset of 7 OARs, an evaluation of contouring time, inter-observer variation and subjective judgement was performed. RESULTS DLC resulted in equal or significantly improved quantitative performance measures in 19 out of 22 OARs, compared to the ABAS (DICE/|Δmean dose|/|Δmax dose|: 0.59/4.2/4.1 Gy (ABAS); 0.74/1.1/0.8 Gy (DLC)). The improvements were mainly for the glandular and upper digestive tract OARs. DLC significantly reduced the delineation time for the inexperienced observer. The subjective evaluation showed that DLC contours were more often preferable to the ABAS contours overall, were considered to be more precise, and more often confused with manual contours. Manual contours still outperformed both DLC and ABAS; however, DLC results were within or bordering the inter-observer variability for the manual edited contours in this cohort. CONCLUSION The DLC, trained on a large HN cancer patient cohort, outperformed the ABAS for the majority of HN OARs.

[1]  Alex Lallement,et al.  Survey on deep learning for radiotherapy , 2018, Comput. Biol. Medicine.

[2]  A. Garden,et al.  Candidate dosimetric predictors of long-term swallowing dysfunction after oropharyngeal intensity-modulated radiotherapy. , 2010, International journal of radiation oncology, biology, physics.

[3]  Xiao Han,et al.  Clinical validation of atlas-based auto-segmentation of multiple target volumes and normal tissue (swallowing/mastication) structures in the head and neck. , 2011, International journal of radiation oncology, biology, physics.

[4]  W. Ng,et al.  Automatic segmentation for adaptive planning in nasopharyngeal carcinoma IMRT: Time, geometrical, and dosimetric analysis. , 2020, Medical dosimetry : official journal of the American Association of Medical Dosimetrists.

[5]  Paul Aljabar,et al.  Comparative evaluation of autocontouring in clinical practice: A practical method using the Turing test , 2018, Medical physics.

[6]  Nicholas Slevin,et al.  Evaluation of an automatic segmentation algorithm for definition of head and neck organs at risk , 2014, Radiation oncology.

[7]  Djamal Boukerroui,et al.  An Evaluation of Atlas Selection Methods for Atlas-Based Automatic Segmentation in Radiotherapy Treatment Planning , 2019, IEEE Transactions on Medical Imaging.

[8]  Johannes A Langendijk,et al.  NTCP models for patient-rated xerostomia and sticky saliva after treatment with intensity modulated radiotherapy for head and neck cancer: the role of dosimetric and clinical factors. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[9]  L. R. Dice Measures of the Amount of Ecologic Association Between Species , 1945 .

[10]  Torsten Rohlfing,et al.  Quo Vadis, Atlas-Based Segmentation? , 2005 .

[11]  Joseph O Deasy,et al.  Radiotherapy dose-volume effects on salivary gland function. , 2010, International journal of radiation oncology, biology, physics.

[12]  V. Budach,et al.  Protection of quality and innovation in radiation oncology: The prospective multicenter trial the German Society of Radiation Oncology (DEGRO-QUIRO study) , 2014, Strahlentherapie und Onkologie.

[13]  Martin Styner,et al.  Comparison and Evaluation of Methods for Liver Segmentation From CT Datasets , 2009, IEEE Transactions on Medical Imaging.

[14]  Vincenzo Valentini,et al.  Recommendations on how to establish evidence from auto-segmentation software in radiotherapy. , 2014, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[15]  Kuo Men,et al.  Automatic segmentation of the clinical target volume and organs at risk in the planning CT for rectal cancer using deep dilated convolutional neural networks , 2017, Medical physics.

[16]  G. Sharp,et al.  Vision 20/20: perspectives on automated image segmentation for radiotherapy. , 2014, Medical physics.

[17]  E. R. van den Heuvel,et al.  3D Variation in delineation of head and neck organs at risk , 2012, Radiation oncology.

[18]  Indrin J Chetty,et al.  Analysis of deformable image registration accuracy using computational modeling. , 2010, Medical physics.

[19]  Paul Aljabar,et al.  Autosegmentation for thoracic radiation treatment planning: A grand challenge at AAPM 2017 , 2018, Medical physics.

[20]  Johannes A Langendijk,et al.  Advances in Radiotherapy for Head and Neck Cancer. , 2015, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[21]  Fan Tang,et al.  Deep-learning-based detection and segmentation of organs at risk in nasopharyngeal carcinoma computed tomographic images for radiotherapy planning , 2018, European Radiology.

[22]  Bulat Ibragimov,et al.  Combining deep learning with anatomical analysis for segmentation of the portal vein for liver SBRT planning , 2017, Physics in medicine and biology.

[23]  R. Steenbakkers,et al.  CT-based delineation of organs at risk in the head and neck region: DAHANCA, EORTC, GORTEC, HKNPCSG, NCIC CTG, NCRI, NRG Oncology and TROG consensus guidelines. , 2015, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[24]  A. Larrue,et al.  The impact of the number of atlases on the performance of automatic multi-atlas contouring , 2015 .

[25]  T Kadir,et al.  TU-AB-202-10: How Effective Are Current Atlas Selection Methods for Atlas-Based Auto-Contouring in Radiotherapy Planning? , 2016, Medical physics.

[26]  Xiao Han,et al.  Atlas-Based Auto-segmentation of Head and Neck CT Images , 2008, MICCAI.

[27]  Bulat Ibragimov,et al.  Segmentation of organs‐at‐risks in head and neck CT images using convolutional neural networks , 2017, Medical physics.

[28]  A. van der Schaaf,et al.  Development and Validation of a Prediction Model for Tube Feeding Dependence after Curative (Chemo-) Radiation in Head and Neck Cancer , 2014, PloS one.

[29]  Jean-François Daisne,et al.  Inter-observer variability in the delineation of pharyngo-laryngeal tumor, parotid glands and cervical spinal cord: comparison between CT-scan and MRI. , 2005, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[30]  S Nill,et al.  Geometric and dosimetric evaluations of atlas-based segmentation methods of MR images in the head and neck region , 2018, Physics in medicine and biology.

[31]  Paul Aljabar,et al.  Clinical evaluation of atlas and deep learning based automatic contouring for lung cancer. , 2017, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[32]  H. Duvernoy,et al.  Practical contouring guidelines with an MR-based atlas of brainstem structures involved in radiation-induced nausea and vomiting. , 2019, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[33]  M. Leech,et al.  Use of auto-segmentation in the delineation of target volumes and organs at risk in head and neck , 2016, Acta oncologica.

[34]  Eric Achten,et al.  Optimal number of atlases and label fusion for automatic multi-atlas-based brachial plexus contouring in radiotherapy treatment planning , 2016, Radiation Oncology.

[35]  Mary Feng,et al.  Normal tissue anatomy for oropharyngeal cancer: contouring variability and its impact on optimization. , 2012, International journal of radiation oncology, biology, physics.

[36]  Ryan L. Smith,et al.  Performance of 12 DIR algorithms in low-contrast regions for mass and density conserving deformation. , 2013, Medical physics.

[37]  Djamal Boukerroui,et al.  Can Atlas-Based Auto-Segmentation Ever Be Perfect? Insights From Extreme Value Theory , 2019, IEEE Transactions on Medical Imaging.