Use of Crowd Innovation to Develop an Artificial Intelligence–Based Solution for Radiation Therapy Targeting

Importance: Radiation therapy (RT) is a critical cancer treatment, but the existing radiation oncologist workforce does not meet growing global demand. One key physician task in RT planning involves tumor segmentation for targeting, which requires substantial training and is subject to significant interobserver variation.

Objective: To determine whether crowd innovation could be used to rapidly produce artificial intelligence (AI) solutions that replicate the accuracy of an expert radiation oncologist in segmenting lung tumors for RT targeting.

Design, Setting, and Participants: We conducted a 10-week, prize-based, online, 3-phase challenge (prizes totaled $55 000). A well-curated data set, including computed tomographic (CT) scans and lung tumor segmentations generated by an expert for clinical care, was used for the contest (CT scans from 461 patients; median 157 images per scan; 77 942 images in total; 8144 images with tumor present). Contestants were provided a training set of 229 CT scans with accompanying expert contours to develop their algorithms and were given feedback on their performance throughout the contest, including from the expert clinician.

Main Outcomes and Measures: The AI algorithms generated by contestants were automatically scored on an independent data set that was withheld from contestants, and performance was ranked using quantitative metrics that evaluated overlap of each algorithm’s automated segmentations with the expert’s segmentations. Performance was further benchmarked against human expert interobserver and intraobserver variation.

Results: A total of 564 contestants from 62 countries registered for this challenge, and 34 (6%) submitted algorithms. The automated segmentations produced by the top 5 AI algorithms, when combined using an ensemble model, had an accuracy (Dice coefficient = 0.79) that was within the benchmark of mean interobserver variation measured between 6 human experts. For phase 1, the top 7 algorithms had average custom segmentation scores (S scores) on the holdout data set ranging from 0.15 to 0.38, and suboptimal performance using relative measures of error. The average S scores for phase 2 increased to between 0.53 and 0.57, with a similar improvement in other performance metrics. In phase 3, performance of the top algorithm increased by an additional 9%. Combining the top 5 algorithms from phase 2 and phase 3 using an ensemble model yielded an additional 9% to 12% improvement in performance, with a final S score reaching 0.68.

Conclusions and Relevance: A combined crowd innovation and AI approach rapidly produced automated algorithms that replicated the skills of a highly trained physician for a critical task in radiation therapy. These AI algorithms could improve cancer care globally by transferring the skills of expert clinicians to under-resourced health care settings.
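
The Dice coefficient reported in the Results measures overlap between an automated segmentation A and the expert segmentation B as 2|A∩B| / (|A| + |B|), where 1 means identical masks and 0 means no overlap. The sketch below is illustrative only: it computes Dice on small 2D binary masks and combines several masks by per-pixel majority vote. The abstract does not describe how the ensemble model was built or how the custom S score is defined, so `majority_vote` is a hypothetical stand-in rather than the study's method, and the names `Mask`, `dice`, and `majority_vote` are assumptions introduced here.

```python
from typing import List, Sequence

# 2D binary mask: 1 = tumor pixel, 0 = background (illustrative type alias).
Mask = Sequence[Sequence[int]]


def dice(a: Mask, b: Mask) -> float:
    """Dice coefficient 2|A∩B| / (|A| + |B|) for two same-shaped binary masks."""
    flat_a = [v for row in a for v in row]
    flat_b = [v for row in b for v in row]
    if len(flat_a) != len(flat_b):
        raise ValueError("masks must have the same shape")
    overlap = sum(1 for x, y in zip(flat_a, flat_b) if x and y)
    total = sum(1 for x in flat_a if x) + sum(1 for y in flat_b if y)
    # Assumed convention: two empty masks count as perfect agreement.
    return 1.0 if total == 0 else 2.0 * overlap / total


def majority_vote(masks: Sequence[Mask]) -> List[List[int]]:
    """Hypothetical ensemble: mark a pixel as tumor if more than half of the masks do."""
    n = len(masks)
    rows, cols = len(masks[0]), len(masks[0][0])
    return [
        [1 if 2 * sum(m[r][c] for m in masks) > n else 0 for c in range(cols)]
        for r in range(rows)
    ]


if __name__ == "__main__":
    expert = [[0, 1, 1], [0, 1, 0]]
    predictions = [
        [[0, 1, 0], [0, 1, 1]],
        [[0, 1, 1], [0, 0, 0]],
        [[1, 1, 1], [0, 1, 0]],
    ]
    for i, p in enumerate(predictions, start=1):
        print(f"algorithm {i}: Dice = {dice(p, expert):.2f}")
    print(f"ensemble: Dice = {dice(majority_vote(predictions), expert):.2f}")
```

Majority voting is used here only because it is the simplest way to combine binary masks; a real ensemble might instead average per-pixel probabilities or learn weights for each algorithm.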
