AI-based pathology predicts origins for cancers of unknown primary

Cancer of unknown primary (CUP) is an enigmatic group of diagnoses in which the primary anatomical site of tumor origin cannot be determined. This poses a significant challenge because modern therapeutics, such as chemotherapy regimens and immune checkpoint inhibitors, are specific to the primary tumor. Recent work has focused on using genomics and transcriptomics to identify tumor origins. However, genomic testing is not conducted for every patient and lacks clinical penetration in low-resource settings. To overcome these challenges, we present a deep learning-based computational pathology algorithm, TOAD, that can provide a differential diagnosis for CUP using routinely acquired histology slides. We used 17,486 gigapixel whole-slide images with known primaries, spread over 18 common origins, to train a multi-task deep model that simultaneously identifies the tumor as primary or metastatic and predicts its site of origin. On an internal test set of 4,932 cases with known primaries, the model achieved a top-1 accuracy of 0.84 and a top-3 accuracy of 0.94; on an external test set of 662 cases from 202 different hospitals, it achieved top-1 and top-3 accuracies of 0.79 and 0.93, respectively. We further curated a dataset of 717 CUP cases from 151 different medical centers and identified a subset of 290 cases for which a differential diagnosis was assigned. Our model's predictions were concordant for 50% of these cases (κ = 0.4 when adjusted for agreement by chance), with a top-3 agreement of 75%. Our proposed method can be used as an assistive tool to assign differential diagnoses to complicated metastatic and CUP cases, and could be used in conjunction with or in lieu of immunohistochemical analysis and extensive diagnostic work-ups to reduce the occurrence of CUP.
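
The evaluation metrics named in the abstract (top-k accuracy, and agreement adjusted for chance via Cohen's κ) can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function names `top_k_accuracy` and `cohens_kappa`, the site labels, and the toy predictions are all hypothetical and chosen only to show how such numbers are typically computed.

```python
from collections import Counter


def top_k_accuracy(ranked_predictions, labels, k):
    """Fraction of cases whose true label is among the k highest-ranked predictions."""
    hits = sum(
        1 for ranked, label in zip(ranked_predictions, labels) if label in ranked[:k]
    )
    return hits / len(labels)


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa: observed agreement corrected for agreement expected by chance."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    counts_a, counts_b = Counter(ratings_a), Counter(ratings_b)
    # Chance agreement: probability both raters pick the same class independently.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # Both raters always use one identical class.
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: four cases, each with origins ranked by model confidence.
labels = ["lung", "breast", "colon", "lung"]
ranked = [
    ["lung", "breast", "colon"],
    ["colon", "breast", "lung"],
    ["colon", "lung", "breast"],
    ["breast", "colon", "lung"],
]

top1 = top_k_accuracy(ranked, labels, k=1)
top3 = top_k_accuracy(ranked, labels, k=3)
kappa = cohens_kappa([r[0] for r in ranked], labels)
print(f"top-1={top1:.2f} top-3={top3:.2f} kappa={kappa:.3f}")
```

In this toy example the top-1 prediction matches the label in half of the cases, but κ is lower than the raw agreement because some of that agreement would be expected by chance alone.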
