A Pathology Deep Learning System Capable of Triage of Melanoma Specimens Utilizing Dermatopathologist Consensus as Ground Truth *

Although melanoma occurs more rarely than several other skin cancers, patients’ long term survival rate is extremely low if the diagnosis is missed. Diagnosis is complicated by a high discordance rate among pathologists when distinguishing between melanoma and benign melanocytic lesions. A tool that allows pathology labs to sort and prioritize melanoma cases in their workflow could improve turnaround time by prioritizing challenging cases and routing them directly to the appropriate sub-specialist. We present a pathology deep learning system (PDLS) that performs hierarchical classification of digitized whole slide image (WSI) specimens into six classes defined by their morphological characteristics, including classification of "Melanocytic Suspect" specimens likely representing melanoma or severe dysplastic nevi . We trained the system on 7,685 images from a single lab (the reference lab), including the the largest set of triple-concordant melanocytic specimens compiled to date, and tested the sys-tem on 5,099 images from two distinct validation labs. We achieved Area Underneath the ROC Curve (AUC) values of 0.93 classifying Melanocytic Suspect specimens on the reference lab, 0.95 on the first validation lab, and 0.82 on the second validation lab. We demonstrate that the PDLS is capable of automatically sorting and triaging skin specimens with high sensitivity to Melanocytic Suspect cases and that a pathologist would only need between 30% and 60% of the caseload to address all melanoma specimens.

[1]  Thomas J. Fuchs,et al.  Clinical-grade computational pathology using weakly supervised deep learning on whole slide images , 2019, Nature Medicine.

[2]  Joann G Elmore,et al.  Pathologists ' diagnosis of invasive melanoma and melanocytic proliferations : observer accuracy and reproducibility study , 2019 .

[3]  Jason Y. Park,et al.  Trends in the US and Canadian Pathologist Workforces From 2007 to 2017 , 2019, JAMA network open.

[4]  B. Hankey,et al.  Surveillance, Epidemiology, and End Results Program , 1999 .

[5]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[6]  Geert J. S. Litjens,et al.  Quantifying the effects of data augmentation and stain color normalization in convolutional neural networks for computational pathology , 2019, Medical Image Anal..

[7]  M A Weinstock,et al.  Epidemiology of melanoma. , 2017, Cancer treatment and research.

[8]  Jesús Chamorro-Martínez,et al.  Diatom autofocusing in brightfield microscopy: a comparative study , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[9]  Liron Pantanowitz,et al.  An artificial intelligence algorithm for prostate cancer diagnosis in whole slide images of core needle biopsies: a blinded clinical validation and deployment study. , 2020, The Lancet. Digital health.

[10]  J. Elmore,et al.  The MPATH-Dx reporting schema for melanocytic proliferations and melanoma. , 2014, Journal of the American Academy of Dermatology.

[11]  B. B. Weitner,et al.  Histomorphologic Assessment and Interobserver Diagnostic Reproducibility of Atypical Spitzoid Melanocytic Neoplasms With Long-term Follow-up , 2014, The American journal of surgical pathology.

[12]  Lynne Penberthy,et al.  Cancer Incidence and Survival Trends by Subtype Using Data from the Surveillance Epidemiology and End Results Program, 1992–2013 , 2016, Cancer Epidemiology, Biomarkers & Prevention.

[13]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[14]  M. Kucenic,et al.  Diagnostic concordance rates in the subtyping of basal cell carcinoma by different dermatopathologists , 2014, Journal of cutaneous pathology.

[15]  Bram van Ginneken,et al.  The importance of stain normalization in colorectal tissue classification with convolutional networks , 2017, 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017).

[16]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Domenec Puig,et al.  Analysis of focus measure operators for shape-from-focus , 2013, Pattern Recognit..

[18]  Claes Lundström,et al.  A Closer Look at Domain Shift for Deep Learning in Histopathology , 2019, ArXiv.

[19]  Ming Y. Lu,et al.  Data-efficient and weakly supervised computational pathology on whole-slide images , 2020, Nature Biomedical Engineering.

[20]  Max Welling,et al.  Attention-based Deep Multiple Instance Learning , 2018, ICML.

[21]  D. J. Geijs,et al.  End-to-end classification on basal-cell carcinoma histopathology whole-slides images , 2021, Medical Imaging.

[22]  Julianna D. Ianni,et al.  Tailored for Real-World: A Whole Slide Image Classification System Validated on Uncurated Multi-Site Data Emulating the Prospective Pathology Workload , 2019, Scientific Reports.

[23]  Simon M Thomas,et al.  Interpretable deep learning systems for multi-class segmentation and classification of non-melanoma skin cancer , 2020, Medical Image Anal..

[24]  Yinyin Yuan,et al.  SuperHistopath: A Deep Learning Pipeline for Mapping Tumor Heterogeneity on Low-Resolution Whole-Slide Digital Histopathology Images , 2021, Frontiers in Oncology.

[25]  R. Corona,et al.  Interobserver variability on the histopathologic diagnosis of cutaneous melanoma and other pigmented skin lesions. , 1996, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[26]  D. Massi,et al.  Recognition of Cutaneous Melanoma on Digitized Histopathological Slides via Artificial Intelligence Algorithm , 2020, Frontiers in Oncology.

[27]  Andrea C. Radick,et al.  Accuracy of Digital Pathologic Analysis vs Traditional Microscopy in the Interpretation of Melanocytic Lesions , 2018, JAMA dermatology.