Morphological and molecular breast cancer profiling through explainable machine learning

Recent advances in cancer research and diagnostics largely rely on new developments in microscopic or molecular profiling techniques, offering high levels of detail with respect to either spatial or molecular features, but usually not both. Here, we present an explainable machine-learning approach for the integrated profiling of morphological, molecular and clinical features from breast cancer histology. First, our approach allows for the robust detection of cancer cells and tumour-infiltrating lymphocytes in histological images, providing precise heatmap visualizations explaining the classifier decisions. Second, molecular features, including DNA methylation, gene expression, copy number variations, somatic mutations and proteins are predicted from histology. Molecular predictions reach balanced accuracies up to 78%, whereas accuracies of over 95% can be achieved for subgroups of patients. Finally, our explainable AI approach allows assessment of the link between morphological and molecular cancer properties. The resulting computational multiplex-histology analysis can help promote basic cancer research and precision medicine through an integrated diagnostic scoring of histological, clinical and molecular features. Cancers are complex diseases that are increasingly studied using a diverse set of omics data. At the same time, histological images show the interaction of cells, which is not visible with bulk omics methods. Binder and colleagues present a method to learn from both kinds of data, such that molecular markers can be associated with visible patterns in the tissue samples and be used for more accurate breast cancer diagnosis.

[1]  Alexander Binder,et al.  Unmasking Clever Hans predictors and assessing what machines really learn , 2019, Nature Communications.

[2]  Sarah N. Dudgeon,et al.  Report on computational assessment of Tumor Infiltrating Lymphocytes from the International Immuno-Oncology Biomarker Working Group. , 2020, NPJ breast cancer.

[3]  Andrew H. Beck,et al.  Systematic Analysis of Breast Cancer Morphology Uncovers Stromal Features Associated with Survival , 2011, Science Translational Medicine.

[4]  Gunnar Rätsch,et al.  The SHOGUN Machine Learning Toolbox , 2010, J. Mach. Learn. Res..

[5]  M. Salido,et al.  Defective Cyclin B1 Induction in Trastuzumab-emtansine (T-DM1) Acquired Resistance in HER2-positive Breast Cancer , 2017, Clinical Cancer Research.

[6]  Alexander Binder,et al.  Analyzing Classifiers: Fisher Vectors and Deep Neural Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  P. Fasching,et al.  Tumour-infiltrating lymphocytes and prognosis in different subtypes of breast cancer: a pooled analysis of 3771 patients treated with neoadjuvant therapy. , 2018, The Lancet. Oncology.

[8]  R. Mirimanoff,et al.  Inhibition of the Kit Ligand/c-Kit Axis Attenuates Metastasis in a Mouse Model Mimicking Local Breast Cancer Relapse after Radiotherapy , 2012, Clinical Cancer Research.

[9]  Hai Su,et al.  Pathologist-level interpretable whole-slide cancer diagnosis with deep learning , 2019, Nat. Mach. Intell..

[10]  W. Weichert,et al.  Classical pathology and mutational load of breast cancer – integration of two worlds , 2015, The journal of pathology. Clinical research.

[11]  Jakob Nikolas Kather,et al.  Deep learning can predict microsatellite instability directly from histology in gastrointestinal cancer , 2019, Nature Medicine.

[12]  Fabio A. González,et al.  Histopathology Image Classification Using Bag of Features and Kernel Functions , 2009, AIME.

[13]  J. Harrell,et al.  Estrogen induces c-Kit and an aggressive phenotype in a model of invasive lobular breast cancer , 2017, Oncogenesis.

[14]  Andrew H. Beck,et al.  Report on computational assessment of Tumor Infiltrating Lymphocytes from the International Immuno-Oncology Biomarker Working Group , 2020, npj Breast Cancer.

[15]  N. Razavian,et al.  Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning , 2018, Nature Medicine.

[16]  Wojciech Samek,et al.  Methods for interpreting and understanding deep neural networks , 2017, Digit. Signal Process..

[17]  Motoaki Kawanabe,et al.  Enhanced representation and multi-task learning for image annotation , 2013, Comput. Vis. Image Underst..

[18]  Ying Jiang,et al.  Foxo3a Expression Is a Prognostic Marker in Breast Cancer , 2013, PloS one.

[19]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[20]  Yinyin Yuan Spatial Heterogeneity in the Tumor Microenvironment. , 2016, Cold Spring Harbor perspectives in medicine.

[21]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.

[22]  Nicholas Ayache,et al.  Endomicroscopic video retrieval using mosaicing and visualwords , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[23]  Heikki Joensuu,et al.  Tumor-Infiltrating Lymphocytes and Prognosis: A Pooled Individual Patient Analysis of Early-Stage Triple-Negative Breast Cancers. , 2019, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[24]  K-R Müller,et al.  Scoring of tumor-infiltrating lymphocytes: From visual estimation to machine learning. , 2018, Seminars in cancer biology.

[25]  Michael Y. Gerner,et al.  Histo-cytometry: a method for highly multiplex quantitative tissue imaging analysis applied to dendritic cell subset microanatomy in lymph nodes. , 2012, Immunity.

[26]  A. Gown,et al.  The path to a better biomarker: application of a risk management framework for the implementation of PD‐L1 and TILs as immuno‐oncology biomarkers in breast cancer clinical trials and daily practice , 2020, The Journal of pathology.

[27]  Cheng Soon Ong,et al.  Multiclass multiple kernel learning , 2007, ICML '07.

[28]  Alexander Binder,et al.  On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation , 2015, PloS one.

[29]  Klaus-Robert Müller,et al.  Towards Explainable Artificial Intelligence , 2019, Explainable AI.

[30]  P. Diggle,et al.  Spatial point pattern analysis and its application in geographical epidemiology , 1996 .

[31]  A. Madabhushi,et al.  Histopathological Image Analysis: A Review , 2009, IEEE Reviews in Biomedical Engineering.

[32]  Wojciech Samek,et al.  Explainable ai – preface , 2019 .

[33]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[34]  D. Rimm Next-gen immunohistochemistry , 2014, Nature Methods.

[35]  Wojciech Samek,et al.  Toward Interpretable Machine Learning: Transparent Deep Neural Networks and Beyond , 2020, ArXiv.

[36]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[37]  Gunnar Rätsch,et al.  An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.

[38]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.