Survival Prediction via Hierarchical Multimodal Co-Attention Transformer: A Computational Histology-Radiology Solution

The rapid advances in deep learning-based computational pathology and radiology have demonstrated the promise of using whole slide images (WSIs) and radiology images for survival prediction in cancer patients. However, most image-based survival prediction methods are limited to using either histology or radiology alone, leaving integrated approaches across histology and radiology relatively underdeveloped. There are two main challenges in integrating WSIs and radiology images: (1) the gigapixel nature of WSIs and (2) the vast difference in spatial scales between WSIs and radiology images. To address these challenges, in this work, we propose an interpretable, weakly-supervised, multimodal learning framework, called Hierarchical Multimodal Co-Attention Transformer (HMCAT), to integrate WSIs and radiology images for survival prediction. Our approach first uses hierarchical feature extractors to capture various information including cellular features, cellular organization, and tissue phenotypes in WSIs. Then the hierarchical radiology-guided co- attention (HRCA) in HMCAT characterizes the multimodal interactions between hierarchical histology-based visual concepts and radiology features and learns hierarchical co- attention mappings for two modalities. Finally, HMCAT combines their complementary information into a multimodal risk score and discovers prognostic features from two modalities by multimodal interpretability. We apply our approach to two cancer datasets (365 WSIs with matched magnetic resonance [MR] images and 213 WSIs with matched computed tomography [CT] images). Our results demonstrate that the proposed HMCAT consistently achieves superior performance over the unimodal approaches trained on either histology or radiology data alone, as well as other state-of-the-art methods.

[1]  R. G. Krishnan,et al.  Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  W. Wang,et al.  Predicting peritoneal recurrence and disease-free survival from CT images in gastric cancer with multitask deep learning: a retrospective study. , 2022, The Lancet. Digital health.

[3]  Liansheng Wang,et al.  CT-Guided Survival Prediction of Esophageal Cancer , 2021, IEEE Journal of Biomedical and Health Informatics.

[4]  S. Shah,et al.  Harnessing multimodal data integration to advance precision oncology , 2021, Nature Reviews Cancer.

[5]  Ming Y. Lu,et al.  Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[6]  Carola-Bibiane Schönlieb,et al.  Radiological tumor classification across imaging modality and histology , 2021, Nature Machine Intelligence.

[7]  Martin C. Stumpe,et al.  Deep Orthogonal Fusion: Multimodal Prognostic Biomarker Discovery Integrating Radiology, Pathology, Genomic, and Clinical Data , 2021, MICCAI.

[8]  Jing Xiao,et al.  DeepPrognosis: Preoperative prediction of pancreatic cancer survival and surgical margin via comprehensive understanding of dynamic contrast-enhanced CT imaging and tumor-vascular contact parsing , 2021, Medical Image Anal..

[9]  Ruijiang Li,et al.  Radiographical assessment of tumour stroma and treatment outcomes using deep learning: a retrospective, multicohort study. , 2021, The Lancet. Digital health.

[10]  Julien Mairal,et al.  Emerging Properties in Self-Supervised Vision Transformers , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[11]  Di Dong,et al.  Joint Multi-Task Learning for Survival Prediction of Gastric Cancer Patients using CT Images , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[12]  Andrew H. Beck,et al.  Human-interpretable image features derived from densely mapped cancer pathology slides predict diverse molecular phenotypes , 2021, Nature Communications.

[13]  Ming Y. Lu,et al.  Pan-Cancer Integrative Histology-Genomic Analysis via Interpretable Multimodal Deep Learning , 2021, Poster Presentations - Proffered Abstracts.

[14]  K. Eliceiri,et al.  Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Jitendra Jonnagaddala,et al.  Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks , 2020, Medical Image Anal..

[16]  Ming Y. Lu,et al.  AI-based pathology predicts origins for cancers of unknown primary , 2020, Nature.

[17]  Nasir Rajpoot,et al.  Capturing Cellular Topology in Multi-Gigapixel Pathology Images , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[18]  Fan Yang,et al.  Predicting Lymph Node Metastasis Using Histopathological Images Based on Multiple Instance Learning With Deep Graph Convolution , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Ling Shao,et al.  M2Net: Multi-modal Multi-channel Network for Overall Survival Time Prediction of Brain Tumor Patients , 2020, MICCAI.

[20]  Ming Y. Lu,et al.  Data-efficient and weakly supervised computational pathology on whole-slide images , 2020, Nature Biomedical Engineering.

[21]  Matthias Schmid,et al.  Bias in Cross-Entropy-Based Training of Deep Survival Networks , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Jun Cheng,et al.  Integrative Analysis of Pathological Images and Multi-Dimensional Genomic Data for Early-Stage Cancer Prognosis , 2020, IEEE Transactions on Medical Imaging.

[23]  Ming Y. Lu,et al.  Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis , 2019, IEEE Transactions on Medical Imaging.

[24]  Ellery Wulczyn,et al.  Deep learning-based survival prediction for multiple cancer types using histopathology images , 2019, PloS one.

[25]  G. Wainrib,et al.  Deep learning-based classification of mesothelioma improves prediction of patient outcome , 2019, Nature Medicine.

[26]  Angela Lin,et al.  Multimodal Multitask Representation Learning for Pathology Biobank Metadata Prediction , 2019, ArXiv.

[27]  Thomas J. Fuchs,et al.  Clinical-grade computational pathology using weakly supervised deep learning on whole slide images , 2019, Nature Medicine.

[28]  C. Sautès-Fridman,et al.  Tertiary lymphoid structures in the era of cancer immunotherapy , 2019, Nature Reviews Cancer.

[29]  Jasjit S. Suri,et al.  The present and future of deep learning in radiology. , 2019, European journal of radiology.

[30]  Olivier Gevaert,et al.  Deep learning with multimodal representation for pancancer prognosis prediction , 2019, bioRxiv.

[31]  Shaoqun Zeng,et al.  From Detection of Individual Metastases to Classification of Lymph Node Status at the Patient Level: The CAMELYON17 Challenge , 2019, IEEE Transactions on Medical Imaging.

[32]  R. Gillies,et al.  Repeatability and Reproducibility of Radiomic Features: A Systematic Review , 2018, International journal of radiation oncology, biology, physics.

[33]  C. Hutter,et al.  The Cancer Genome Atlas: Creating Lasting Value beyond Its Data , 2018, Cell.

[34]  Max Welling,et al.  Attention-based Deep Multiple Instance Learning , 2018, ICML.

[35]  Andrew H. Beck,et al.  Diagnostic Assessment of Deep Learning Algorithms for Detection of Lymph Node Metastases in Women With Breast Cancer , 2017, JAMA.

[36]  Andriy Fedorov,et al.  Computational Radiomics System to Decode the Radiographic Phenotype. , 2017, Cancer research.

[37]  D. Brat,et al.  Predicting cancer outcomes from histology and genomics using convolutional networks , 2017, Proceedings of the National Academy of Sciences.

[38]  Sepp Hochreiter,et al.  Self-Normalizing Neural Networks , 2017, NIPS.

[39]  Guido Gerig,et al.  ITK-SNAP: An interactive tool for semi-automatic segmentation of multi-modality biomedical images , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[40]  Paul Kinahan,et al.  Radiomics: Images Are More than Pictures, They Are Data , 2015, Radiology.

[41]  Benjamin Haibe-Kains,et al.  A comparative study of survival models for breast cancer prognostication based on microarray data: does a single gene beat them all? , 2008, Bioinform..

[42]  Douglas G Altman,et al.  The logrank test , 2004, BMJ : British Medical Journal.

[43]  Z. Ying,et al.  Checking the Cox model with cumulative sums of martingale-based residuals , 1993 .

[44]  Christian Desrosiers,et al.  Predicting the Gene Status and Survival Outcome of Lower Grade Glioma Patients With Multimodal MRI Features , 2019, IEEE Access.