CAMR: cross-aligned multimodal representation learning for cancer survival prediction

Abstract Motivation Accurately predicting cancer survival is crucial for helping clinicians to plan appropriate treatments, which largely improves the life quality of cancer patients and spares the related medical costs. Recent advances in survival prediction methods suggest that integrating complementary information from different modalities, e.g. histopathological images and genomic data, plays a key role in enhancing predictive performance. Despite promising results obtained by existing multimodal methods, the disparate and heterogeneous characteristics of multimodal data cause the so-called modality gap problem, which brings in dramatically diverse modality representations in feature space. Consequently, detrimental modality gaps make it difficult for comprehensive integration of multimodal information via representation learning and therefore pose a great challenge to further improvements of cancer survival prediction. Results To solve the above problems, we propose a novel method called cross-aligned multimodal representation learning (CAMR), which generates both modality-invariant and -specific representations for more accurate cancer survival prediction. Specifically, a cross-modality representation alignment learning network is introduced to reduce modality gaps by effectively learning modality-invariant representations in a common subspace, which is achieved by aligning the distributions of different modality representations through adversarial training. Besides, we adopt a cross-modality fusion module to fuse modality-invariant representations into a unified cross-modality representation for each patient. Meanwhile, CAMR learns modality-specific representations which complement modality-invariant representations and therefore provides a holistic view of the multimodal data for cancer survival prediction. Comprehensive experiment results demonstrate that CAMR can successfully narrow modality gaps and consistently yields better performance than other survival prediction methods using multimodal data. Availability and implementation CAMR is freely available at https://github.com/wxq-ustc/CAMR. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Yu Zhang,et al.  Adaptive risk-aware sharable and individual subspace learning for cancer survival analysis with multi-modality data , 2022, Briefings Bioinform..

[2]  A. McPherson,et al.  Multimodal data integration using machine learning improves risk stratification of high-grade serous ovarian cancer , 2022, Nature Cancer.

[3]  Donglin Di,et al.  Big-Hypergraph Factorization Neural Network for Survival Prediction From Whole Slide Image , 2022, IEEE Transactions on Image Processing.

[4]  Qianjin Feng,et al.  Multi-Constraint Latent Representation Learning for Prognosis Analysis Using Multi-Modal Data , 2021, IEEE Transactions on Neural Networks and Learning Systems.

[5]  Chao Tu,et al.  Relation-Aware Shared Representation Learning for Cancer Prognosis Analysis With Auxiliary Clinical Variables and Incomplete Multi-Modality Data , 2021, IEEE Transactions on Medical Imaging.

[6]  Kun Huang,et al.  Weakly Supervised Deep Ordinal Cox Model for Survival Prediction From Whole-Slide Pathological Images , 2021, IEEE Transactions on Medical Imaging.

[7]  Jianliang Gao,et al.  Predicting the Survival of Cancer Patients With Multimodal Graph Neural Network , 2021, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  Ao Li,et al.  GPDBN: deep bilinear network integrating both genomic data and pathological images for breast cancer prognosis prediction , 2021, Bioinform..

[9]  Minh N. Do,et al.  Multimodal Fusion Using Sparse Cca For Breast Cancer Survival Prediction , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[10]  Yu Wang,et al.  Deep Semisupervised Multitask Learning Model and Its Interpretability for Survival Analysis , 2021, IEEE Journal of Biomedical and Health Informatics.

[11]  M. Westerhoff,et al.  Two-stage Cox-nnet: biologically interpretable neural-network model for prognosis prediction and its application in liver cancer survival using histopathology and transcriptomic data , 2021, NAR genomics and bioinformatics.

[12]  Heng Tao Shen,et al.  Learning Cross-Modal Common Representations by Private–Shared Subspaces Separation , 2020, IEEE Transactions on Cybernetics.

[13]  Jitendra Jonnagaddala,et al.  Whole slide images based cancer survival prediction using attention guided deep multiple instance learning networks , 2020, Medical Image Anal..

[14]  Yuhualei Pan,et al.  A five‐microRNA signature for individualized prognosis evaluation and radiotherapy guidance in patients with diffuse lower‐grade glioma , 2020, Journal of cellular and molecular medicine.

[15]  Roger Zimmermann,et al.  MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis , 2020, ACM Multimedia.

[16]  T. Yankeelov,et al.  Deep learning-based classification of breast cancer cells using transmembrane receptor dynamics , 2020 .

[17]  Qing Xiao,et al.  Integrative analysis of cross-modal features for the prognosis prediction of clear cell renal cell carcinoma , 2020, Bioinform..

[18]  Zhenchang Wang,et al.  Emergence in protein derived nanomedicine as anticancer therapeutics: More than a tour de force. , 2020, Seminars in cancer biology.

[19]  Jun Cheng,et al.  Integrative Analysis of Pathological Images and Multi-Dimensional Genomic Data for Early-Stage Cancer Prognosis , 2020, IEEE Transactions on Medical Imaging.

[20]  Ming Y. Lu,et al.  Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis , 2019, IEEE Transactions on Medical Imaging.

[21]  Seungyeoun Lee,et al.  Review of statistical methods for survival analysis using genomic data , 2019, Genomics & informatics.

[22]  Haifeng Hu,et al.  Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion , 2019, AAAI.

[23]  Ntikurako Guy-Fernand Kazihise,et al.  MLW-gcForest: a multi-weighted gcForest model towards the staging of lung adenocarcinoma based on multi-modal genetic data , 2019, BMC Bioinformatics.

[24]  Ao Li,et al.  LSCDFS-MKL: A multiple kernel based method for lung squamous cell carcinomas disease-free survival prediction with pathological and genomic data , 2019, J. Biomed. Informatics.

[25]  Dongdong Sun,et al.  A Multimodal Deep Neural Network for Human Breast Cancer Prognosis Prediction by Integrating Multi-Dimensional Data , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[26]  Olivier Gevaert,et al.  Deep learning with multimodal representation for pancancer prognosis prediction , 2019, bioRxiv.

[27]  Maher Rizkalla,et al.  SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer , 2019, Front. Genet..

[28]  Tieliu Shi,et al.  Deep Learning-Based Multi-Omics Data Integration Reveals Two Prognostic Subtypes in High-Risk Neuroblastoma , 2018, Front. Genet..

[29]  Yi Xie,et al.  Breast cancer prognosis signature: linking risk stratification to disease subtypes , 2018, Briefings Bioinform..

[30]  Dongdong Sun,et al.  Integrating genomic data and pathological images to effectively predict breast cancer clinical outcome , 2018, Comput. Methods Programs Biomed..

[31]  Louis-Philippe Morency,et al.  Efficient Low-rank Multimodal Fusion With Modality-Specific Factors , 2018, ACL.

[32]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[33]  Qianjin Feng,et al.  Integrative Analysis of Histopathological Images and Genomic Data Predicts Clear Cell Renal Cell Carcinoma Prognosis. , 2017, Cancer research.

[34]  Yang Yang,et al.  Adversarial Cross-Modal Retrieval , 2017, ACM Multimedia.

[35]  D. Brat,et al.  Predicting cancer outcomes from histology and genomics using convolutional networks , 2017, Proceedings of the National Academy of Sciences.

[36]  Junzhou Huang,et al.  WSISA: Making Survival Prediction from Whole Slide Histopathological Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Erik Cambria,et al.  Tensor Fusion Network for Multimodal Sentiment Analysis , 2017, EMNLP.

[38]  Louis-Philippe Morency,et al.  Multimodal Machine Learning: A Survey and Taxonomy , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Jin Gu,et al.  Evaluating the molecule-based prediction of clinical drug responses in cancer , 2016, Bioinform..

[40]  Ce Zhang,et al.  Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features , 2016, Nature Communications.

[41]  Tao Mei,et al.  Jointly Modeling Embedding and Translation to Bridge Video and Language , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Wei Chen,et al.  Jointly Modeling Deep Video and Compositional Text to Bridge Vision and Language in a Unified Framework , 2015, AAAI.

[43]  Peng Qiu,et al.  TCGA-Assembler: open-source software for retrieving and processing TCGA data , 2014, Nature Methods.

[44]  F. Markowetz,et al.  Quantitative Image Analysis of Cellular Heterogeneity in Breast Tumors Complements Genomic Profiling , 2012, Science Translational Medicine.

[45]  Anne-Laure Boulesteix,et al.  Survival prediction using gene expression data: A review and comparison , 2009, Comput. Stat. Data Anal..

[46]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[47]  Anne E Carpenter,et al.  CellProfiler: image analysis software for identifying and quantifying cell phenotypes , 2006, Genome Biology.

[48]  Bart De Moor,et al.  Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks , 2006, ISMB.

[49]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[50]  H. Zou,et al.  A cocktail algorithm for solving the elastic net penalized Cox’s regression in high dimensions , 2013 .