Multi-modal learning with missing data for cancer diagnosis using histopathological and genomic data

Multi-modal learning (e.g., integrating pathological images with genomic features) tends to improve the accuracy of cancer diagnosis and prognosis as compared to learning with a single modality. However, missing data is a common problem in clinical practice, i.e., not every patient has all modalities available. Most of the previous works directly discarded samples with missing modalities, which might lose information in these data and increase the likelihood of overfitting. In this work, we generalize the multi-modal learning in cancer diagnosis with the capacity of dealing with missing data using histological images and genomic data. Our integrated model can utilize all available data from patients with both complete and partial modalities. The experiments on the public TCGA-GBM and TCGA-LGG datasets show that the data with missing modalities can contribute to multi-modal learning, which improvesthe model performance in grade classification of glioma cancer.

[1]  Ao Li,et al.  GPDBN: deep bilinear network integrating both genomic data and pathological images for breast cancer prognosis prediction , 2021, Bioinform..

[2]  Minh N. Do,et al.  Multimodal Fusion Using Sparse Cca For Breast Cancer Survival Prediction , 2021, 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI).

[3]  Liang Sun,et al.  Multi-task multi-modal learning for joint diagnosis and prognosis of human cancers , 2020, Medical Image Anal..

[4]  Hong Qin,et al.  A Novel Pathological Images and Genomic Data Fusion Framework for Breast Cancer Survival Prediction , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[5]  Karl Rohr,et al.  Pan-Cancer Prognosis Prediction Using Multimodal Deep Learning , 2020, 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI).

[6]  Ming Y. Lu,et al.  Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis , 2019, IEEE Transactions on Medical Imaging.

[7]  Olivier Gevaert,et al.  Deep learning with multimodal representation for pancancer prognosis prediction , 2019, bioRxiv.

[8]  B. Engelhardt,et al.  Joint analysis of expression levels and histological images identifies genes associated with tissue morphology , 2018, Nature Communications.

[9]  Jian Ma,et al.  Correlating cellular features with gene expression using CCA , 2018, 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018).

[10]  D. Brat,et al.  Predicting cancer outcomes from histology and genomics using convolutional networks , 2017, Proceedings of the National Academy of Sciences.

[11]  Junzhou Huang,et al.  Deep Correlational Learning for Survival Prediction from Multi-modality Data , 2017, MICCAI.

[12]  Sepp Hochreiter,et al.  Self-Normalizing Neural Networks , 2017, NIPS.