Deep multi-modal fusion network with gated unit for breast cancer survival prediction.

Accurate survival prediction is a critical goal in the prognosis of breast cancer patients because it can help physicians make more patient-friendly decisions and further guide appropriate treatment. Breast cancer is often caused by genetic abnormalities, which prompts researchers to consider information such as gene expression and copy number variation in addition to clinical data in their studies. The integration of these multi-modal data can improve the predictive power of models. However, with the highly unbalanced information of breast cancer patient data, it becomes a new challenge for breast cancer patient survival prediction to fully extract the characteristic information of these multi-modal data and to consider the complementarity of this information. To this end, we propose a deep multi-modal fusion network (DMMFN) to predict the five-year survival of breast cancer patients by integrating clinical data, copy number variation data, and gene expression data. The imbalanced dataset is first processed using the oversampling method SMOTE-NC. Then the abstract modal features of the multi-modal data are extracted by the two-layer one-dimensional convolutional neural network and the bi-directional long short-term memory network. Next, the weight coefficients of each modal data are dynamically adjusted using gated multimodal units to obtain fusion features. Finally, the fusion features are fed into the MaxoutMLP classifier to obtain the final prediction results. We conducted experiments on the METABRIC dataset to verify the validity of the multi-modal data and compared it with other methods. The comprehensive performance evaluation shows that DMMFN has better prediction performance.

[1]  H. Zhang,et al.  A Novel Deep Learning Method to Predict Lung Cancer Long-Term Survival With Biological Knowledge Incorporated Gene Expression Images and Clinical Data , 2022, Frontiers in Genetics.

[2]  Shoubin Dong,et al.  A Multi-modal Fusion Framework Based on Multi-task Correlation Learning for Cancer Prognosis Prediction , 2022, Artif. Intell. Medicine.

[3]  Inderveer Chana,et al.  BSense: A parallel Bayesian hyperparameter optimized Stacked ensemble model for breast cancer survival prediction , 2022, J. Comput. Sci..

[4]  Mostafa Reisi Gahrooei,et al.  Multimodal data fusion for systems improvement: A review , 2021, IISE Transactions.

[5]  M. Gupta,et al.  A Comparative Analysis of Deep Learning Approaches for Predicting Breast Cancer Survivability , 2021, Archives of Computational Methods in Engineering.

[6]  D. Fotiadis,et al.  Applied machine learning in cancer research: A systematic review for patient diagnosis, classification and prognosis , 2021, Computational and structural biotechnology journal.

[7]  Shuihua Wang,et al.  A systematic survey of deep learning in breast cancer , 2021, Int. J. Intell. Syst..

[8]  Yi Zhang,et al.  An Ovarian Cancer Susceptible Gene Prediction Method Based on Deep Learning Methods , 2021, Frontiers in Cell and Developmental Biology.

[9]  J. Lee,et al.  Deep Learning-Based Prediction Model for Breast Cancer Recurrence Using Adjuvant Breast Cancer Cohort in Tertiary Cancer Center Registry , 2021, Frontiers in Oncology.

[10]  Sriparna Saha,et al.  Multi-modal advanced deep learning architectures for breast cancer survival prediction , 2021, Knowl. Based Syst..

[11]  Isabelle Bichindaritz,et al.  Integrative survival analysis of breast cancer with gene expression and DNA methylation data , 2021, Bioinform..

[12]  Sahar A. ElRahman,et al.  Predicting breast cancer survivability based on machine learning and features selection algorithms: a comparative study , 2020, Journal of Ambient Intelligence and Humanized Computing.

[13]  Nashwa El-Bendary,et al.  A feature-fusion framework of clinical, genomics, and histopathological data for METABRIC breast cancer subtype classification , 2020, Appl. Soft Comput..

[14]  S. Rauschert,et al.  Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification , 2020, Clinical Epigenetics.

[15]  Xiangqian Guo,et al.  The Application of Deep Learning in Cancer Prognosis Prediction , 2020, Cancers.

[16]  Fabio A. González,et al.  Gated multimodal networks , 2020, Neural Computing and Applications.

[17]  Chaofeng Li,et al.  A deep survival analysis method based on ranking , 2019, Artif. Intell. Medicine.

[18]  Maher Rizkalla,et al.  SALMON: Survival Analysis Learning With Multi-Omics Neural Networks on Breast Cancer , 2019, Front. Genet..

[19]  Xun Zhu,et al.  Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data , 2018, PLoS Comput. Biol..

[20]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[21]  Verónica Bolón-Canedo,et al.  Fast‐mRMR: Fast Minimum Redundancy Maximum Relevance Algorithm for High‐Dimensional Big Data , 2017, Int. J. Intell. Syst..

[22]  Kun-Huang Chen,et al.  A hybrid classifier combining SMOTE with PSO to estimate 5-year survivability of breast cancer patients , 2014, Appl. Soft Comput..

[23]  F. Markowetz,et al.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups , 2012, Nature.

[24]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[25]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[26]  Ali Kashif Bashir,et al.  A Comprehensive Review on Medical Diagnosis Using Machine Learning , 2021, Computers, Materials & Continua.

[27]  Mete Celik,et al.  Hybrid models based on genetic algorithm and deep learning algorithms for nutritional Anemia disease classification , 2021, Biomed. Signal Process. Control..

[28]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..