Attention-Based Multi-NMF Deep Neural Network with Multimodality Data for Breast Cancer Prognosis Model

Today, it has become a hot issue in cancer research to make precise prognostic prediction for breast cancer patients, which can not only effectively avoid overtreatment and medical resources waste, but also provide scientific basis to help medical staff and patients family members to make right medical decisions. As well known, cancer is a partly inherited disease with various important biological markers, especially the gene expression profile data and clinical data. Therefore, the accuracy of prediction model can be improved by integrating gene expression profile data and clinical data. In this paper, we proposed an end-to-end model, Attention-based Multi-NMF DNN (AMND), which combines clinical data and gene expression data extracted by Multiple Nonnegative Matrix Factorization algorithms (Multi-NMF) for the prognostic prediction of breast cancer. The innovation of this method is highlighted through using clinical data and combining multiple feature selection methods with the help of Attention mechanism. The results of comprehensive performance evaluation show that the proposed model reports better predictive performances than either models only using data of single modality, e.g., gene or clinical, or models based on any single NMF improved methods which only use one of the NMF algorithms to extract features. The performance of our model is competitive or even better than other previously reported models. Meanwhile, AMND can be extended to the survival prediction of other cancer diseases, providing a new strategy for breast cancer prognostic prediction.

[1]  W. Coleman,et al.  Molecular and cellular heterogeneity in breast cancer: challenges for personalized medicine. , 2013, The American journal of pathology.

[2]  J S Hoch,et al.  Cost analysis of breast cancer diagnostic assessment programs. , 2017, Current oncology.

[3]  Eduardo D. Sontag,et al.  Neural Networks with Quadratic VC Dimension , 1995, J. Comput. Syst. Sci..

[4]  Xiaofeng Dai,et al.  Breast Cancer Cell Line Classification and Its Relevance with Breast Tumor Subtyping , 2017, Journal of Cancer.

[5]  Xiao Liu,et al.  Hessian regularization based non-negative matrix factorization for gene expression data clustering , 2015, 2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[6]  Neil P. Jerome,et al.  Support vector machine for breast cancer classification using diffusion‐weighted MRI histogram features: Preliminary study , 2018, Journal of magnetic resonance imaging : JMRI.

[7]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[8]  A. Jemal,et al.  Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries , 2018, CA: a cancer journal for clinicians.

[9]  Uwe Siebert,et al.  Personalized treatment of women with early breast cancer: a risk-group specific cost-effectiveness analysis of adjuvant chemotherapy accounting for companion prognostic tests OncotypeDX and Adjuvant!Online , 2017, BMC Cancer.

[10]  Gunhee Lee,et al.  Classification of Genes Based on Age-Related Differential Expression in Breast Cancer , 2017, Genomics & informatics.

[11]  Yudong D. He,et al.  A Gene-Expression Signature as a Predictor of Survival in Breast Cancer , 2002 .

[12]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[13]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[14]  Jan Koster,et al.  Association between gene expression profile of the primary tumor and chemotherapy response of metastatic breast cancer , 2017, BMC Cancer.

[15]  Yoshua Bengio,et al.  Attention-Based Models for Speech Recognition , 2015, NIPS.

[16]  Stephen A. Vavasis,et al.  On the Complexity of Nonnegative Matrix Factorization , 2007, SIAM J. Optim..

[17]  Nedialko S. Nedialkov,et al.  Probabilistic Graphical Models and Deep Belief Networks for Prognosis of Breast Cancer , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[18]  Ya Zhang,et al.  A gene signature for breast cancer prognosis using support vector machine , 2012, 2012 5th International Conference on BioMedical Engineering and Informatics.

[19]  Christos Boutsidis,et al.  SVD based initialization: A head start for nonnegative matrix factorization , 2008, Pattern Recognit..

[20]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[21]  Gregory J. Wolff,et al.  Optimal Brain Surgeon and general network pruning , 1993, IEEE International Conference on Neural Networks.

[22]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[23]  Li Liu,et al.  Improved breast cancer prognosis through the combination of clinical and genetic markers , 2007, Bioinform..

[24]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[25]  Bart De Moor,et al.  Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks , 2006, ISMB.

[26]  Shu Ichihara,et al.  Breast cancer prognostic classification in the molecular era: the role of histological grade , 2010, Breast Cancer Research.

[27]  Vessela N. Kristensen,et al.  Prognostic value of PAM50 and risk of recurrence score in patients with early-stage breast cancer with long-term follow-up , 2017, Breast Cancer Research.

[28]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[29]  Belhassen Bayar,et al.  Probabilistic Non-Negative Matrix Factorization: Theory and Application to microarray Data Analysis , 2014, J. Bioinform. Comput. Biol..

[30]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.