ndmaSNF: cancer subtype discovery based on integrative framework assisted by network diffusion model

Recently, with the rapid progress of high-throughput sequencing technology, diverse genomic data are easy to be obtained. To effectively exploit the value of those data, integrative methods are urgently needed. In this paper, based on SNF (Similarity Network Diffusion) [1], we proposed a new integrative method named ndmaSNF (network diffusion model assisted SNF), which can be used for cancer subtype discovery with the advantage of making use of somatic mutation data and other discrete data. Firstly, we incorporate network diffusion model on mutation data to make it smoothed and adaptive. Then, the mutation data along with other data types are utilized in the SNF framework by constructing patient-by-patient similarity networks for each data type. Finally, a fused patient network containing all the information from different input data types is obtained by using a nonlinear iterative method. The fused network can be used for cancer subtype discovery through the clustering algorithm. Experimental results on four cancer datasets showed that our ndmaSNF method can find subtypes with significant differences in the survival profile and other clinical features.

[1]  Roded Sharan,et al.  Associating Genes and Protein Complexes with Disease via Network Propagation , 2010, PLoS Comput. Biol..

[2]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[3]  Hongjuan Zhao,et al.  TP53 mutation status and gene expression profiles are powerful prognostic markers of breast cancer , 2007, Breast Cancer Research.

[4]  Javed Siddiqui,et al.  Activating ESR1 mutations in hormone-resistant metastatic breast cancer , 2013, Nature Genetics.

[5]  Etienne Rouleau,et al.  PIK3R1 underexpression is an independent prognostic marker in breast cancer , 2013, BMC Cancer.

[6]  Bin Gu,et al.  Incremental Support Vector Learning for Ordinal Regression , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Andrew M. Gross,et al.  Network-based stratification of tumor mutations , 2013, Nature Methods.

[8]  R. Nusse,et al.  The Wnt signaling pathway in development and disease. , 2004, Annual review of cell and developmental biology.

[9]  K. Jirström,et al.  Association between Pak1 expression and subcellular localization and tamoxifen resistance in breast cancer patients. , 2006, Journal of the National Cancer Institute.

[10]  Carsten Denkert,et al.  Differential expression of histone deacetylases HDAC1, 2 and 3 in human breast cancer - overexpression of HDAC2 and HDAC3 is associated with clinicopathological indicators of disease progression , 2013, BMC Cancer.

[11]  Zhongming Zhao,et al.  GenRev: exploring functional relevance of genes in molecular networks. , 2012, Genomics.

[12]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[13]  Bin Gu,et al.  A Robust Regularization Path Algorithm for $\nu $ -Support Vector Classification , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Yijie Wang,et al.  Transcriptional factors p300 and MRTF-A synergistically enhance the expression of migration-related genes in MCF-7 breast cancer cells. , 2015, Biochemical and biophysical research communications.

[15]  T. Crook,et al.  The p53 pathway in breast cancer , 2002, Breast Cancer Research.

[16]  Michael Q. Zhang,et al.  Fast dimension reduction and integrative clustering of multi-omics data using low-rank approximation: application to cancer molecular classification , 2015, BMC Genomics.

[17]  Yi Shen,et al.  Growth and motility inhibition of breast cancer cells by epidermal growth factor receptor degradation is correlated with inactivation of Cdc42. , 2006, Cancer research.

[18]  H. Savlı,et al.  Identification of ApoA1, HPX and POTEE genes by omic analysis in breast cancer. , 2014, Oncology reports.

[19]  D. Conrad,et al.  The role of Tyk2 in regulation of breast cancer growth. , 2011, Journal of interferon & cytokine research : the official journal of the International Society for Interferon and Cytokine Research.

[20]  Jonathan W. Pillow,et al.  POSTER PRESENTATION Open Access , 2013 .

[21]  S. Eschrich,et al.  Persistent Activation of Stat3 Signaling Induces Survivin Gene Expression and Confers Resistance to Apoptosis in Human Breast Cancer Cells , 2006, Clinical Cancer Research.

[22]  P. ten Dijke,et al.  The tumor suppressor Smad4 is required for transforming growth factor beta-induced epithelial to mesenchymal transition and bone metastasis of breast cancer cells. , 2006, Cancer research.

[23]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[24]  Junnian Zheng,et al.  BRG1 Is a Prognostic Marker and Potential Therapeutic Target in Human Breast Cancer , 2013, PloS one.

[25]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[26]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[27]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[28]  Hyun-soo Cho,et al.  Structure of the Extracellular Region of HER3 Reveals an Interdomain Tether , 2002, Science.

[29]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[30]  Oliver E. Sturm,et al.  Computational modelling of the receptor-tyrosine-kinase-activated MAPK pathway. , 2005, The Biochemical journal.

[31]  Jiguo Yu,et al.  Block-Constraint Robust Principal Component Analysis and its Application to Integrated Analysis of TCGA Data , 2016, IEEE Transactions on NanoBioscience.