NIBNA: a network-based node importance approach for identifying breast cancer drivers

MOTIVATION Identifying meaningful cancer driver genes in a cohort of tumors is a challenging task in cancer genomics. Although existing studies have identified known cancer drivers, most of them focus on detecting coding drivers with mutations. It is acknowledged that non-coding drivers can regulate driver mutations to promote cancer growth. In this work, we propose a novel node importance based network analysis (NIBNA) framework to detect coding and non-coding cancer drivers. We hypothesize that cancer drivers are crucial to the formation of community structures in cancer network, and removing them from the network greatly perturbs the network structure thereby critically affecting the functioning of the network. NIBNA detects cancer drivers using a three-step process; first, a condition-specific network is built by incorporating gene expression data and gene networks, second, the community structures in the network are estimated and third, a centrality-based metric is applied to compute node importance. RESULTS We apply NIBNA to the BRCA dataset and it outperforms existing state-of-art methods in detecting coding cancer drivers. NIBNA also predicts 265 miRNA drivers and majority of these drivers have been validated in literature. Further we apply NIBNA to detect cancer subtype-specific drivers and several predicted drivers have been validated to be associated with cancer subtypes. Lastly, we evaluate NIBNA's performance in detecting epithelial-mesenchymal transition (EMT) drivers, and we confirmed 8 coding and 13 miRNA drivers in the list of known genes. AVAILABILITY The source code can be accessed at: https://github.com/mandarsc/NIBNA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Ga-Hang Lee,et al.  FYN promotes mesenchymal phenotypes of basal type breast cancer cells through STAT5/NOTCH2 signaling node , 2018, Oncogene.

[2]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[3]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer genes , 2014 .

[4]  Ying Fan,et al.  Identifying and Characterizing Nodes Important to Community Structure Using the Spectrum of the Graph , 2011, PloS one.

[5]  J. Bonneterre,et al.  Plasma insulin-like growth factor-1 (IGF-1) concentrations in human breast cancer. , 1993, European journal of cancer.

[6]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[7]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[8]  G. Goodall,et al.  The miR-200 family and miR-205 regulate epithelial to mesenchymal transition by targeting ZEB1 and SIP1 , 2008, Nature Cell Biology.

[9]  H. Dweep,et al.  miRWalk2.0: a comprehensive atlas of microRNA-target interactions , 2015, Nature Methods.

[10]  Edward Ott,et al.  Characterizing the dynamical importance of network nodes and links. , 2006, Physical review letters.

[11]  J. Ingle,et al.  Differential Gene Expression of TGFβ Inducible Early Gene (TIEG), Smad7, Smad2 and Bard1 in Normal and Malignant Breast Tissue , 2004, Breast Cancer Research and Treatment.

[12]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[13]  Mithat Gönen,et al.  The JAK2/STAT3 signaling pathway is required for growth of CD44⁺CD24⁻ stem cell-like breast cancer cells in human tumors. , 2011, The Journal of clinical investigation.

[14]  Steve Harenberg,et al.  Community detection in large‐scale networks: a survey and empirical evaluation , 2014 .

[15]  E. Ott,et al.  Spectral properties of networks with community structure. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Sun-Mi Park,et al.  The miR-200 family determines the epithelial phenotype of cancer cells by targeting the E-cadherin repressors ZEB1 and ZEB2. , 2008, Genes & development.

[17]  A. Nobel,et al.  Supervised risk predictor of breast cancer based on intrinsic subtypes. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[18]  A. Vinayagam,et al.  A Directed Protein Interaction Network for Investigating Intracellular Signal Transduction , 2011, Science Signaling.

[19]  Finn Drabløs,et al.  Update of the FANTOM web resource: high resolution transcriptome of diverse cell types in mammals , 2016, Nucleic Acids Res..

[20]  Athanasios Fevgas,et al.  DIANA-TarBase v7.0: indexing more than half a million experimentally supported miRNA:mRNA interactions , 2014, Nucleic Acids Res..

[21]  Jinguang Wang,et al.  Bioinformatics analysis of gene expression profiles to identify causal genes in luminal B2 breast cancer , 2017, Oncology letters.

[22]  M. Cronin,et al.  A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. , 2004, The New England journal of medicine.

[23]  T. Tan,et al.  Epithelial-mesenchymal transition spectrum quantification and its efficacy in deciphering survival and drug responses of cancer patients , 2014, EMBO molecular medicine.

[24]  A. Gonzalez-Perez,et al.  Functional impact bias reveals cancer drivers , 2012, Nucleic acids research.

[25]  R. Walker,et al.  Survivin is an independent predictor of short-term survival in poor prognostic breast cancer patients , 2007, British Journal of Cancer.

[26]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[27]  M. Asaduzzaman,et al.  Tumour suppressor EP300, a modulator of paclitaxel resistance and stemness, is downregulated in metaplastic breast cancer , 2017, Breast Cancer Research and Treatment.

[28]  M. J. van de Vijver,et al.  Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. , 2006, Journal of the National Cancer Institute.

[29]  L. Saal,et al.  Refinement of breast cancer molecular classification by miRNA expression profiles , 2019, BMC Genomics.

[30]  Illés J. Farkas,et al.  CFinder: locating cliques and overlapping modules in biological networks , 2006, Bioinform..

[31]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[32]  H. Kim,et al.  Prognostic and predictive values of EGFR overexpression and EGFR copy number alteration in HER2-positive breast cancer , 2014, British Journal of Cancer.

[33]  Jiuyong Li,et al.  CBNA: A control theory based method for identifying coding and non-coding cancer drivers , 2019, PLoS Comput. Biol..

[34]  W. Mikulits,et al.  Impact of constitutive IGF1/IGF2 stimulation on the transcriptional program of human breast cancer cells. , 2007, Carcinogenesis.

[35]  Kaitlin G Scheer,et al.  Combinatorial Targeting by MicroRNAs Co-ordinates Post-transcriptional Control of EMT. , 2018, Cell systems.

[36]  E. Mardis,et al.  PAM50 gene signatures and breast cancer prognosis with adjuvant anthracycline- and taxane-based chemotherapy: correlative analysis of C9741 (Alliance) , 2016, npj Breast Cancer.

[37]  O. Wolkenhauer,et al.  E2F1 induces miR‐224/452 expression to drive EMT through TXNIP downregulation , 2014, EMBO reports.

[38]  Zemin Zhang,et al.  GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis , 2019, Nucleic Acids Res..

[39]  Hsien-Da Huang,et al.  miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database , 2015, Nucleic Acids Res..

[40]  B Marshall,et al.  Gene Ontology Consortium: The Gene Ontology (GO) database and informatics resource , 2004, Nucleic Acids Res..

[41]  D. Bartel,et al.  Predicting effective microRNA target sites in mammalian mRNAs , 2015, eLife.

[42]  Ming Lu,et al.  TransmiR: a transcription factor–microRNA regulation database , 2009, Nucleic Acids Res..

[43]  A. Barabasi,et al.  Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets , 2015, Proceedings of the National Academy of Sciences.

[44]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[45]  Xiaowei Wang,et al.  OncomiR: an online resource for exploring pan-cancer microRNA dysregulation , 2018, Bioinform..

[46]  Joshua D. Campbell,et al.  NetSig: network-based discovery from cancer genomes , 2017, Nature Methods.

[47]  Nitesh V. Chawla,et al.  Complex networks as a unified framework for descriptive analysis and predictive modeling in climate science , 2011, Stat. Anal. Data Min..

[48]  Xiulan Zhao,et al.  The suppression of DUSP5 expression correlates with paclitaxel resistance and poor prognosis in basal-like breast cancer , 2018, International journal of medical sciences.

[49]  Dong Chen,et al.  JunD and JunB integrate prostaglandin E2 activation of breast cancer-associated proximal aromatase promoters. , 2011, Molecular endocrinology.

[50]  H. Lane,et al.  Modulation of p27/Cdk2 complex formation through 4D5-mediated inhibition of HER2 receptor signaling. , 2001, Annals of oncology : official journal of the European Society for Medical Oncology.

[51]  Gary D Bader,et al.  Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer drivers , 2013 .

[52]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..