Prioritizing Cancer Genes Based on an Improved Random Walk Method

Identifying driver genes that contribute to cancer progression from numerous passenger genes, although a central goal, is a major challenge. The protein–protein interaction network provides convenient and reasonable assistance for driver gene discovery. Random walk–based methods have been widely used to prioritize nodes in social or biological networks. However, most studies select the next arriving node uniformly from the random walker's neighbors. Few consider transiting preference according to the degree of random walker's neighbors. In this study, based on the random walk method, we propose a novel approach named Driver_IRW (Driver genes discovery with Improved Random Walk method), to prioritize cancer genes in cancer-related network. The key idea of Driver_IRW is to assign different transition probabilities for different edges of a constructed cancer-related network in accordance with the degree of the nodes' neighbors. Furthermore, the global centrality (here is betweenness centrality) and Katz feedback centrality are incorporated into the framework to evaluate the probability to walk to the seed nodes. Experimental results on four cancer types indicate that Driver_IRW performs more efficiently than some previously published methods for uncovering known cancer-related genes. In conclusion, our method can aid in prioritizing cancer-related genes and complement traditional frequency and network-based methods.

[1]  Roland C. Grafström,et al.  Fibronectin 1 is a potential biomarker for radioresistance in head and neck squamous cell carcinoma , 2010, Cancer biology & therapy.

[2]  Jun Wang,et al.  MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC) , 2014, BMC Bioinformatics.

[3]  D. Karolchik,et al.  The UCSC Genome Browser database: 2016 update , 2015, bioRxiv.

[4]  M. Tangney,et al.  Modulation of p21‐activated kinase 1 alters the behavior of renal cell carcinoma , 2007, International journal of cancer.

[5]  Lei Zhang,et al.  Discovering personalized driver mutation profiles of single samples in cancer by network control strategy , 2018, Bioinform..

[6]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer genes , 2014 .

[7]  Shi-Hua Zhang,et al.  Discovery of co-occurring driver pathways in cancer , 2014, BMC Bioinformatics.

[8]  H. Kocher,et al.  Pancreatic Cancer , 2019, Methods in Molecular Biology.

[9]  Xing Chen,et al.  IRWRLDA: improved random walk with restart for lncRNA-disease association prediction , 2016, Oncotarget.

[10]  Santhilata Kuppili Venkata,et al.  The Network of Cancer Genes (NCG): a comprehensive catalogue of known and candidate cancer genes from cancer sequencing screens , 2018, Genome Biology.

[11]  Shihua Zhang,et al.  Discovery of cancer common and specific driver gene sets , 2016, Nucleic acids research.

[12]  W. Birchmeier,et al.  Wnt signalling and its impact on development and cancer , 2008, Nature Reviews Cancer.

[13]  Tao Zeng,et al.  A novel network control model for identifying personalized driver genes in cancer , 2019, PLoS Comput. Biol..

[14]  P. Robinson,et al.  Walking the interactome for prioritization of candidate disease genes. , 2008, American journal of human genetics.

[15]  Feng Wang,et al.  A random walk-based method to identify driver genes by integrating the subcellular localization and variation frequency into bipartite graph , 2019, BMC Bioinformatics.

[16]  Yuanyuan Li,et al.  Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Eli Upfal,et al.  De Novo Discovery of Mutated Driver Pathways in Cancer , 2011, RECOMB.

[18]  Hao Wu,et al.  Identifying overlapping mutated driver pathways by constructing gene networks in cancer , 2015, BMC Bioinformatics.

[19]  Michael P. Schroeder,et al.  IntOGen-mutations identifies cancer drivers across tumor types , 2013, Nature Methods.

[20]  Zerrin Isik,et al.  Integration of RNA-Seq and RPPA data for survival time prediction in cancer patients , 2017, Comput. Biol. Medicine.

[21]  Huanming Yang,et al.  Frequent mutations of chromatin remodeling genes in transitional cell carcinoma of the bladder , 2011, Nature Genetics.

[22]  Yuquan Wei,et al.  Ivermectin Induces Cytostatic Autophagy by Blocking the PAK1/Akt Axis in Breast Cancer. , 2016, Cancer research.

[23]  Niko Beerenwinkel,et al.  Network-based integration of multi-omics data for prioritizing cancer genes , 2018, Bioinform..

[24]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[25]  Matthew B. Callaway,et al.  MuSiC: Identifying mutational significance in cancer genomes , 2012, Genome research.

[26]  Benjamin J. Raphael,et al.  Identifying driver mutations in sequenced cancer genomes: computational approaches to enable precision medicine , 2014, Genome Medicine.

[27]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[28]  Kern Rei Chng,et al.  Patient-specific driver gene prediction and risk assessment through integrated network analysis of cancer omics profiles , 2015, Nucleic acids research.

[29]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[30]  Y. Moreau,et al.  Computational tools for prioritizing candidate genes: boosting disease gene discovery , 2012, Nature Reviews Genetics.

[31]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[32]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[33]  C. Cole,et al.  The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers , 2018, Nature Reviews Cancer.

[34]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[35]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[36]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[37]  D. Haber,et al.  Cancer: Drivers and passengers , 2007, Nature.

[38]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[39]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[40]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[41]  H. Boushey,et al.  Asthmatics with exacerbation during acute respiratory illness exhibit unique transcriptional signatures within the nasal mucosa , 2014, Genome Medicine.

[42]  E. Koonin,et al.  Cancer-mutation network and the number and specificity of driver mutations , 2018, Proceedings of the National Academy of Sciences.

[43]  Peng Zhang,et al.  DPRank centrality: Finding important vertices based on random walks with a new defined transition matrix , 2017, Future Gener. Comput. Syst..

[44]  Junhua Zhang,et al.  The Discovery of Mutated Driver Pathways in Cancer: Models and Algorithms , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[45]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[46]  Ruth Nussinov,et al.  Review: Precision medicine and driver mutations: Computational methods, functional assays and conformational principles for interpreting cancer drivers , 2019, PLoS Comput. Biol..

[47]  E. Marcotte,et al.  Prioritizing candidate disease genes by network-based boosting of genome-wide association data. , 2011, Genome research.

[48]  A. Børresen-Dale,et al.  TP53 mutations in human cancers: functional selection and impact on cancer prognosis and outcomes , 2007, Oncogene.

[49]  Lincoln Stein,et al.  Reactome: a database of reactions, pathways and biological processes , 2010, Nucleic Acids Res..

[50]  Núria Queralt-Rosinach,et al.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants , 2016, Nucleic Acids Res..

[51]  Hyunju Lee,et al.  DEOD: uncovering dominant effects of cancer-driver genes based on a partial covariance selection method , 2015, Bioinform..

[52]  Fei-Fei Liu,et al.  HPV Associated Head and Neck Cancer , 2016, Cancers.

[53]  Kenneth H. Buetow,et al.  PID: the Pathway Interaction Database , 2008, Nucleic Acids Res..

[54]  Anupama E. Gururaj,et al.  p21-activated kinases in cancer , 2006, Nature Reviews Cancer.

[55]  J. Minna,et al.  Genetic alteration of the β-catenin gene (CTNNB1) in human lung cancer and malignant mesothelioma and identification of a new 3p21.3 homozygous deletion , 2001, Oncogene.

[56]  F. Supek,et al.  MUFFINN: cancer gene discovery via network analysis of somatic mutation data , 2016, Genome Biology.

[57]  Luca Becchetti,et al.  Biological Random Walks: Integrating heterogeneous data in disease gene prioritization , 2019, 2019 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[58]  Davide Heller,et al.  STRING v10: protein–protein interaction networks, integrated over the tree of life , 2014, Nucleic Acids Res..

[59]  R. Dahiya,et al.  MicroRNA-1826 directly targets beta-catenin (CTNNB1) and MEK1 (MAP2K1) in VHL-inactivated renal cancer. , 2012, Carcinogenesis.