GenHITS: A network science approach to driver gene detection in human regulatory network using gene's influence evaluation

Cancer is among the diseases causing death, in which, cells uncontrollably grow and reproduce beyond the cell regulatory mechanism. In this disease, some genes are initiators of abnormalities and then transmit them to other genes through protein interactions. Accordingly, these genes are known as cancer driver genes (CDGs). In this regard, several methods have been previously developed for identifying cancer driver genes. Most of these methods are computational-based, which use the concept of mutation to predict CDGs. In this research, a method has been proposed for identifying CDGs in the transcription regulatory network using the concept of influence diffusion and by modifying the Hyperlink-Induced Topic Search algorithm based on the diffusion concept. Due to the type of these networks and the processes of abnormality progression in cells and the formation of cancerous tumors, high-influence genes can be the most likely considered as the driver genes. Therefore, we can use the influence diffusion concept as an acceptable theory to identify these genes. Recently, a method has been proposed to detect CDGs with the concept of the influence maximization. One of the challenges in these types of networks is finding the power of regulatory interaction between genes. Moreover, we have proposed a novel method to calculate the weight of regulatory interactions, based on the concept of diffusion. The performance of the proposed method was compared with other seventeen computational and network tools. Correspondingly, three cancer types were used as benchmarks as follows: breast invasive carcinoma (BRCA), Colon adenocarcinoma (COAD), and lung squamous cell carcinoma (LUSC). In addition, to determine the accuracy of the detected drivers using each method, CGC (Cancer Gene Census) and Mut-driver gene lists were utilized as gold standard. The results show that GenHITS performs better compared to the most of the other computational and network methods. Besides, it is also able to identify genes that have been identified by none of the other methods yet.

[1]  Benjamin M. Bolstad,et al.  affy - analysis of Affymetrix GeneChip data at the probe level , 2004, Bioinform..

[2]  M. Elhefnawi,et al.  Co-regulatory Network of Oncosuppressor miRNAs and Transcription Factors for Pathology of Human Hepatic Cancer Stem Cells (HCSC) , 2019, Scientific Reports.

[3]  Mario Cataldi,et al.  Emerging topic detection on Twitter based on temporal and social terms evaluation , 2010, MDMKDD '10.

[4]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer genes , 2014 .

[5]  K. Tomczak,et al.  The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge , 2015, Contemporary oncology.

[6]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[7]  Z. Yakhini,et al.  Identifying In-Trans Process Associated Genes in Breast Cancer by Integrated Analysis of Copy Number and Expression Data , 2013, PloS one.

[8]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[9]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[10]  Gary D. Bader,et al.  The mutational landscape of phosphorylation signaling in cancer , 2013, Scientific Reports.

[11]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[12]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[13]  Richard Simon,et al.  Identifying cancer driver genes in tumor genome sequencing studies , 2011, Bioinform..

[15]  Hao Sun,et al.  mTFkb: a knowledgebase for fundamental annotation of mouse transcription factors , 2017, Scientific Reports.

[16]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[17]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[18]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.

[19]  A. Gonzalez-Perez,et al.  Functional impact bias reveals cancer drivers , 2012, Nucleic acids research.

[20]  Lev Muchnik,et al.  Identifying influential spreaders in complex networks , 2010, 1001.5285.

[21]  Shi-Hua Zhang,et al.  Efficient methods for identifying mutated driver pathways in cancer , 2012, Bioinform..

[22]  Benjamin J. Raphael,et al.  De novo discovery of mutated driver pathways in cancer , 2011 .

[23]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[24]  Wei-Chung Cheng,et al.  DriverDBv2: a database for human cancer driver gene research , 2015, Nucleic Acids Res..

[25]  Shi-Hua Zhang,et al.  Discovery of co-occurring driver pathways in cancer , 2014, BMC Bioinformatics.

[26]  O. Wolkenhauer,et al.  MicroRNA and Transcription Factor Gene Regulatory Network Analysis Reveals Key Regulatory Elements Associated with Prostate Cancer Progression , 2016, PloS one.

[27]  Hyojin Kim,et al.  TRRUST v2: an expanded reference database of human and mouse transcriptional regulatory interactions , 2017, Nucleic Acids Res..

[28]  David Haussler,et al.  Discovering causal pathways linking genomic events to transcriptional states using Tied Diffusion Through Interacting Events (TieDIE) , 2013, Bioinform..

[29]  Babak Teimourpour,et al.  Cancer driver gene discovery in transcriptional regulatory networks using influence maximization approach , 2019, Comput. Biol. Medicine.

[30]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[31]  Xia Yang,et al.  Mergeomics: a web server for identifying pathological pathways, networks, and key regulators via multidimensional data integration , 2016, BMC Genomics.

[32]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[33]  Christos Faloutsos,et al.  Understanding and Managing Cascades on Large Graphs , 2012, Proc. VLDB Endow..

[34]  Adam Godzik,et al.  e-Driver: a novel method to identify protein regions driving cancer , 2014, Bioinform..

[35]  Vasileios Karyotis,et al.  Diffusion Models for Information Dissemination Dynamics in Wireless Complex Communication Networks , 2013 .