Computational methods for cancer driver discovery: A survey

Identifying the genes responsible for driving cancer is of critical importance for directing treatment. Accordingly, multiple computational tools have been developed to facilitate this task. Due to the different methods employed by these tools, different data considered by the tools, and the rapidly evolving nature of the field, the selection of an appropriate tool for cancer driver discovery is not straightforward. This survey seeks to provide a comprehensive review of the different computational methods for discovering cancer drivers. We categorise the methods into three groups; methods for single driver identification, methods for driver module identification, and methods for identifying personalised cancer drivers. In addition to providing a “one-stop” reference of these methods, by evaluating and comparing their performance, we also provide readers the information about the different capabilities of the methods in identifying biologically significant cancer drivers. The biologically relevant information identified by these tools can be seen through the enrichment of discovered cancer drivers in GO biological processes and KEGG pathways and through our identification of a small cancer-driver cohort that is capable of stratifying patient survival.

[1]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[2]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Philippe Dessen,et al.  Atlas of Genetics and Cytogenetics in Oncology and Haematology, an Interactive Database , 2000, Nucleic Acids Res..

[5]  T. Hubbard,et al.  A census of human cancer genes , 2004, Nature Reviews Cancer.

[6]  Marina Pollán,et al.  Positive correlation between the expression of X‐chromosome RBM genes (RBMX, RBM3, RBM10) and the proapoptotic Bax gene in human breast cancer , 2006, Journal of cellular biochemistry.

[7]  T. Hampton,et al.  The Cancer Genome Atlas , 2020, Indian Journal of Medical and Paediatric Oncology.

[8]  A. Mele,et al.  Ago HITS-CLIP decodes miRNA-mRNA interaction maps , 2009, Nature.

[9]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[10]  W. Linehan,et al.  The genetic basis of kidney cancer: a metabolic disease , 2010, Nature Reviews Urology.

[11]  Mingming Jia,et al.  COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer , 2009, Nucleic Acids Res..

[12]  D. Pe’er,et al.  An Integrated Approach to Uncover Drivers of Cancer , 2010, Cell.

[13]  Ming Lu,et al.  TransmiR: a transcription factor–microRNA regulation database , 2009, Nucleic Acids Res..

[14]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.

[15]  A. Vinayagam,et al.  A Directed Protein Interaction Network for Investigating Intracellular Signal Transduction , 2011, Science Signaling.

[16]  Ling Lin,et al.  PathScan: a tool for discerning mutational significance in groups of putative cancer genes , 2011, Bioinform..

[17]  Syed Haider,et al.  International Cancer Genome Consortium Data Portal—a one-stop shop for cancer genomics data , 2011, Database J. Biol. Databases Curation.

[18]  Mingming Jia,et al.  COSMIC: mining complete cancer genomes in the Catalogue of Somatic Mutations in Cancer , 2010, Nucleic Acids Res..

[19]  A. Gonzalez-Perez,et al.  Functional impact bias reveals cancer drivers , 2012, Nucleic acids research.

[20]  A. Børresen-Dale,et al.  The landscape of cancer genes and mutational processes in breast cancer , 2012, Nature.

[21]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[22]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[23]  Doron Betel,et al.  Genome-wide identification of miRNA targets by PAR-CLIP. , 2012, Methods.

[24]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[25]  Ö. Türeci,et al.  NCOA3 is a selective co-activator of estrogen receptor α-mediated transactivation of PLAC1 in MCF-7 breast cancer cells , 2013, BMC Cancer.

[26]  Michael P. Schroeder,et al.  IntOGen-mutations identifies cancer drivers across tumor types , 2013, Nature Methods.

[27]  Joshua F. McMichael,et al.  DGIdb - Mining the druggable genome , 2013, Nature Methods.

[28]  D. Wheeler,et al.  Identification of a pan-cancer oncogenic microRNA superfamily anchored by a central core seed motif , 2013, Nature Communications.

[29]  David Haussler,et al.  Discovering causal pathways linking genomic events to transcriptional states using Tied Diffusion Through Interacting Events (TieDIE) , 2013, Bioinform..

[30]  David S. Goodsell,et al.  The RCSB Protein Data Bank: new resources for research and education , 2012, Nucleic Acids Res..

[31]  Benjamin E. Gross,et al.  Integrative Analysis of Complex Cancer Genomics and Clinical Profiles Using the cBioPortal , 2013, Science Signaling.

[32]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[33]  David Tamborero,et al.  OncodriveCLUST: exploiting the positional clustering of somatic mutations to identify cancer genes , 2013, Bioinform..

[34]  Gary D Bader,et al.  Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer drivers , 2013 .

[35]  Shi-Hua Zhang,et al.  Identification of mutated core cancer modules by integrating somatic mutation, copy number variation, and gene expression data , 2013, BMC Systems Biology.

[36]  C. Sander,et al.  Genome-wide analysis of non-coding regulatory mutations in cancer , 2014, Nature Genetics.

[37]  Samy Lamouille,et al.  Molecular mechanisms of epithelial–mesenchymal transition , 2014, Nature Reviews Molecular Cell Biology.

[38]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[39]  Zhuowen Tu,et al.  Similarity network fusion for aggregating data types on a genomic scale , 2014, Nature Methods.

[40]  R. C. Poulos,et al.  The search for cis-regulatory driver mutations in cancer genomes , 2015, Oncotarget.

[41]  A. Valencia,et al.  Non-coding recurrent mutations in chronic lymphocytic leukaemia , 2015, Nature.

[42]  CoMEt: A Statistical Approach to Identify Combinations of Mutually Exclusive Alterations in Cancer , 2015, RECOMB.

[43]  A. Chella,et al.  microRNA classifiers are powerful diagnostic/prognostic tools in ALK-, EGFR-, and KRAS-driven lung cancers , 2015, Proceedings of the National Academy of Sciences.

[44]  Zhongming Zhao,et al.  Systematic Prioritization of Druggable Mutations in ∼5000 Genomes Across 16 Cancer Types Using a Structural Genomics-based Approach* , 2015, Molecular & Cellular Proteomics.

[45]  Niko Beerenwinkel,et al.  Identification of Constrained Cancer Driver Genes Based on Mutation Timing , 2015, PLoS Comput. Biol..

[46]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[47]  Adam Godzik,et al.  Cancer3D: understanding cancer mutations through protein structures , 2014, Nucleic Acids Res..

[48]  Hsien-Da Huang,et al.  miRTarBase 2016: updates to the experimentally validated miRNA-target interactions database , 2015, Nucleic Acids Res..

[49]  A. Gonzalez-Perez,et al.  OncodriveFML: a general framework to identify coding and non-coding regions with cancer driver mutations , 2016, Genome Biology.

[50]  Ananya Gupta,et al.  NCOA3 coactivator is a transcriptional target of XBP1 and regulates PERK–eIF2α–ATF4 signalling in breast cancer , 2016, Oncogene.

[51]  K. Kinzler,et al.  Evaluating the evaluation of cancer driver genes , 2016, Proceedings of the National Academy of Sciences.

[52]  A. Barabasi,et al.  Controllability analysis of the directed human protein interaction network identifies disease genes and drug targets , 2015, Proceedings of the National Academy of Sciences.

[53]  Francesca D. Ciccarelli,et al.  NCG 5.0: updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings , 2015, Nucleic Acids Res..

[54]  Li Ding,et al.  Protein-structure-guided discovery of functional mutations across 19 cancer types , 2016, Nature Genetics.

[55]  Allison P. Heath,et al.  Toward a Shared Vision for Cancer Genomic Data. , 2016, The New England journal of medicine.

[56]  Andrew D. Rouillard,et al.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update , 2016, Nucleic Acids Res..

[57]  Nicola D. Roberts,et al.  Genomic Classification and Prognosis in Acute Myeloid Leukemia. , 2016, The New England journal of medicine.

[58]  C. Sander,et al.  3D clusters of somatic mutations in cancer reveal numerous rare mutations as functional targets , 2017, Genome Medicine.

[59]  C. Croce,et al.  miR-579-3p controls melanoma progression and resistance to target therapy , 2016, Proceedings of the National Academy of Sciences.

[60]  Jiuyong Li,et al.  CancerSubtypes: an R/Bioconductor package for molecular cancer subtype identification, validation and visualization , 2017, Bioinform..

[61]  M. Rubin,et al.  Non-coding genetic variation in cancer. , 2017, Current opinion in systems biology.

[62]  Emanuela M. Ghia,et al.  MicroRNA dysregulation to identify therapeutic target combinations for chronic lymphocytic leukemia , 2017, Proceedings of the National Academy of Sciences.

[63]  Abstract 4494: Tumor suppressive role of ZFP36L1 by suppressing HIF1α and Cyclin D1 in bladder and breast cancer , 2017 .

[64]  Teresa M. Przytycka,et al.  WeSME: uncovering mutual exclusivity of cancer drivers and beyond , 2016, Bioinform..

[65]  Niko Beerenwinkel,et al.  Computational approaches for the identification of cancer genes and pathways , 2016, Wiley interdisciplinary reviews. Systems biology and medicine.

[66]  Kaitlin G Scheer,et al.  Post-transcriptional control of EMT is coordinated through combinatorial targeting by multiple microRNAs , 2017, bioRxiv.

[67]  Liming Xu,et al.  Enhancement of Proliferation and Invasion of Gastric Cancer Cell by KDM5C Via Decrease in p53 Expression , 2016, Technology in cancer research & treatment.

[68]  Fabio Vandin Computational Methods for Characterizing Cancer Mutational Heterogeneity , 2017, Front. Genet..

[69]  Nicholas A. Sinnott-Armstrong,et al.  Pan-cancer screen for mutations in non-coding elements with conservation and cancer specificity reveals correlations with expression and survival , 2017, npj Genomic Medicine.

[70]  Lei Zhang,et al.  Discovering personalized driver mutation profiles of single samples in cancer by network control strategy , 2018, Bioinform..

[71]  Ruth Nussinov,et al.  Precision medicine review: rare driver mutations and their biophysical classification , 2019, Biophysical Reviews.

[72]  Xiaowei Wang,et al.  OncomiR: an online resource for exploring pan-cancer microRNA dysregulation , 2018, Bioinform..

[73]  Junpeng Zhang,et al.  Identifying miRNA-mRNA regulatory relationships in breast cancer with invariant causal prediction , 2018, BMC Bioinformatics.

[74]  Tao Zeng,et al.  A novel network control model for identifying personalized driver genes in cancer , 2019, PLoS Comput. Biol..

[75]  Jiuyong Li,et al.  CBNA: A control theory based method for identifying coding and non-coding cancer drivers , 2019, PLoS Comput. Biol..

[76]  X. Hua,et al.  DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies , 2019, Nucleic acids research.

[77]  Ruth Nussinov,et al.  Review: Precision medicine and driver mutations: Computational methods, functional assays and conformational principles for interpreting cancer drivers , 2019, PLoS Comput. Biol..

[78]  R. Nussinov,et al.  Why Are Some Driver Mutations Rare? , 2019, Trends in pharmacological sciences.

[79]  Jiuyong Li,et al.  Identifying miRNA synergism using multiple-intervention causal inference , 2019, BMC Bioinformatics.

[80]  Shaoyong Lu,et al.  AlloDriver: a method for the identification and analysis of cancer driver targets , 2019, Nucleic Acids Res..

[81]  Ruth Nussinov,et al.  Autoinhibition can identify rare driver mutations and advise pharmacology , 2020, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[82]  Vu V H Pham,et al.  DriverGroup: A novel method for identifying driver gene groups , 2020, bioRxiv.

[83]  J. Downing,et al.  CICERO: a versatile method for detecting complex and diverse driver fusions using cancer RNA sequencing data , 2020, Genome Biology.

[84]  Vu V H Pham,et al.  pDriver : A novel method for unravelling personalised coding and miRNA cancer drivers , 2020, bioRxiv.