Prediction of Driver Modules via Balancing Exclusive Coverages of Mutations in Cancer Samples

Abstract Mutual exclusivity of cancer driving mutations is a frequently observed phenomenon in the mutational landscape of cancer. The long tail of rare mutations complicates the discovery of mutually exclusive driver modules. The existing methods usually suffer from the problem that only few genes in some identified modules cover most of the cancer samples. To overcome this hurdle, an efficient method UniCovEx is presented via identifying mutually exclusive driver modules of balanced exclusive coverages. UniCovEx first searches for candidate driver modules with a strong topological relationship in signaling networks using a greedy strategy. It then evaluates the candidate modules by considering their coverage, exclusivity, and balance of coverage, using a novel metric termed exclusive entropy of modules, which measures how balanced the modules are. Finally, UniCovEx predicts sample‐specific driver modules by solving a minimum set cover problem using a greedy strategy. When tested on 12 The Cancer Genome Atlas datasets of different cancer types, UniCovEx shows a significant superiority over the previous methods. The software is available at: https://sourceforge.net/projects/cancer‐pathway/files/.

[1]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.

[2]  Benjamin J. Raphael,et al.  Identifying driver mutations in sequenced cancer genomes: computational approaches to enable precision medicine , 2014, Genome Medicine.

[3]  Benjamin J. Raphael,et al.  A weighted exact test for mutually exclusive mutations in cancer , 2016, Bioinform..

[4]  Yoram Cohen,et al.  Early Occurrence of RASSF1A Hypermethylation and Its Mutual Exclusion with BRAF Mutation in Thyroid Tumorigenesis , 2004, Cancer Research.

[5]  Ratna Chakrabarti,et al.  MicroRNA expressions associated with progression of prostate cancer cells to antiandrogen therapy resistance , 2014, Molecular Cancer.

[6]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[7]  D. El-Lebedy,et al.  Complement factor H polymorphism rs1061170 and the effect of cigarette smoking on the risk of lung cancer , 2015, Contemporary oncology.

[8]  Benjamin J. Raphael,et al.  Advances for studying clonal evolution in cancer. , 2013, Cancer letters.

[9]  Brian H. Dunford-Shore,et al.  Somatic mutations affect key pathways in lung adenocarcinoma , 2008, Nature.

[10]  Shi-Hua Zhang,et al.  Efficient methods for identifying mutated driver pathways in cancer , 2012, Bioinform..

[11]  Shihua Zhang,et al.  Discovery of cancer common and specific driver gene sets , 2016, Nucleic acids research.

[12]  Wei Wang,et al.  High prevalence and mutual exclusivity of genetic alterations in the phosphatidylinositol-3-kinase/akt pathway in thyroid tumors. , 2007, The Journal of clinical endocrinology and metabolism.

[13]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[14]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[15]  Susumu Goto,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 2000, Nucleic Acids Res..

[16]  C. Yeang,et al.  Combinatorial patterns of somatic gene mutations in cancer , 2008, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[17]  Benjamin J. Raphael,et al.  CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015, Genome Biology.

[18]  K. Becker,et al.  The Genetic Association Database , 2004, Nature Genetics.

[19]  N. Nowak,et al.  Translocation (4;11)(p12;q23) with rearrangement of FRYL and MLL in therapy-related acute myeloid leukemia. , 2007, Cancer genetics and cytogenetics.

[20]  C. Sander,et al.  Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations , 2014, Genome Biology.

[21]  Shi-Hua Zhang,et al.  Discovery of co-occurring driver pathways in cancer , 2014, BMC Bioinformatics.

[22]  Hong-Qiang Wang,et al.  Simulated Annealing Based Algorithm for Identifying Mutated Driver Pathways in Cancer , 2014, BioMed research international.

[23]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[24]  Christopher A. Miller,et al.  Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors , 2011, BMC Medical Genomics.

[25]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[26]  N. Schork,et al.  Identification of rare cancer driver mutations by network reconstruction. , 2009, Genome research.

[27]  Zhuo Yu,et al.  ANGPTL2/LILRB2 signaling promotes the propagation of lung cancer cells , 2015, OncoTarget.

[28]  D. Birnbaum,et al.  Mutual exclusion of ASXL1 and NPM1 mutations in a series of acute myeloid leukemias , 2010, Leukemia.

[29]  A. Bashashati,et al.  DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer , 2012, Genome Biology.

[30]  Guojun Li,et al.  Identification of driver modules in pan-cancer via coordinating coverage and exclusivity , 2017, Oncotarget.

[31]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[32]  Lisa J Zimmerman,et al.  Identification of Proteomic Features To Distinguish Benign Pulmonary Nodules from Lung Adenocarcinoma. , 2017, Journal of proteome research.

[33]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[34]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[35]  E. Birney,et al.  Patterns of somatic mutation in human cancer genomes , 2007, Nature.

[36]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[37]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[38]  Benjamin J. Raphael,et al.  De novo discovery of mutated driver pathways in cancer , 2011 .

[39]  W. Hahn,et al.  Modelling the molecular circuitry of cancer , 2002, Nature Reviews Cancer.

[40]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[41]  Haibo He,et al.  Learning from Imbalanced Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[42]  Junhua Zhang,et al.  The Discovery of Mutated Driver Pathways in Cancer: Models and Algorithms , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[43]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[44]  Francesca D. Ciccarelli,et al.  NCG 5.0: updates of a manually curated repository of cancer genes and associated properties from cancer mutational screenings , 2015, Nucleic Acids Res..

[45]  K. Kinzler,et al.  Cancer genes and the pathways they control , 2004, Nature Medicine.

[46]  Teresa M. Przytycka,et al.  WeSME: uncovering mutual exclusivity of cancer drivers and beyond , 2016, Bioinform..