An Integrated Framework for Identifying Mutated Driver Pathway and Cancer Progression

Next-generation sequencing (NGS) technologies provide amount of somatic mutation data in a large number of patients. The identification of mutated driver pathway and cancer progression from these data is a challenging task because of the heterogeneity of interpatient. In addition, cancer progression at the pathway level has been proved to be more reasonable than at the gene level. In this paper, we introduce an integrated framework to identify mutated driver pathways and cancer progression (iMDPCP) at the pathway level from somatic mutation data. First, we use uncertainty coefficient to quantify mutual exclusivity on gene driver pathways and develop a computational framework to identify mutated driver pathways based on the adaptive discrete differential evolution algorithm. Then, we construct cancer progression model for driver pathways based on the Bayesian Network. Finally, we evaluate the performance of iMDPCP on real cancer somatic mutation datasets. The experimental results indicate that iMDPCP is more accurate than state-of-the-art methods according to the enrichment of KEGG pathways, and it also provides new insights on identifying cancer progression at the pathway level.

[1]  K. Plate,et al.  p53 Mutations versus EGF Receptor Expression in Giant Cell Glioblastomas , 1997, Journal of neuropathology and experimental neurology.

[2]  Kamran Ayub,et al.  NSAIDs Modulate CDKN2A, TP53, and DNA Content Risk for Progression to Esophageal Adenocarcinoma , 2007, PLoS medicine.

[3]  A. Nalla,et al.  A novel approach to identify driver genes involved in androgen-independent prostate cancer , 2014, Molecular Cancer.

[4]  D. Eisenberg,et al.  Use of Logic Relationships to Decipher Protein Network Organization , 2004, Science.

[5]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[6]  Benjamin J. Raphael,et al.  Pan-Cancer Network Analysis Identifies Combinations of Rare Somatic Mutations across Pathways and Protein Complexes , 2014, Nature Genetics.

[7]  A. Ashworth,et al.  Genetic Interactions in Cancer Progression and Treatment , 2011, Cell.

[8]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[9]  Thomas Lengauer,et al.  Mtreemix: a software package for learning and using mixture models of mutagenetic trees , 2005, Bioinform..

[10]  Ruibin Xi,et al.  Inferring the progression of multifocal liver cancer from spatial and temporal genomic heterogeneity , 2015, Oncotarget.

[11]  R. Lothe,et al.  The order of genetic events associated with colorectal cancer progression inferred from meta‐analysis of copy number changes , 2006, Genes, chromosomes & cancer.

[12]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[13]  Chitta Baral,et al.  Joint learning of logic relationships for studying protein function using phylogenetic profiles and the rosetta stone method , 2006, IEEE Transactions on Signal Processing.

[14]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Progression Dynamics , 2017, J. Comput. Biol..

[15]  Eli Upfal,et al.  De Novo Discovery of Mutated Driver Pathways in Cancer , 2011, RECOMB.

[16]  Christopher A. Miller,et al.  Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors , 2011, BMC Medical Genomics.

[17]  Shi-Hua Zhang,et al.  Discovery of co-occurring driver pathways in cancer , 2014, BMC Bioinformatics.

[18]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[19]  Y. Yonekawa,et al.  Overexpression of the EGF receptor and p53 mutations are mutually exclusive in the evolution of primary and secondary glioblastomas. , 1996, Brain pathology.

[20]  Junfeng Xia,et al.  Identification of mutated driver pathways in cancer using a multi-objective optimization model , 2016, Comput. Biol. Medicine.

[21]  Benjamin J. Raphael,et al.  CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015, Genome Biology.

[22]  Franziska Michor,et al.  A Mathematical Methodology for Determining the Temporal Order of Pathway Alterations Arising during Gliomagenesis , 2012, PLoS Comput. Biol..

[23]  E. E. Gresch Genetic Alterations During Colorectal-Tumor Development , 1989 .

[24]  Jingxuan Yang,et al.  Deregulated Signaling Pathways in Glioblastoma Multiforme: Molecular Mechanisms and Therapeutic Targets , 2012, Cancer investigation.

[25]  H. Greulich,et al.  The genomics of lung adenocarcinoma: opportunities for targeted therapies. , 2010, Genes & cancer.

[26]  Niko Beerenwinkel,et al.  Identification of Constrained Cancer Driver Genes Based on Mutation Timing , 2015, PLoS Comput. Biol..

[27]  Giancarlo Mauri,et al.  Inferring Tree Causal Models of Cancer Progression with Probability Raising , 2013, bioRxiv.

[28]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[29]  S. Vacher,et al.  Differential Distribution of erbB Receptors in Human Glioblastoma Multiforme: Expression of erbB3 in CD133-Positive Putative Cancer Stem Cells , 2010, Journal of neuropathology and experimental neurology.

[30]  J. P. Hou,et al.  DawnRank: discovering personalized driver genes in cancer , 2014, Genome Medicine.

[31]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[32]  E Premkumar Reddy,et al.  Understanding the temporal sequence of genetic events that lead to prostate cancer progression and metastasis , 2013, Proceedings of the National Academy of Sciences.

[33]  Lin Gao,et al.  Detection of driver pathways using mutated gene network in cancer. , 2016, Molecular bioSystems.

[34]  Benjamin J. Raphael,et al.  Simultaneous Inference of Cancer Pathways and Tumor Progression from Cross-Sectional Mutation Data , 2015, J. Comput. Biol..

[35]  Hao Wu,et al.  Network-Based Method for Inferring Cancer Progression at the Pathway Level from Cross-Sectional Mutation Data , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[36]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[37]  D. Louis,et al.  The retinoblastoma gene is involved in malignant progression of astrocytomas , 1994, Annals of neurology.

[38]  Shi-Hua Zhang,et al.  Efficient methods for identifying mutated driver pathways in cancer , 2012, Bioinform..

[39]  E. Sprinzak,et al.  Utilizing logical relationships in genomic data to decipher cellular processes , 2005, The FEBS journal.

[40]  Niko Beerenwinkel,et al.  Efficient sampling for Bayesian inference of conjunctive Bayesian networks , 2012, Bioinform..

[41]  X. Hua,et al.  DrGaP: a powerful tool for identifying driver genes and pathways in cancer sequencing studies. , 2013, American journal of human genetics.

[42]  J. Uhm Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .

[43]  J. Mora,et al.  Exome and deep sequencing of clinically aggressive neuroblastoma reveal somatic mutations that affect key pathways involved in cancer progression , 2016, Oncotarget.

[44]  Hao Wu,et al.  Identifying overlapping mutated driver pathways by constructing gene networks in cancer , 2015, BMC Bioinformatics.

[45]  Alberto Orfao,et al.  Amplified and Homozygously Deleted Genes in Glioblastoma: Impact on Gene Expression Levels , 2012, PloS one.

[46]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[47]  Gary D Bader,et al.  Comprehensive identification of mutational cancer driver genes across 12 tumor types , 2013, Scientific Reports.

[48]  Giancarlo Mauri,et al.  CAPRI: Efficient Inference of Cancer Progression Models from Cross-sectional Data , 2014, bioRxiv.

[49]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[50]  Shihua Zhang,et al.  Discovery of cancer common and specific driver gene sets , 2016, Nucleic acids research.

[51]  A. Schäffer,et al.  Graph models of oncogenesis with an application to melanoma. , 2001, Journal of theoretical biology.

[52]  G. Parmigiani,et al.  Core Signaling Pathways in Human Pancreatic Cancers Revealed by Global Genomic Analyses , 2008, Science.

[53]  Brian H. Dunford-Shore,et al.  Somatic mutations affect key pathways in lung adenocarcinoma , 2008, Nature.

[54]  Jianxin Shi,et al.  MEGSA: A powerful and flexible framework for analyzing mutual exclusivity of tumor mutations , 2015, bioRxiv.

[55]  Niko Beerenwinkel,et al.  Quantifying cancer progression with conjunctive Bayesian networks , 2009, Bioinform..

[56]  Nicholas Eriksson,et al.  The Temporal Order of Genetic and Pathway Alterations in Tumorigenesis , 2011, PloS one.

[57]  Junhua Zhang,et al.  The Discovery of Mutated Driver Pathways in Cancer: Models and Algorithms , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[58]  C. Sander,et al.  Automated Network Analysis Identifies Core Pathways in Glioblastoma , 2010, PloS one.