pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Dependencies in Tumor Progression

In recent years, high-throughput sequencing technologies have facilitated the generation of an unprecedented amount of genomic cancer data, opening the way to a more profound understanding of tumorigenesis. In this regard, two fundamental questions have emerged: (1) which alterations drive tumor progression? and (2) what are the evolutionary constraints on the order in which these alterations occur? Answering these questions is crucial for therapeutic decisions involving targeted agents, which are often based on the identification of early genetic events. Mainly because of interpatient heterogeneity, progression at the level of pathways has been shown to be more robust than progression at the level of single genes. Here, we introduce pathTiMEx, a probabilistic generative model of tumor progression at the level of mutually exclusive driver pathways. pathTiMEx employs a stochastic optimization procedure to jointly optimize the assignment of genes to pathways and the evolutionary order constraints among pathways. On cancer data, pathTiMEx recapitulates previous knowledge on tumorigenesis, such as the temporal order among pathways which include APC, KRAS and TP53 in colorectal cancer, while also proposing new biological hypotheses, such as the existence of a single early causal event consisting of the amplification of CDK4 and the deletion of CDKN2A in glioblastoma. The pathTiMEx R package is available at https://github.com/cbg-ethz/pathTiMEx. Supplementary Material for this article is available online.

[1]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[2]  Benjamin J. Raphael,et al.  Integrated Genomic Analyses of Ovarian Carcinoma , 2011, Nature.

[3]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[4]  J. Lagergren,et al.  Learning Oncogenetic Networks by Reducing to Mixed Integer Linear Programming , 2013, PloS one.

[5]  F. Markowetz,et al.  Cancer Evolution: Mathematical Models and Computational Inference , 2014, Systematic biology.

[6]  Camille Roth,et al.  Natural Scales in Geographical Patterns , 2017, Scientific Reports.

[7]  Benjamin J. Raphael,et al.  CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015, Genome Biology.

[8]  Upender Manne,et al.  Prognostic value of mucin 4 expression in colorectal adenocarcinomas , 2010, Cancer.

[9]  D. Haussler,et al.  The Somatic Genomic Landscape of Glioblastoma , 2013, Cell.

[10]  Jens Lagergren,et al.  New Probabilistic Network Models and Algorithms for Oncogenesis , 2006, J. Comput. Biol..

[11]  Nicholas Eriksson,et al.  Conjunctive Bayesian networks , 2006, math/0608417.

[12]  L. Trusolino,et al.  Oncogene addiction as a foundational rationale for targeted anti-cancer therapy: promises and perils , 2011, EMBO molecular medicine.

[13]  Guanming Wu,et al.  Manic fringe promotes a claudin-low breast cancer phenotype through notch-mediated PIK3CG induction. , 2015, Cancer research.

[14]  I. Weinstein Addiction to Oncogenes--the Achilles Heal of Cancer , 2002, Science.

[15]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumors , 2012, Nature.

[16]  Ramon Diaz-Uriarte,et al.  Identifying restrictions in the order of accumulation of mutations during tumor progression: effects of passengers, evolutionary models, and sampling , 2015, BMC Bioinformatics.

[17]  K. Kinzler,et al.  Cancer Genome Landscapes , 2013, Science.

[18]  M. Levandowsky,et al.  Distance between Sets , 1971, Nature.

[19]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[20]  B. Vogelstein,et al.  A genetic model for colorectal tumorigenesis , 1990, Cell.

[21]  Giancarlo Mauri,et al.  CAPRI: Efficient Inference of Cancer Progression Models from Cross-sectional Data , 2014, bioRxiv.

[22]  C. Sander,et al.  Systematic identification of cancer driving signaling pathways based on mutual exclusivity of genomic alterations , 2014 .

[23]  Benjamin J. Raphael,et al.  Simultaneous Inference of Cancer Pathways and Tumor Progression from Cross-Sectional Mutation Data , 2015, J. Comput. Biol..

[24]  Camille Stephan-Otto Attolini,et al.  A mathematical framework to determine the temporal sequence of somatic genetic events in cancer , 2010, Proceedings of the National Academy of Sciences.

[25]  Giancarlo Mauri,et al.  Efficient inference of cancer progression models , 2014 .

[26]  Eytan Ruppin,et al.  Predicting Cancer-Specific Vulnerability via Data-Driven Detection of Synthetic Lethality , 2014, Cell.

[27]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[28]  Eli Upfal,et al.  De Novo Discovery of Mutated Driver Pathways in Cancer , 2011, RECOMB.

[29]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[30]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[31]  Niko Beerenwinkel,et al.  Modeling Mutual Exclusivity of Cancer Mutations , 2014, RECOMB.

[32]  Niko Beerenwinkel,et al.  Efficient sampling for Bayesian inference of conjunctive Bayesian networks , 2012, Bioinform..

[33]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[34]  Niko Beerenwinkel,et al.  Quantifying cancer progression with conjunctive Bayesian networks , 2009, Bioinform..

[35]  Giancarlo Mauri,et al.  Inferring Tree Causal Models of Cancer Progression with Probability Raising , 2013, bioRxiv.

[36]  Seth Sullivant,et al.  Markov models for accumulating mutations , 2007, 0709.2646.

[37]  E. Fearon Molecular genetics of colorectal cancer. , 2011, Annual review of pathology.

[38]  G. Getz,et al.  GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers , 2011, Genome Biology.

[39]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[40]  Steven J. M. Jones,et al.  Comprehensive molecular portraits of human breast tumours , 2013 .

[41]  Feng Jiang,et al.  Inferring Tree Models for Oncogenesis from Comparative Genome Hybridization Data , 1999, J. Comput. Biol..

[42]  Franziska Michor,et al.  A Mathematical Methodology for Determining the Temporal Order of Pathway Alterations Arising during Gliomagenesis , 2012, PLoS Comput. Biol..

[43]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[44]  Benjamin J. Raphael,et al.  Abstract 1936: CoMEt: A statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015 .

[45]  Nicholas Eriksson,et al.  The Temporal Order of Genetic and Pathway Alterations in Tumorigenesis , 2011, PloS one.

[46]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[47]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[48]  Niko Beerenwinkel,et al.  TiMEx: a waiting time model for mutually exclusive cancer alterations , 2015, Bioinform..