Modelling cancer progression using Mutual Hazard Networks

Abstract Motivation Cancer progresses by accumulating genomic events, such as mutations and copy number alterations, whose chronological order is key to understanding the disease but difficult to observe. Instead, cancer progression models use co-occurrence patterns in cross-sectional data to infer epistatic interactions between events and thereby uncover their most likely order of occurrence. State-of-the-art progression models, however, are limited by mathematical tractability and only allow events to interact in directed acyclic graphs, to promote but not inhibit subsequent events, or to be mutually exclusive in distinct groups that cannot overlap. Results Here we propose Mutual Hazard Networks (MHN), a new Machine Learning algorithm to infer cyclic progression models from cross-sectional data. MHN model events by their spontaneous rate of fixation and by multiplicative effects they exert on the rates of successive events. MHN compared favourably to acyclic models in cross-validated model fit on four datasets tested. In application to the glioblastoma dataset from The Cancer Genome Atlas, MHN proposed a novel interaction in line with consecutive biopsies: IDH1 mutations are early events that promote subsequent fixation of TP53 mutations. Availability and implementation Implementation and data are available at https://github.com/RudiSchill/MHN. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  C. Yeang,et al.  Combinatorial patterns of somatic gene mutations in cancer , 2008, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[2]  Benjamin J. Raphael,et al.  CoMEt: a statistical approach to identify combinations of mutually exclusive alterations in cancer , 2015, Genome Biology.

[3]  Jörg Rahnenführer,et al.  Variable selection for disease progression models: methods for oncogenetic trees and application to cancer and HIV , 2017, BMC Bioinformatics.

[4]  H Buerger,et al.  Different genetic pathways in the evolution of invasive breast cancer are associated with distinct morphological subtypes , 1999, The Journal of pathology.

[5]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Progression Dynamics , 2017, J. Comput. Biol..

[6]  F. Markowetz,et al.  Cancer Evolution: Mathematical Models and Computational Inference , 2014, Systematic biology.

[7]  Jens Lagergren,et al.  New Probabilistic Network Models and Algorithms for Oncogenesis , 2006, J. Comput. Biol..

[8]  Nicholas Eriksson,et al.  Conjunctive Bayesian networks , 2006, math/0608417.

[9]  J. Lagergren,et al.  Learning Oncogenetic Networks by Reducing to Mixed Integer Linear Programming , 2013, PloS one.

[10]  Jack Kuipers,et al.  Large-scale inference of conjunctive Bayesian networks , 2016, Bioinform..

[11]  H. Akaike A new look at the statistical model identification , 1974 .

[12]  Niko Beerenwinkel,et al.  Modeling Mutual Exclusivity of Cancer Mutations , 2014, RECOMB.

[13]  J. Brooks,et al.  Mutations of the VHL tumour suppressor gene in renal carcinoma , 1994, Nature Genetics.

[14]  Wayne R. Dyksen,et al.  Efficient vector and parallel manipulation of tensor products , 1996, TOMS.

[15]  W. Rathmell,et al.  VHL gene mutations in renal cell carcinoma: Role as a biomarker of disease outcome and drug efficacy , 2009, Current oncology reports.

[16]  P. Nowell The clonal evolution of tumor cell populations. , 1976, Science.

[17]  Feng Jiang,et al.  Inferring Tree Models for Oncogenesis from Comparative Genome Hybridization Data , 1999, J. Comput. Biol..

[18]  Fabio Vandin Computational Methods for Characterizing Cancer Mutational Heterogeneity , 2017, Front. Genet..

[19]  Christopher A. Miller,et al.  Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors , 2011, BMC Medical Genomics.

[20]  Jie Chen,et al.  Gene expression in cardiac tissues from infants with idiopathic conotruncal defects , 2011, BMC Medical Genomics.

[21]  Thomas Lengauer,et al.  Learning Multiple Evolutionary Pathways from Cross-Sectional Data , 2005, J. Comput. Biol..

[22]  A. Hanby,et al.  Comparative genomic hybridization of breast tumors stratified by histological grade reveals new insights into the biological progression of breast cancer. , 1999, Cancer research.

[23]  J. Rahnenführer,et al.  Cumulative disease progression models for cross‐sectional data: A review and comparison , 2012, Biometrical journal. Biometrische Zeitschrift.

[24]  Peter Buchholz,et al.  Structured analysis approaches for large Markov chains , 1999 .

[25]  B. Vogelstein,et al.  A genetic model for colorectal tumorigenesis , 1990, Cell.

[26]  Giancarlo Mauri,et al.  CAPRI: Efficient Inference of Cancer Progression Models from Cross-sectional Data , 2014, bioRxiv.

[27]  Brian Schryver,et al.  A homologue of Drosophila aurora kinase is oncogenic and amplified in human colorectal cancers , 1998, The EMBO journal.

[28]  Martin Vingron,et al.  Inferring the paths of somatic evolution in cancer , 2014, Bioinform..

[29]  G. De Micheli,et al.  Computer-Oriented Formulation of Transition-Rate Matrices via Kronecker Algebra , 1981, IEEE Transactions on Reliability.

[30]  A. Schäffer,et al.  The evolution of tumour phylogenetics: principles and practice , 2017, Nature Reviews Genetics.

[31]  D. Cox Regression Models and Life-Tables , 1972 .

[32]  Nicholas Eriksson,et al.  The Temporal Order of Genetic and Pathway Alterations in Tumorigenesis , 2011, PloS one.

[33]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[34]  Roded Sharan,et al.  Simultaneous Identification of Multiple Driver Pathways in Cancer , 2013, PLoS Comput. Biol..

[35]  P. Kleihues,et al.  IDH1 mutations are early events in the development of astrocytomas and oligodendrogliomas. , 2009, The American journal of pathology.

[36]  Teresa M. Przytycka,et al.  MEMCover: integrated analysis of mutual exclusivity and functional network reveals dysregulated pathways across multiple cancer types , 2015, Bioinform..

[37]  Niko Beerenwinkel,et al.  Quantifying cancer progression with conjunctive Bayesian networks , 2009, Bioinform..

[38]  Michael Baudis,et al.  Progenetix.net: an online repository for molecular cytogenetic aberration data , 2001, Bioinform..

[39]  C. Sander,et al.  Mutual exclusivity analysis identifies oncogenic network modules. , 2012, Genome research.

[40]  Niko Beerenwinkel,et al.  TiMEx: a waiting time model for mutually exclusive cancer alterations , 2015, Bioinform..

[41]  Benjamin E. Gross,et al.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. , 2012, Cancer discovery.

[42]  S. Gabriel,et al.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. , 2010, Cancer cell.

[43]  Jack Kuipers,et al.  Single-cell sequencing data reveal widespread recurrence and loss of mutational hits in the life histories of tumors , 2017, Genome research.

[44]  Winfried K. Grassmann Transient solutions in markovian queueing systems , 1977, Comput. Oper. Res..

[45]  Benjamin J. Raphael,et al.  Simultaneous Inference of Cancer Pathways and Tumor Progression from Cross-Sectional Mutation Data , 2015, J. Comput. Biol..

[46]  G. Wahl,et al.  MDM2 and MDM4: p53 regulators as targets in anticancer therapy. , 2007, The international journal of biochemistry & cell biology.