Inference of Cancer Progression With Probabilistic Graphical Model From Cross-Sectional Mutation Data

With the advance of high-throughput sequencing technologies, a great amount of somatic mutation data in cancer have been produced, allowing deep analyzing tumor pathogenesis. However, the majority of these data are cross-sectional rather than temporal, and it is difficulty to infer the temporal order of gene mutations from them. In this paper, we first show a probabilistic graphical model (PGM) to infer the temporal order constrains and selectivity relation among the mutation of cancer driver genes which are presented by a directed acyclic graph. We then apply an exponential function based on the mutation probability of these driver genes to obtain their mutation waiting time which can be used to induce mutually exclusive driver pathways. Finally, we evaluate the performance of the PGM both on simulated data and real-cancer somatic mutation data. The experimental results and comparative analysis reveal that the PGM can capture most of the selectivity relation of mutated driver genes which have been validated by previous works. Furthermore, the PGM can provide new insights on simultaneously inferring driver pathways and the temporal order of their mutations from cross-sectional data.

[1]  K. Boucher,et al.  Estimating an oncogenetic tree when false negatives and positives are present. , 2002, Mathematical biosciences.

[2]  A. Buhmeida,et al.  Clinical significance of frequent somatic mutations detected by high-throughput targeted sequencing in archived colorectal cancer samples , 2016, Journal of Translational Medicine.

[3]  Niko Beerenwinkel,et al.  Quantifying cancer progression with conjunctive Bayesian networks , 2009, Bioinform..

[4]  Martin Vingron,et al.  Inferring the paths of somatic evolution in cancer , 2014, Bioinform..

[5]  Alfred V. Aho,et al.  The Transitive Reduction of a Directed Graph , 1972, SIAM J. Comput..

[6]  J. Lagergren,et al.  Learning Oncogenetic Networks by Reducing to Mixed Integer Linear Programming , 2013, PloS one.

[7]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[8]  Joshua M. Korn,et al.  Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2008, Nature.

[9]  Hao Wu,et al.  Network-Based Method for Inferring Cancer Progression at the Pathway Level from Cross-Sectional Mutation Data , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  Luke Glynn,et al.  A Probabilistic Analysis of Causation , 2011, The British Journal for the Philosophy of Science.

[11]  Andrey Korshunov,et al.  Frequent ATRX mutations and loss of expression in adult diffuse astrocytic tumors carrying IDH1/IDH2 and TP53 mutations , 2012, Acta Neuropathologica.

[12]  Ruibin Xi,et al.  Inferring the progression of multifocal liver cancer from spatial and temporal genomic heterogeneity , 2015, Oncotarget.

[13]  Thomas Lengauer,et al.  Mtreemix: a software package for learning and using mixture models of mutagenetic trees , 2005, Bioinform..

[14]  Shu-Lin Wang,et al.  GSMC: Combining Parallel Gibbs Sampling with Maximal Cliques for Hunting DNA Motif , 2017, J. Comput. Biol..

[15]  Xudong Dai,et al.  Abstract 368: Inferred regulatory interaction network from prostate cancer reveals potential regulators coordinating progression and metastasis , 2014 .

[16]  Juan Wu,et al.  A new algorithm for essential proteins identification based on the integration of protein complex co-expression information and edge clustering coefficient , 2015, Int. J. Data Min. Bioinform..

[17]  Ariel Linden,et al.  Evaluating Disease Management Program Effectiveness , 2005 .

[18]  Giancarlo Mauri,et al.  Inferring Tree Causal Models of Cancer Progression with Probability Raising , 2013, bioRxiv.

[19]  Franziska Michor,et al.  A Mathematical Methodology for Determining the Temporal Order of Pathway Alterations Arising during Gliomagenesis , 2012, PLoS Comput. Biol..

[20]  R. Lothe,et al.  The order of genetic events associated with colorectal cancer progression inferred from meta‐analysis of copy number changes , 2006, Genes, chromosomes & cancer.

[21]  Nicholas Eriksson,et al.  The Temporal Order of Genetic and Pathway Alterations in Tumorigenesis , 2011, PloS one.

[22]  E. Fearon Molecular genetics of colorectal cancer. , 2011, Annual review of pathology.

[23]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[24]  D. Haussler,et al.  The Somatic Genomic Landscape of Glioblastoma , 2013, Cell.

[25]  Naveen K. Chilamkurti,et al.  Colorectal cancer atlas: An integrative resource for genomic and proteomic annotations from colorectal cancer cell lines and tissues , 2015, Nucleic Acids Res..

[26]  Jiawei Luo,et al.  A path-based measurement for human miRNA functional similarities using miRNA-disease associations , 2016, Scientific Reports.

[27]  Cheng Liang,et al.  Predicting MicroRNA-Disease Associations Using Kronecker Regularized Least Squares Based on Heterogeneous Omics Data , 2017, IEEE Access.

[28]  X. Hua,et al.  DrGaP: a powerful tool for identifying driver genes and pathways in cancer sequencing studies. , 2013, American journal of human genetics.

[29]  Christos G. Cassandras,et al.  Integrating mutation and gene expression cross-sectional data to infer cancer progression , 2016, BMC Systems Biology.

[30]  Li Ding,et al.  Somatic Mutations of PIK3R1 Promote Gliomagenesis , 2012, PloS one.

[31]  P. Suppes A Probabilistic Theory Of Causality , 1970 .

[32]  G. Benvenuto,et al.  Epidermal growth factor receptor expression in neurofibromatosis type 1-related tumors and NF1 animal models. , 2000, The Journal of clinical investigation.

[33]  A. Sparks,et al.  The Genomic Landscapes of Human Breast and Colorectal Cancers , 2007, Science.

[34]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Dependencies in Tumor Progression , 2016, RECOMB.

[35]  Feng Jiang,et al.  Distance-Based Reconstruction of Tree Models for Oncogenesis , 2000, J. Comput. Biol..

[36]  Jack Kuipers,et al.  pathTiMEx: Joint Inference of Mutually Exclusive Cancer Pathways and Their Progression Dynamics , 2017, J. Comput. Biol..

[37]  E. E. Gresch Genetic Alterations During Colorectal-Tumor Development , 1989 .

[38]  M. Nowak,et al.  Dynamics of cancer progression , 2004, Nature Reviews Cancer.

[39]  Benjamin J. Raphael,et al.  Simultaneous Inference of Cancer Pathways and Tumor Progression from Cross-Sectional Mutation Data , 2015, J. Comput. Biol..

[40]  Camille Stephan-Otto Attolini,et al.  A mathematical framework to determine the temporal sequence of somatic genetic events in cancer , 2010, Proceedings of the National Academy of Sciences.

[41]  A. Schäffer,et al.  Graph models of oncogenesis with an application to melanoma. , 2001, Journal of theoretical biology.

[42]  Wei Zhang,et al.  An Integrated Framework for Identifying Mutated Driver Pathway and Cancer Progression , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[43]  B. Vogelstein,et al.  A genetic model for colorectal tumorigenesis , 1990, Cell.

[44]  Giancarlo Mauri,et al.  CAPRI: Efficient Inference of Cancer Progression Models from Cross-sectional Data , 2014, bioRxiv.