Quantification of tumour evolution and heterogeneity via Bayesian epiallele detection

BackgroundEpigenetic heterogeneity within a tumour can play an important role in tumour evolution and the emergence of resistance to treatment. It is increasingly recognised that the study of DNA methylation (DNAm) patterns along the genome – so-called ‘epialleles’ – offers greater insight into epigenetic dynamics than conventional analyses which examine DNAm marks individually.ResultsWe have developed a Bayesian model to infer which epialleles are present in multiple regions of the same tumour. We apply our method to reduced representation bisulfite sequencing (RRBS) data from multiple regions of one lung cancer tumour and a matched normal sample. The model borrows information from all tumour regions to leverage greater statistical power. The total number of epialleles, the epiallele DNAm patterns, and a noise hyperparameter are all automatically inferred from the data. Uncertainty as to which epiallele an observed sequencing read originated from is explicitly incorporated by marginalising over the appropriate posterior densities. The degree to which tumour samples are contaminated with normal tissue can be estimated and corrected for. By tracing the distribution of epialleles throughout the tumour we can infer the phylogenetic history of the tumour, identify epialleles that differ between normal and cancer tissue, and define a measure of global epigenetic disorder.ConclusionsDetection and comparison of epialleles within multiple tumour regions enables phylogenetic analyses, identification of differentially expressed epialleles, and provides a measure of epigenetic heterogeneity. R code is available at github.com/james-e-barrett.

[1]  Sheng Li,et al.  Dynamic evolution of clonal epialleles revealed by methclone , 2014, Genome Biology.

[2]  A. Bird,et al.  DNA methylation landscapes: provocative insights from epigenomics , 2008, Nature Reviews Genetics.

[3]  Seongkeon Lee,et al.  New approaches to identify cancer heterogeneity in DNA methylation studies using the lepage test and multinomial logistic regression , 2015, 2015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[4]  Matthew D Dean,et al.  Genomic landscape of human allele-specific DNA methylation , 2012, Proceedings of the National Academy of Sciences.

[5]  G. Sun,et al.  Global pattern for the effect of climate and land cover on water yield , 2015, Nature Communications.

[6]  Hehuang Xie,et al.  DMEAS: DNA methylation entropy analysis software , 2013, Bioinform..

[7]  Zohar Mukamel,et al.  Epigenetic polymorphism and the stochastic formation of differentially methylated regions in normal and cancerous tissues , 2012, Nature Genetics.

[8]  Kelly Arndt,et al.  Erratum: Genome-wide quantitative assessment of variation in DNA methylation patterns (Nucleic Acids Research (2011) 39 (4099-4108) DOI: 10.1093/nar/gkr017) , 2013 .

[9]  Olivier Elemento,et al.  Epigenomic evolution in diffuse large B-cell lymphomas , 2013, Nature Communications.

[10]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[11]  Michael J. Ziller,et al.  Locally disordered methylation forms the basis of intratumor methylome variation in chronic lymphocytic leukemia. , 2014, Cancer cell.

[12]  Jun Wang,et al.  MICA: A fast short-read aligner that takes full advantage of Many Integrated Core Architecture (MIC) , 2014, BMC Bioinformatics.

[13]  E. Richards Inherited epigenetic variation — revisiting soft inheritance , 2006, Nature Reviews Genetics.

[14]  Joseph R. Ecker,et al.  Detection of allele-specific methylation through a generalized heterogeneous epigenome model , 2012, Bioinform..

[15]  Hailong Zhu,et al.  Predicting protein functions using incomplete hierarchical labels , 2015, BMC Bioinformatics.

[16]  Jun S. Song,et al.  Intratumoral Heterogeneity of the Epigenome. , 2016, Cancer cell.

[17]  Amos Tanay,et al.  Intratumor DNA methylation heterogeneity reflects clonal evolution in aggressive prostate cancer. , 2014, Cell reports.

[18]  M. Soares,et al.  Genome-wide quantitative assessment of variation in DNA methylation patterns , 2011, Nucleic acids research.

[19]  Han Xu,et al.  MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes , 2014, Genome Biology.

[20]  Neil Priestley,et al.  SHEFFIELD TEACHING HOSPITALS NHS FOUNDATION TRUST , 2012 .

[21]  Zachary D. Smith,et al.  Preparation of reduced representation bisulfite sequencing libraries for genome-scale DNA methylation profiling , 2011, Nature Protocols.

[22]  Nicolai J. Birkbak,et al.  Tracking the Evolution of Non‐Small‐Cell Lung Cancer , 2017, The New England journal of medicine.

[23]  Nathan C. Sheffield,et al.  DNA methylation heterogeneity defines a disease spectrum in Ewing sarcoma , 2017, Nature Medicine.

[24]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[25]  Xiaowei Wu,et al.  Nonparametric Bayesian clustering to detect bipolar methylated genomic loci , 2015, BMC Bioinformatics.

[26]  Peijie Lin,et al.  Estimation of the methylation pattern distribution from deep sequencing data , 2015, BMC Bioinformatics.

[27]  Nathan C. Sheffield,et al.  Epigenetik erklärt Vielfalt von Krebs bei Kindern , 2017, TumorDiagnostik & Therapie.

[28]  Daniel Nilsson,et al.  An international effort towards developing standards for best practices in analysis, interpretation and reporting of clinical genome sequencing results in the CLARITY Challenge , 2014, Genome Biology.