Hierarchical Bayesian Models for ChIP-seq Data

Histone modifications (HMs) are post-translational modifications of the nucleosome. Studying the presence or absence of these modifications in genomic regions is a central topic in modern epigenetics. HMs regulate various biological processes by overwriting the DNA-inscribed code. Experimental evidence suggests that they perform this task through a complex biological network. In other words, HMs combinatorially influence gene expression. We present two model-based approaches to decode this mechanism using ChIP-seq data. Both approaches are based on hierarchical Bayesian models. The first model derives a conditional independence structure among the HMs through a graphical model. The challenge here is to model the unobserved binary (presence/absence) status of HMs on the basis of read counts. The other critical aspect is to model the dependence between these latent binaries in a way that allows tractable posterior inference. The second model relates HMs and functional genomics through a local bi-clustering approach. Here HMs are clustered and each HM cluster gives rise to a (nested) partition of genomic loactions, with respect to that subset of HMs. These models are, to the best of our knowledge, the first model-based fully Bayesian approaches to discovering epigenetic associations. Validation with known experimental findings suggests the importance and usefulness of these approaches in our understanding of gene regulation.

[1]  Yuan Ji,et al.  Toward Breaking the Histone Code: Bayesian Graphical Models for Histone Modifications , 2013, Circulation. Cardiovascular genetics.

[2]  Manolis Kellis,et al.  Discovery and characterization of chromatin states for systematic annotation of the human genome , 2010, Nature Biotechnology.

[3]  Philipp Bucher,et al.  ChIP-Seq Data Reveal Nucleosome Architecture of Human Promoters , 2007, Cell.

[4]  Purushottam W. Laud,et al.  A Bayesian subgroup analysis with a zero‐enriched Polya Urn scheme , 2011, Statistics in medicine.

[5]  Andrew J. Bannister,et al.  Regulation of chromatin by histone modifications , 2011, Cell Research.

[6]  Peter Müller,et al.  A Nonparametric Bayesian Model for Local Clustering With Application to Proteomics , 2013, Journal of the American Statistical Association.

[7]  G. Parmigiani,et al.  A statistical framework for expression‐based molecular classification in cancer , 2002 .

[8]  A. Peters,et al.  Mechanisms of transcriptional repression by histone lysine methylation. , 2009, The International journal of developmental biology.

[9]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..

[10]  Mi Zhou,et al.  CTCFBSDB: a CTCF-binding site database for characterization of vertebrate genomic insulators , 2007, Nucleic Acids Res..

[11]  E. Lander,et al.  The Mammalian Epigenome , 2007, Cell.

[12]  Mark S. Kaiser,et al.  Autologistic models with interpretable parameters , 2009 .

[13]  Riten Mitra,et al.  Nonparametric Bayesian Bi-Clustering for Next Generation Sequencing Count Data. , 2013, Bayesian analysis.

[14]  Yuan Ji,et al.  A Bayesian Graphical Model for ChIP-Seq Data on Histone Modifications , 2013 .

[15]  J. Wong,et al.  Relationship between Histone H3 Lysine 9 Methylation, Transcription Repression, and Heterochromatin Protein 1 Recruitment , 2005, Molecular and Cellular Biology.

[16]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[17]  Michael Q. Zhang,et al.  High-resolution human core-promoter prediction with CoreBoost_HM. , 2009, Genome research.

[18]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[19]  Alan P. Wolffe,et al.  A positive role for histone acetylation in transcription factor access to nucleosomal DNA , 1993, Cell.

[20]  K. Struhl Histone acetylation and transcriptional regulatory mechanisms. , 1998, Genes & development.

[21]  K. Sneppen,et al.  Theoretical Analysis of Epigenetic Cell Memory by Nucleosome Modification , 2007, Cell.

[22]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[23]  D. B. Dahl Bayesian Inference for Gene Expression and Proteomics: Model-Based Clustering for Expression Data via a Dirichlet Process Mixture Model , 2006 .

[24]  Andrew J. Bannister,et al.  Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain , 2001, Nature.

[25]  R. Tjian,et al.  Structure and function of a human TAFII250 double bromodomain module. , 2000, Science.

[26]  Karl Mechtler,et al.  Methylation of histone H3 lysine 9 creates a binding site for HP1 proteins , 2001, Nature.

[27]  Yuri B Schwartz,et al.  Polycomb complexes and epigenetic states. , 2008, Current opinion in cell biology.

[28]  Vincenzo Pirrotta,et al.  Polycomb silencing mechanisms and the management of genomic programmes , 2007, Nature Reviews Genetics.

[29]  Michael D. Wilson,et al.  ChIP-seq: using high-throughput sequencing to discover protein-DNA interactions. , 2009, Methods.

[30]  Mikael Sigvardsson,et al.  Epigenetic chromatin states uniquely define the developmental plasticity of murine hematopoietic stem cells. , 2010, Blood.

[31]  M. Grunstein Histone acetylation in chromatin structure and transcription , 1997, Nature.

[32]  Michael Q. Zhang,et al.  Combinatorial patterns of histone acetylations and methylations in the human genome , 2008, Nature Genetics.

[33]  S. Berger The complex language of chromatin regulation during transcription , 2007, Nature.