Inferring nucleosome positions with their histone mark annotation from ChIP data

Motivation: The nucleosome is the basic repeating unit of chromatin. It contains two copies each of the four core histones H2A, H2B, H3 and H4 and about 147 bp of DNA. The residues of the histone proteins are subject to numerous post-translational modifications, such as methylation or acetylation. Chromatin immunoprecipitiation followed by sequencing (ChIP-seq) is a technique that provides genome-wide occupancy data of these modified histone proteins, and it requires appropriate computational methods. Results: We present NucHunter, an algorithm that uses the data from ChIP-seq experiments directed against many histone modifications to infer positioned nucleosomes. NucHunter annotates each of these nucleosomes with the intensities of the histone modifications. We demonstrate that these annotations can be used to infer nucleosomal states with distinct correlations to underlying genomic features and chromatin-related processes, such as transcriptional start sites, enhancers, elongation by RNA polymerase II and chromatin-mediated repression. Thus, NucHunter is a versatile tool that can be used to predict positioned nucleosomes from a panel of histone modification ChIP-seq experiments and infer distinct histone modification patterns associated to different chromatin states. Availability: The software is available at http://epigen.molgen.mpg.de/nuchunter/. Contact: chung@molgen.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Andrew D. Smith,et al.  Bioinformatics Applications Note Gene Expression Identifying Dispersed Epigenomic Domains from Chip-seq Data , 2022 .

[2]  Oscar Flores,et al.  nucleR: a package for non-parametric nucleosome positioning , 2011, Bioinform..

[3]  M. Vingron,et al.  Sequence-dependent nucleosome positioning. , 2009, Journal of molecular biology.

[4]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[5]  B. Turner,et al.  The adjustable nucleosome: an epigenetic signaling module. , 2012, Trends in genetics : TIG.

[6]  Kristin R Brogaard,et al.  A base pair resolution map of nucleosome positions in yeast , 2012, Nature.

[7]  I. Albert,et al.  Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome , 2007, Nature.

[8]  T. Richmond,et al.  DNA binding within the nucleosome core. , 1998, Current opinion in structural biology.

[9]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[10]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[11]  Nir Friedman,et al.  High-resolution nucleosome mapping reveals transcription-dependent promoter packaging. , 2010, Genome research.

[12]  Gordon Robertson,et al.  Probabilistic Inference for Nucleosome Positioning with MNase-Based or Sonicated Short-Read Data , 2012, PloS one.

[13]  Chen Zeng,et al.  A clustering approach for identification of enriched domains from histone modification ChIP-Seq data , 2009, Bioinform..

[14]  Ben Lehner,et al.  Human genes with CpG island promoters have a distinct transcription-associated chromatin organization , 2012, Genome Biology.

[15]  Jun S. Song,et al.  Identifying Positioned Nucleosomes with Epigenetic Marks in Human from ChIP-Seq , 2008, BMC Genomics.

[16]  Manolis Kellis,et al.  Discovery and characterization of chromatin states for systematic annotation of the human genome , 2010, Nature Biotechnology.

[17]  Naama Barkai,et al.  Expression noise and acetylation profiles distinguish HDAC functions. , 2012, Molecular cell.

[18]  S. Rice Mathematical analysis of random noise , 1944 .

[19]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[20]  Dave Hale,et al.  Recursive Gaussian filters , 2006 .

[21]  D. Reinberg,et al.  The Polycomb complex PRC2 and its mark in life , 2011, Nature.

[22]  Raymond K. Auerbach,et al.  A User's Guide to the Encyclopedia of DNA Elements (ENCODE) , 2011, PLoS biology.