EpiChIP: gene-by-gene quantification of epigenetic modification levels

The combination of chromatin immunoprecipitation with next-generation sequencing technology (ChIP-seq) is a powerful and increasingly popular method for mapping protein–DNA interactions in a genome-wide fashion. The conventional way of analyzing this data is to identify sequencing peaks along the chromosomes that are significantly higher than the read background. For histone modifications and other epigenetic marks, it is often preferable to find a characteristic region of enrichment in sequencing reads relative to gene annotations. For instance, many histone modifications are typically enriched around transcription start sites. Calculating the optimal window that describes this enrichment allows one to quantify modification levels for each individual gene. Using data sets for the H3K9/14ac histone modification in Th cells and an accompanying IgG control, we present an analysis strategy that alternates between single gene and global data distribution levels and allows a clear distinction between experimental background and signal. Curve fitting permits false discovery rate-based classification of genes as modified versus unmodified. We have developed a software package called EpiChIP that carries out this type of analysis, including integration with and visualization of gene expression data.

[1]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[2]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[3]  P. Farnham Insights from genomic profiling of transcription factors , 2009, Nature Reviews Genetics.

[4]  Allen D. Delaney,et al.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing , 2007, Nature Methods.

[5]  Suresh Cuddapah,et al.  The genomic landscape of histone modifications in human T cells , 2006, Proceedings of the National Academy of Sciences.

[6]  Zbigniew Darzynkiewicz,et al.  Practical flow cytometry (3rd edn): by Howard M. Shapiro, Wiley-Liss 1995. £49.95 (542 pages) ISBN 0 471 303763 , 1995 .

[7]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[8]  Dustin E. Schones,et al.  Characterization of human epigenomes. , 2009, Current opinion in genetics & development.

[9]  Hanlee P. Ji,et al.  Next-generation DNA sequencing , 2008, Nature Biotechnology.

[10]  O. C. Blair,et al.  Practical Flow Cytometry , 1985, The Yale Journal of Biology and Medicine.

[11]  J. Ahringer,et al.  Differential chromatin marking of introns and expressed exons by H3K36me3 , 2008, Nature Genetics.

[12]  H. Shapiro Practical Flow Cytometry: Shapiro/Flow Cytometry 4e , 2005 .

[13]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  G. Ast,et al.  Chromatin organization marks exon-intron structure , 2009, Nature Structural &Molecular Biology.

[16]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[17]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.

[18]  Raymond K. Auerbach,et al.  PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls , 2009, Nature Biotechnology.

[19]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[20]  C. Zang,et al.  Discrete roles of STAT4 and STAT6 transcription factors in tuning epigenetic modifications and transcription during T helper cell differentiation. , 2010, Immunity.

[21]  M. Treiber,et al.  LEF-1 Negatively Controls Interleukin-4 Expression through a Proximal Promoter Regulatory Element* , 2008, Journal of Biological Chemistry.

[22]  Mark A. Dawson,et al.  The transcriptional program controlled by the stem cell leukemia gene Scl/Tal1 during early embryonic hematopoietic development. , 2009, Blood.

[23]  Dario Strbenac,et al.  Repitools: an R package for the analysis of enrichment-based epigenomic data , 2010, Bioinform..

[24]  Alexander Varshavsky,et al.  Mapping proteinDNA interactions in vivo with formaldehyde: Evidence that histone H4 is retained on a highly transcribed gene , 1988, Cell.

[25]  Jun Song,et al.  CEAS: cis-regulatory element annotation system , 2006, Nucleic Acids Res..

[26]  Michael Q. Zhang,et al.  Combinatorial patterns of histone acetylations and methylations in the human genome , 2008, Nature Genetics.

[27]  A. Mortazavi,et al.  Computation for ChIP-seq and RNA-seq studies , 2009, Nature Methods.

[28]  Jan Komorowski,et al.  Nucleosomes are well positioned in exons and carry characteristic histone modifications. , 2009, Genome research.

[29]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[30]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[31]  P. Park ChIP–seq: advantages and challenges of a maturing technology , 2009, Nature Reviews Genetics.