Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion

The regulation of DNA accessibility through nucleosome positioning is important for transcription control. Computational models have been developed to predict genome-wide nucleosome positions from DNA sequences, but these models consider only nucleosome sequences, which may have limited their power. We developed a statistical multi-resolution approach to identify a sequence signature, called the N-score, that distinguishes nucleosome binding DNA from non-nucleosome DNA. This new approach has significantly improved the prediction accuracy. The sequence information is highly predictive for local nucleosome enrichment or depletion, whereas predictions of the exact positions are only modestly more accurate than a null model, suggesting the importance of other regulatory factors in fine-tuning the nucleosome positions. The N-score in promoter regions is negatively correlated with gene expression levels. Regulatory elements are enriched in low N-score regions. While our model is derived from yeast data, the N-score pattern computed from this model agrees well with recent high-resolution protein-binding data in human.

[1]  I. Albert,et al.  Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome , 2007, Nature.

[2]  E. Trifonov,et al.  Nucleotide sequence-directed mapping of the nucleosomes. , 1983, Nucleic acids research.

[3]  Steven M. Johnson,et al.  Flexibility and constraint in the nucleosome core landscape of Caenorhabditis elegans chromatin. , 2006, Genome research.

[4]  I. Albert,et al.  Nucleosome positions predicted through comparative genomics , 2006, Nature Genetics.

[5]  Massimo Vergassola,et al.  Computational detection of genomic cis-regulatory modules applied to body patterning in the early Drosophila embryo , 2002, BMC Bioinformatics.

[6]  Jun S. Song,et al.  High-throughput mapping of the chromatin structure of human promoters , 2007, Nature Biotechnology.

[7]  W. Wong,et al.  Computational Biology: Toward Deciphering Gene Regulatory Information in Mammalian Genomes , 2006, Biometrics.

[8]  Saurabh Sinha,et al.  Stubb: a program for discovery and analysis of cis-regulatory modules , 2006, Nucleic Acids Res..

[9]  Michael Q. Zhang,et al.  Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human Genome , 2007, Cell.

[10]  William Stafford Noble,et al.  Nucleosome positioning signals in genomic DNA. , 2007, Genome research.

[11]  V. Iyer,et al.  Poly(dA:dT), a ubiquitous promoter element that stimulates transcription via its intrinsic DNA structure. , 1995, The EMBO journal.

[12]  R. Kornberg,et al.  Twenty-Five Years of the Nucleosome, Fundamental Particle of the Eukaryote Chromosome , 1999, Cell.

[13]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[14]  J. Widom,et al.  Role of DNA sequence in nucleosome stability and dynamics , 2001, Quarterly Reviews of Biophysics.

[15]  Lani F. Wu,et al.  Genome-Scale Identification of Nucleosome Positions in S. cerevisiae , 2005, Science.

[16]  Nir Friedman,et al.  Dynamics of Replication-Independent Histone Turnover in Budding Yeast , 2007, Science.

[17]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[18]  J. Workman,et al.  Alteration of nucleosome structure as a mechanism of transcriptional regulation. , 1998, Annual review of biochemistry.

[19]  Megan F. Cole,et al.  Genome-wide Map of Nucleosome Acetylation and Methylation in Yeast , 2005, Cell.

[20]  Anton J. Enright,et al.  Prediction of microRNA targets. , 2007, Drug discovery today.

[21]  S. Henikoff,et al.  Genome-scale profiling of histone H3.3 replacement patterns , 2005, Nature Genetics.

[22]  L. Stryer,et al.  Statistical distributions of nucleosomes: nonrandom locations by a stochastic mechanism. , 1988, Nucleic acids research.

[23]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[24]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[25]  Renato Paro,et al.  Genome-wide prediction of Polycomb/Trithorax response elements in Drosophila melanogaster. , 2003, Developmental cell.

[26]  Irene K. Moore,et al.  A genomic code for nucleosome positioning , 2006, Nature.

[27]  Alexander J. Hartemink,et al.  Nucleosome Occupancy Information Improves de novo Motif Discovery , 2007, RECOMB.

[28]  A. Hinnen,et al.  Removal of positioned nucleosomes from the yeast PHO5 promoter upon PHO5 induction releases additional upstream activating DNA elements. , 1986, The EMBO journal.

[29]  S. Schreiber,et al.  Global nucleosome occupancy in yeast , 2004, Genome Biology.

[30]  Kami Ahmad,et al.  Rules and regulation in the primary structure of chromatin. , 2007, Current opinion in cell biology.

[31]  Jun S. Liu,et al.  An algorithm for finding protein–DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments , 2002, Nature Biotechnology.

[32]  S. Mallat VI – Wavelet zoom , 1999 .

[33]  Jun S. Liu,et al.  Integrating regulatory motif discovery and genome-wide expression analysis , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[34]  H. Drew,et al.  Sequence periodicities in chicken nucleosome core DNA. , 1986, Journal of molecular biology.

[35]  J. Widom,et al.  Nucleosome packaging and nucleosome positioning of genomic DNA. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Kevin Struhl,et al.  Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. , 2005, Molecular cell.

[37]  B. Pugh,et al.  Identification and Distinct Regulation of Yeast TATA Box-Containing Genes , 2004, Cell.

[38]  T. Richmond,et al.  Crystal structure of the nucleosome core particle at 2.8 Å resolution , 1997, Nature.

[39]  J. Lieb,et al.  Evidence for nucleosome depletion at active regulatory regions genome-wide , 2004, Nature Genetics.

[40]  Kenta Nakai,et al.  BTSS, DataBase of Transcriptional Start Sites: progress report 2004 , 2004, Nucleic Acids Res..

[41]  Michael R. Green,et al.  Dissecting the Regulatory Circuitry of a Eukaryotic Genome , 1998, Cell.

[42]  M. Borodovsky,et al.  Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. , 1996, Journal of molecular biology.

[43]  S. Mallat A wavelet tour of signal processing , 1998 .

[44]  J. Widom,et al.  Poly(dA-dT) Promoter Elements Increase the Equilibrium Accessibility of Nucleosomal DNA Target Sites , 2001, Molecular and Cellular Biology.

[45]  S. Schreiber,et al.  Histone Variant H2A.Z Marks the 5′ Ends of Both Active and Inactive Genes in Euchromatin , 2006, Cell.