G+C content dominates intrinsic nucleosome occupancy

BackgroundThe relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and sequence properties that influence DNA structure, including base content. It was recently shown by Kaplan et al. that a probabilistic model using composition of all 5-mers within a nucleosome-sized tiling window accurately predicts intrinsic nucleosome occupancy across an entire genome in vitro. However, the model is complicated, and it is not clear which specific DNA sequence properties are most important for intrinsic nucleosome-forming preferences.ResultsWe find that a simple linear combination of only 14 simple DNA sequence attributes (G+C content, two transformations of dinucleotide composition, and the frequency of eleven 4-bp sequences) explains nucleosome occupancy in vitro and in vivo in a manner comparable to the Kaplan model. G+C content and frequency of AAAA are the most important features. G+C content is dominant, alone explaining ~50% of the variation in nucleosome occupancy in vitro.ConclusionsOur findings provide a dramatically simplified means to predict and understand intrinsic nucleosome occupancy. G+C content may dominate because it both reduces frequency of poly-A-like stretches and correlates with many other DNA structural characteristics. Since G+C content is enriched or depleted at many types of features in diverse eukaryotic genomes, our results suggest that variation in nucleotide composition may have a widespread and direct influence on chromatin structure.

[1]  Andrew V. Colasanti,et al.  A novel roll-and-slide mechanism of DNA folding in chromatin: implications for nucleosome positioning. , 2007, Journal of molecular biology.

[2]  Kevin Struhl,et al.  Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. , 2005, Molecular cell.

[3]  G. Ast,et al.  Chromatin organization marks exon-intron structure , 2009, Nature Structural &Molecular Biology.

[4]  G. Christian Overton,et al.  Conformational and physicochemical DNA features specific for transcription factor binding sites , 1999, Bioinform..

[5]  M. Borodovsky,et al.  Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. , 1996, Journal of molecular biology.

[6]  Yves Moreau,et al.  Comprehensive analysis of the base composition around the transcription start site in Metazoa , 2004, BMC Genomics.

[7]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[8]  Alain Verreault,et al.  Chromatin Challenges during DNA Replication and Repair , 2007, Cell.

[9]  Bing Li,et al.  The Role of Chromatin during Transcription , 2007, Cell.

[10]  Vincent Miele,et al.  DNA physical properties determine nucleosome occupancy from yeast to fly , 2008, Nucleic acids research.

[11]  Irene K. Moore,et al.  A genomic code for nucleosome positioning , 2006, Nature.

[12]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[13]  G Bernardi,et al.  An analysis of eukaryotic genomes by density gradient centrifugation. , 1976, Journal of molecular biology.

[14]  M. Frommer,et al.  CpG islands in vertebrate genomes. , 1987, Journal of molecular biology.

[15]  Phoebe A. Rice,et al.  Protein-Nucleic Acid Interactions , 2008 .

[16]  T. Richmond,et al.  Crystal structure of the nucleosome core particle at 2.8 Å resolution , 1997, Nature.

[17]  Guo-Cheng Yuan,et al.  Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion , 2007, PLoS Comput. Biol..

[18]  Ronald W. Davis,et al.  A high-resolution atlas of nucleosome occupancy in yeast , 2007, Nature Genetics.

[19]  Irene K. Moore,et al.  The DNA-encoded nucleosome organization of a eukaryotic genome , 2009, Nature.

[20]  Peter J. Park,et al.  nuScore: a web-interface for nucleosome positioning predictions , 2008, Bioinform..

[21]  Jun S. Song,et al.  High-throughput mapping of the chromatin structure of human promoters , 2007, Nature Biotechnology.

[22]  Juliane C. Dohm,et al.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing , 2008, Nucleic acids research.

[23]  I. Brukner,et al.  Sequence‐dependent bending propensity of DNA as revealed by DNase I: parameters for trinucleotides. , 1995, The EMBO journal.

[24]  I. Albert,et al.  Nucleosome positions predicted through comparative genomics , 2006, Nature Genetics.

[25]  H. Widlund,et al.  TGGA repeats impair nucleosome formation. , 1998, Journal of molecular biology.

[26]  B. Suter,et al.  Poly(dA.dT) sequences exist as rigid DNA structures in nucleosome-free yeast promoters in vivo. , 2000, Nucleic acids research.

[27]  J. Widom,et al.  Nucleosomal locations of dominant DNA sequence motifs for histone-DNA interactions and nucleosome positioning. , 2004, Journal of molecular biology.

[28]  Lani F. Wu,et al.  Genome-Scale Identification of Nucleosome Positions in S. cerevisiae , 2005, Science.

[29]  A V Sivolob,et al.  Translational positioning of nucleosomes on DNA: the role of sequence-dependent isotropic DNA bending stiffness. , 1995, Journal of molecular biology.

[30]  H R Drew,et al.  Principles of sequence-dependent flexure of DNA. , 1986, Journal of molecular biology.

[31]  R. Gellibolian,et al.  Long CCG triplet repeat blocks exclude nucleosomes: a possible mechanism for the nature of fragile sites in chromosomes. , 1996, Journal of molecular biology.

[32]  C. C. Correll,et al.  Protein-nucleic acid interactions : structural biology , 2008 .

[33]  S. Schreiber,et al.  Global nucleosome occupancy in yeast , 2004, Genome Biology.

[34]  H. Drew,et al.  Sequence periodicities in chicken nucleosome core DNA. , 1986, Journal of molecular biology.

[35]  William Stafford Noble,et al.  Nucleosome positioning signals in genomic DNA. , 2007, Genome research.

[36]  E. Segal,et al.  Poly(da:dt) Tracts: Major Determinants of Nucleosome Organization This Review Comes from a Themed Issue on Protein-nucleic Acid Interactions Edited , 2022 .

[37]  D. Crothers,et al.  Structural origins of adenine-tract bending , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[38]  H R Drew,et al.  DNA bending and its relation to nucleosome positioning. , 1985, Journal of molecular biology.

[39]  R. Wells,et al.  Preferential nucleosome assembly at DNA triplet repeats from the myotonic dystrophy gene. , 1994, Science.

[40]  Steven M. Johnson,et al.  A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. , 2008, Genome research.

[41]  Yaniv Lubling,et al.  Distinct Modes of Regulation by Chromatin Encoded through Nucleosome Positioning Signals , 2008, PLoS Comput. Biol..

[42]  J. Lieb,et al.  Evidence for nucleosome depletion at active regulatory regions genome-wide , 2004, Nature Genetics.