DNAshape: a method for the high-throughput prediction of DNA structural features on a genomic scale

We present a method and web server for predicting DNA structural features in a high-throughput (HT) manner for massive sequence data. This approach provides the framework for the integration of DNA sequence and shape analyses in genome-wide studies. The HT methodology uses a sliding-window approach to mine DNA structural information obtained from Monte Carlo simulations. It requires only nucleotide sequence as input and instantly predicts multiple structural features of DNA (minor groove width, roll, propeller twist and helix twist). The results of rigorous validations of the HT predictions based on DNA structures solved by X-ray crystallography and NMR spectroscopy, hydroxyl radical cleavage data, statistical analysis and cross-validation, and molecular dynamics simulations provide strong confidence in this approach. The DNAshape web server is freely available at http://rohslab.cmb.usc.edu/DNAshape/.

[1]  B. Honig,et al.  Nuance in the double-helix and its role in protein-DNA recognition. , 2009, Current opinion in structural biology.

[2]  Yaniv Lubling,et al.  Distinct Modes of Regulation by Chromatin Encoded through Nucleosome Positioning Signals , 2008, PLoS Comput. Biol..

[3]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[4]  E. Trifonov,et al.  The pitch of chromatin DNA is reflected in its nucleotide sequence. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[5]  R. Mann,et al.  Cofactor Binding Evokes Latent Differences in DNA Binding Specificity between Hox Proteins , 2011, Cell.

[6]  R. Mann,et al.  Origins of specificity in protein-DNA recognition. , 2010, Annual review of biochemistry.

[7]  D. Case,et al.  A systematic molecular dynamics study of nearest-neighbor effects on base pair and base pair step conformations and fluctuations in B-DNA , 2009, Nucleic acids research.

[8]  Daniele Varsano,et al.  Optical properties of triplex DNA from time-dependent density functional theory. , 2012, The journal of physical chemistry. B.

[9]  Remo Rohs,et al.  Mechanism of origin DNA recognition and assembly of an initiator-helicase complex by SV40 large tumor antigen. , 2013, Cell reports.

[10]  M. Bulyk,et al.  Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape. , 2013, Cell reports.

[11]  H. Drew,et al.  Sequence periodicities in chicken nucleosome core DNA. , 1986, Journal of molecular biology.

[12]  R. Lavery,et al.  Unraveling proteins: a molecular mechanics study. , 1999, Biophysical journal.

[13]  Stephan C. Schuster,et al.  Nucleosome organization in the Drosophila genome , 2008, Nature.

[14]  Stephen C. J. Parker,et al.  Local DNA Topography Correlates with Functional Noncoding Regions of the Human Genome , 2009, Science.

[15]  V. Zhurkin,et al.  DNA sequence-dependent deformability deduced from protein-DNA crystal complexes. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Remo Rohs,et al.  Electrostatic Interactions between Arginines and the Minor Groove in the Nucleosome , 2010, Journal of biomolecular structure & dynamics.

[17]  F. Javier Luque,et al.  Towards a molecular dynamics consensus view of B-DNA flexibility , 2008, Nucleic acids research.

[18]  R. Mann,et al.  The role of DNA shape in protein-DNA recognition , 2009, Nature.

[19]  Duilio Cascio,et al.  The shape of the DNA minor groove directs binding by the DNA-bending protein Fis. , 2010, Genes & development.

[20]  Stephen C. J. Parker,et al.  A map of minor groove shape and electrostatic potential from hydroxyl radical cleavage patterns of DNA. , 2011, ACS chemical biology.

[21]  Daniel Svozil,et al.  Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. , 2007, Biophysical journal.

[22]  R. Sandstrom,et al.  Probing DNA shape and methylation state on a genomic scale with DNase I , 2013, Proceedings of the National Academy of Sciences.

[23]  Remo Rohs,et al.  Molecular flexibility in ab initio drug docking to DNA: binding-site and binding-mode transitions in all-atom Monte Carlo simulations , 2005, Nucleic acids research.

[24]  R. Lavery,et al.  Defining the structure of irregular nucleic acids: conventions and principles. , 1989, Journal of biomolecular structure & dynamics.

[25]  Satoshi Fujii,et al.  Sequence-dependent DNA deformability studied using molecular dynamics simulations , 2007, Nucleic acids research.

[26]  R. Rohs,et al.  Structural and energetic origins of sequence-specific DNA bending: Monte Carlo simulations of papillomavirus E2-DNA binding sites. , 2005, Structure.

[27]  Remo Rohs,et al.  Using internal and collective variables in Monte Carlo simulations of nucleic acid structures: Chain breakage/closure algorithm and associated Jacobians , 2006, J. Comput. Chem..

[28]  Ad Bax,et al.  Overall structure and sugar dynamics of a DNA dodecamer from homo- and heteronuclear dipolar couplings and 31P chemical shift anisotropy , 2003, Journal of biomolecular NMR.

[29]  Clarisse G. Ricci,et al.  Molecular dynamics of DNA: comparison of force fields and terminal nucleotide definitions. , 2010, The journal of physical chemistry. B.

[30]  J. Šponer,et al.  Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of α/γ Conformers , 2007 .

[31]  Michael A. Crickmore,et al.  Functional Specificity of a Hox Protein Mediated by the Recognition of Minor Groove Structure , 2007, Cell.