Protein−DNA binding in the absence of specific base-pair recognition

Significance Understanding molecular mechanisms of how regulatory proteins, called transcription factors (TFs), recognize their specific binding sites encoded into genomic DNA represents one of the central, long-standing problems of molecular biophysics. Strikingly, our experiments demonstrate that DNA context characterized by certain repeat symmetries surrounding specific TF binding sites significantly influences binding specificity. We expect that our results will significantly impact the understanding of molecular, biophysical principles of transcriptional regulation, and significantly improve our ability to predict how variations in DNA sequences, i.e., mutations or polymorphisms, and protein concentrations influence gene expression programs in living cells. Until now, it has been reasonably assumed that specific base-pair recognition is the only mechanism controlling the specificity of transcription factor (TF)−DNA binding. Contrary to this assumption, here we show that nonspecific DNA sequences possessing certain repeat symmetries, when present outside of specific TF binding sites (TFBSs), statistically control TF−DNA binding preferences. We used high-throughput protein−DNA binding assays to measure the binding levels and free energies of binding for several human TFs to tens of thousands of short DNA sequences with varying repeat symmetries. Based on statistical mechanics modeling, we identify a new protein−DNA binding mechanism induced by DNA sequence symmetry in the absence of specific base-pair recognition, and experimentally demonstrate that this mechanism indeed governs protein−DNA binding preferences.

[1]  Lin Yang,et al.  DNAshape: a method for the high-throughput prediction of DNA structural features on a genomic scale , 2013, Nucleic Acids Res..

[2]  E. Fraenkel,et al.  Structural basis of DNA recognition by the heterodimeric cell cycle transcription factor E2F-DP. , 1999, Genes & development.

[3]  R. Young,et al.  Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays , 2004, Nature Genetics.

[4]  Ernest,et al.  Enzymatic synthesis of deoxyribonucleic acid. , 1969, Harvey lectures.

[5]  M. Bulyk,et al.  Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape. , 2013, Cell reports.

[6]  Andrew R. Gehrke,et al.  Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo , 2010, The EMBO journal.

[7]  S. Melcher,et al.  Thermodynamics of the interactions of lac repressor with variants of the symmetric lac operator: effects of converting a consensus site to a non-specific site. , 1997, Journal of molecular biology.

[8]  D. Dunlap,et al.  Determination of the number of proteins bound non-specifically to DNA , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[9]  V. Walsh Models and Theory , 1987 .

[10]  A. Riggs,et al.  The lac represser-operator interaction , 1970 .

[11]  Antoine M. van Oijen,et al.  Tumor suppressor p53 slides on DNA with low friction and high stability. , 2008, Biophysical journal.

[12]  Martha L. Bulyk,et al.  DNA-binding specificity changes in the evolution of forkhead transcription factors , 2013, Proceedings of the National Academy of Sciences.

[13]  P. V. von Hippel,et al.  Diffusion-driven mechanisms of protein translocation on nucleic acids. 1. Models and theory. , 1981, Biochemistry.

[14]  Raluca Gordân,et al.  Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights , 2011, Genome Biology.

[15]  Daniel E. Newburger,et al.  Diversity and Complexity in DNA Recognition by Transcription Factors , 2009, Science.

[16]  Barbara E. Engelhardt,et al.  Stability selection for regression-based models of transcription factor–DNA binding specificity , 2013, Bioinform..

[17]  P. V. von Hippel,et al.  On the specificity of DNA-protein interactions. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Energy Fluctuations Shape Free Energy of Nonspecific Biomolecular Interactions , 2011, 1101.4529.

[19]  D. B. Lukatsky,et al.  DNA sequence correlations shape nonspecific transcription factor-DNA binding affinity. , 2011, Biophysical journal.

[20]  Ariel Afek,et al.  Genome-wide organization of eukaryotic preinitiation complex is influenced by nonconsensus protein-DNA binding. , 2013, Biophysical journal.

[21]  R. Mann,et al.  The role of DNA shape in protein-DNA recognition , 2009, Nature.

[22]  Robert L. Grossman,et al.  A cis-regulatory map of the Drosophila genome , 2011, Nature.

[23]  X. Xie,et al.  Nonspecifically bound proteins spin while diffusing along DNA , 2009, Nature Structural &Molecular Biology.

[24]  J. Josse,et al.  Enzymatic synthesis of deoxyribonucleic acid. VIII. Frequencies of nearest neighbor base sequences in deoxyribonucleic acid. , 1961, The Journal of biological chemistry.

[25]  P. V. von Hippel,et al.  Non-specific DNA binding of genome regulating proteins as a biological control mechanism: I. The lac operon: equilibrium aspects. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[27]  L. Mirny,et al.  Diffusion in correlated random potentials, with applications to DNA. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  P. V. von Hippel,et al.  Facilitated Target Location in Biological Systems* , 2022 .

[29]  E. Cox,et al.  Single molecule measurements of repressor protein 1D diffusion on DNA. , 2006, Physical review letters.

[30]  Anatoly B Kolomeisky,et al.  Physics of protein-DNA interactions: mechanisms of facilitated target search. , 2011, Physical chemistry chemical physics : PCCP.

[31]  L. Mirny,et al.  Kinetics of protein-DNA interaction: facilitated target location in sequence-dependent potential. , 2004, Biophysical journal.

[32]  A. Riggs,et al.  The lac repressor-operator interaction. 3. Kinetic studies. , 1970, Journal of molecular biology.

[33]  P. V. von Hippel,et al.  Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. , 1987, Journal of molecular biology.

[34]  Atina G. Coté,et al.  Evaluation of methods for modeling transcription factor sequence specificity , 2013, Nature Biotechnology.

[35]  A. Grosberg,et al.  How proteins search for their specific sites on DNA: the role of DNA conformation. , 2006, Biophysical journal.

[36]  S. Quake,et al.  A Systems Approach to Measuring the Binding Energy Landscapes of Transcription Factors , 2007, Science.

[37]  A. Philippakis,et al.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities , 2006, Nature Biotechnology.

[38]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[39]  P. V. von Hippel,et al.  From "simple" DNA-protein interactions to the macromolecular machines of gene expression. , 2007, Annual review of biophysics and biomolecular structure.

[40]  Martha L. Bulyk,et al.  UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein–DNA interactions , 2010, Nucleic Acids Res..

[41]  M. Plischke,et al.  Equilibrium statistical physics , 1988 .

[42]  M. Berger,et al.  Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors , 2009, Nature Protocols.