A structural approach reveals how neighbouring C2H2 zinc fingers influence DNA binding specificity

Development of an accurate protein–DNA recognition code that can predict DNA specificity from protein sequence is a central problem in biology. C2H2 zinc fingers constitute by far the largest family of DNA binding domains and their binding specificity has been studied intensively. However, despite decades of research, accurate prediction of DNA specificity remains elusive. A major obstacle is thought to be the inability of current methods to account for the influence of neighbouring domains. Here we show that this problem can be addressed using a structural approach: we build structural models for all C2H2-ZF–DNA complexes with known binding motifs and find six distinct binding modes. Each mode changes the orientation of specificity residues with respect to the DNA, thereby modulating base preference. Most importantly, the structural analysis shows that residues at the domain interface strongly and predictably influence the binding mode, and hence specificity. Accounting for predicted binding mode significantly improves prediction accuracy of predicted motifs. This new insight into the fundamental behaviour of C2H2-ZFs has implications for both improving the prediction of natural zinc finger-binding sites, and for prioritizing further experiments to complete the code. It also provides a new design feature for zinc finger engineering.

[1]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[2]  H. Najafabadi,et al.  Mapping and analysis of Caenorhabditis elegans transcription factor sequence specificities , 2015, eLife.

[3]  Mihai Albu,et al.  C2H2 zinc finger proteins greatly expand the human regulatory lexicon , 2015, Nature Biotechnology.

[4]  Benjamin L. Oakes,et al.  A systematic survey of the Cys2His2 zinc finger DNA-binding landscape , 2015, Nucleic acids research.

[5]  Kate B. Cook,et al.  Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity , 2014, Cell.

[6]  M. Bulyk,et al.  Diversification of transcription factor paralogs via noncanonical modularity in C2H2 zinc finger DNA binding. , 2014, Molecular cell.

[7]  Mona Singh,et al.  De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins , 2013, Nucleic acids research.

[8]  Brendan J. Frey,et al.  A compendium of RNA-binding motifs for decoding gene regulation , 2013, Nature.

[9]  Juan M. Vaquerizas,et al.  DNA-Binding Specificities of Human Transcription Factors , 2013, Cell.

[10]  Gary D. Stormo,et al.  An optimized two-finger archive for ZFN-mediated gene targeting , 2012, Nature Methods.

[11]  Helen M. Rowe,et al.  Dynamic control of endogenous retroviruses during development. , 2011, Virology.

[12]  P. Bradley,et al.  Extensive protein and DNA backbone sampling improves structure-based specificity prediction for C2H2 zinc fingers , 2011, Nucleic acids research.

[13]  Feng Zhang,et al.  Selection-Free Zinc-Finger Nuclease Engineering by Context-Dependent Assembly (CoDA) , 2010, Nature Methods.

[14]  H. Kimura,et al.  Proviral silencing in embryonic stem cells requires the histone methyltransferase ESET , 2010, Nature.

[15]  Aaron Klug,et al.  The discovery of zinc fingers and their development for practical applications in gene regulation and genome manipulation , 2010, Quarterly Reviews of Biophysics.

[16]  Helen M. Rowe,et al.  KAP1 controls endogenous retroviruses in embryonic stem cells , 2010, Nature.

[17]  Victor X. Jin,et al.  Genomic Targets of the KRAB and SCAN Domain-containing Zinc Finger Protein 263* , 2009, The Journal of Biological Chemistry.

[18]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[19]  R. Emerson,et al.  Adaptive Evolution in Zinc Finger Transcription Factors , 2009, PLoS genetics.

[20]  Ronnie J Winfrey,et al.  Rapid "open-source" engineering of customized zinc-finger nucleases for highly efficient gene modification. , 2008, Molecular cell.

[21]  M. Noyes,et al.  A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system , 2008, Nucleic acids research.

[22]  David J. Segal,et al.  The Protein-Binding Potential of C2H2 Zinc Finger Domains , 2008, Cell Biochemistry and Biophysics.

[23]  David J. Segal,et al.  Keep Your Fingers Off My DNA: Protein–Protein Interactions Mediated by C2H2 Zinc Finger Domains , 2008, Cell Biochemistry and Biophysics.

[24]  J. Šponer,et al.  Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of α/γ Conformers , 2007 .

[25]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[26]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[27]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[28]  M. Brodsky,et al.  A bacterial one-hybrid system for determining the DNA-binding specificity of transcription factors , 2005, Nature Biotechnology.

[29]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[30]  Michael Feig,et al.  MMTSB Tool Set: enhanced sampling and multiscale modeling methods for applications in structural biology. , 2004, Journal of molecular graphics & modelling.

[31]  W. Olson,et al.  3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures. , 2003, Nucleic acids research.

[32]  Hui Liu,et al.  The KRAB Domain of Zinc Finger Gene ZNF268: a Potential Transcriptional Repressor , 2003, IUBMB life.

[33]  Y. Pang Successful molecular dynamics simulation of two zinc complexes bridged by a hydroxide in phosphotriesterase using the cationic dummy atom method , 2001, Proteins.

[34]  C. Pabo,et al.  Beyond the "recognition code": structures of two Cys2His2 zinc finger/TATA box complexes. , 2001, Structure.

[35]  A. Klug,et al.  A rapid, generally applicable method to engineer zinc fingers illustrated by targeting the HIV-1 promoter , 2001, Nature Biotechnology.

[36]  J. Zhang,et al.  Identification and characterization of DPZF, a novel human BTB/POZ zinc finger protein sharing homology to BCL-6. , 2001, Biochemical and biophysical research communications.

[37]  S. Iuchi,et al.  Three classes of C2H2 zinc finger proteins , 2001, Cellular and Molecular Life Sciences CMLS.

[38]  T. Darden,et al.  Efficient particle-mesh Ewald based approach to fixed and induced dipolar interactions , 2000 .

[39]  A Klug,et al.  Comprehensive DNA recognition through concerted interactions from adjacent zinc fingers. , 1998, Biochemistry.

[40]  A Klug,et al.  Synergy between adjacent zinc fingers in sequence-specific DNA recognition. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[41]  J R Desjarlais,et al.  Toward rules relating zinc finger protein sequences and DNA binding site preferences. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[42]  N. Pavletich,et al.  Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A , 1991, Science.

[43]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[44]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[45]  G. Ciccotti,et al.  Numerical Integration of the Cartesian Equations of Motion of a System with Constraints: Molecular Dynamics of n-Alkanes , 1977 .

[46]  Philip M. Kim,et al.  Information for : “ C 2 H 2 zinc finger proteins greatly expand the human regulatory lexicon ” , 2014 .

[47]  Gary D. Stormo,et al.  Program in Gene Function and Expression Publications and Presentations Program in Gene Function and Expression 4-2014 An improved predictive recognition model for Cys 2-His 2 zinc finger proteins , 2014 .

[48]  L. Stubbs,et al.  Function and Evolution of C2H2 Zinc Finger Arrays. , 2011, Sub-cellular biochemistry.

[49]  T R Hughes,et al.  A catalogue of eukaryotic transcription factor types, their evolutionary origin, and species distribution. , 2011, Sub-cellular biochemistry.

[50]  Daniel Svozil,et al.  Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. , 2007, Biophysical journal.

[51]  Lei Xie,et al.  Using multiple structure alignments, fast model building, and energetic analysis in fold recognition and homology modeling , 2003, Proteins.

[52]  S. Iuchi,et al.  Three classes of C 2 H 2 zinc finger proteins , 2001 .

[53]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[54]  C. Pabo,et al.  DNA recognition by Cys2His2 zinc finger proteins. , 2000, Annual review of biophysics and biomolecular structure.