Geometric analysis and comparison of protein-DNA interfaces: why is there no simple code for recognition?

Structural studies of protein-DNA complexes have shown that there are many distinct families of DNA-binding proteins, and have shown that there is no simple "code" describing side-chain/base interactions. However, systematic analysis and comparison of protein-DNA complexes has been complicated by the diversity of observed contacts, the sheer number of complexes currently available and the absence of any consistent method of comparison that retains detailed structural information about the protein-DNA interface. To address these problems, we have developed geometric methods for characterizing the local structural environment in which particular side-chain/base interactions are observed. In particular, we develop methods for analyzing and comparing spatial relationships at the protein-DNA interface. Our method involves attaching local coordinate systems to the DNA bases and to the C(alpha) atoms of the peptide backbone (these are relatively rigid structural units). We use these tools to consider how the position and orientation of the polypeptide backbone (with respect to the DNA) helps to determine what contacts are possible at any given position in a protein-DNA complex. Here, we focus on base contacts that are made in the major groove, and we use spatial relationships in analyzing: (i) the observed patterns of side-chain/base interactions; (ii) observed helix docking orientations; (iii) family/subfamily relationships among DNA-binding proteins; and (iv) broader questions about evolution, altered specificity mutants and the limits for the design of new DNA-binding proteins. Our analysis, which highlights differences in spatial relationships in different complexes and at different positions in a complex, helps explain why there is no simple, general code for protein-DNA recognition.

[1]  A. Aggarwal,et al.  Structure of Bam HI endonuclease bound to DNA: partial folding and unfolding on DNA binding. , 1995, Science.

[2]  Song Tan,et al.  Structure of serum response factor core bound to DNA , 1995, Nature.

[3]  T. Richmond,et al.  The X-ray structure of the GCN4-bZIP bound to ATF/CREB site DNA shows the complex depends on DNA flexibility. , 1993, Journal of molecular biology.

[4]  R. Dickerson,et al.  DNA bending: the prevalence of kinkiness and the virtues of normality. , 1998, Nucleic acids research.

[5]  T. Richmond,et al.  Crystal structure of a yeast TFIIA/TBP/DNA complex , 1996, Nature.

[6]  M. Cleary,et al.  Structure of a HoxB1–Pbx1 Heterodimer Bound to DNA Role of the Hexapeptide and a Fourth Homeodomain Helix in Complex Formation , 1999, Cell.

[7]  S. Harrison,et al.  Conserved residues make similar contacts in two repressor-operator complexes. , 1990, Science.

[8]  S. Grossman,et al.  Crystal structure at 1.7 Å of the bovine papillomavirus-1 E2 DMA-binding domain bound to its DNA target , 1992, Nature.

[9]  Brian W. Matthews,et al.  No code for recognition , 1988, Nature.

[10]  L. Guarente,et al.  Structure of HAP1-18–DNA implicates direct allosteric effect of protein–DNA interactions on transcriptional activation , 1999, Nature Structural Biology.

[11]  J. Kuriyan,et al.  High resolution crystal structure of a paired (Pax) class cooperative homeodomain dimer on DNA , 1995, Cell.

[12]  K. Morikawa,et al.  Structure of a replication-terminator protein complexed with DNA , 1996, Nature.

[13]  E. Pednault,et al.  Nucleic acid structure analysis. Mathematics for local Cartesian and helical structure parameters that are truly comparable between structures. , 1994, Journal of molecular biology.

[14]  B. Matthews,et al.  A new DNA-binding motif in the Skn-1 binding domain–DNA complex , 1998, Nature Structural Biology.

[15]  J. Horton,et al.  Asp34 of PvuII endonuclease is directly involved in DNA minor groove recognition and indirectly involved in catalysis. , 1998, Journal of molecular biology.

[16]  S. Becker,et al.  Three-dimensional structure of the Stat3β homodimer bound to DNA , 1998, Nature.

[17]  Gary Parkinson,et al.  Aromatic hydrogen bond in sequence-specific protein DNA recognition , 1996, Nature Structural Biology.

[18]  A Klug,et al.  Physical basis of a protein-DNA recognition code. , 1997, Current opinion in structural biology.

[19]  J. Darnell,et al.  Crystal Structure of a Tyrosine Phosphorylated STAT-1 Dimer Bound to DNA , 1998, Cell.

[20]  C. Chothia,et al.  The structure of protein-protein recognition sites. , 1990, The Journal of biological chemistry.

[21]  Song Tan,et al.  Crystal structure of the yeast MATα2/MCM1/DNA ternary complex , 1998, Nature.

[22]  Xiaodong Cheng,et al.  Structure of PvuII endonuclease with cognate DNA. , 1994, The EMBO journal.

[23]  Helen M. Berman,et al.  Structure of the CAP-DNA complex at 2.5 angstroms resolution: a complete picture of the protein-DNA interface. , 1997, Journal of molecular biology.

[24]  A. Ferré-D’Amaré,et al.  Structure and function of the b/HLH/Z domain of USF , 1994 .

[25]  P. Sigler,et al.  Structural determinants of nuclear receptor assembly on DNA direct repeats , 1995, Nature.

[26]  R. Sauer Protein-DNA interactions , 1991 .

[27]  Fu Lu,et al.  The structure of PurR mutant L54M shows an alternative route to DNA kinking , 1998, Nature Structural Biology.

[28]  Li Zhang,et al.  Structure of a HAP1–DNA complex reveals dramatically asymmetric DNA binding by a homodimeric protein , 1999, Nature Structural Biology.

[29]  S. Harrison,et al.  The complex between phage 434 repressor DNA-binding domain and operator site OR3: structural differences between consensus and non-consensus half-sites. , 1993, Structure.

[30]  T. Richmond,et al.  Crystal structure of a bZIP/DNA complex at 2.2 A: determinants of DNA specific recognition. , 1995, Journal of molecular biology.

[31]  H. Margalit,et al.  Comprehensive analysis of hydrogen bonds in regulatory protein DNA-complexes: in search of common principles. , 1995, Journal of molecular biology.

[32]  C. Chothia Principles that determine the structure of proteins. , 1984, Annual review of biochemistry.

[33]  H. Nelson,et al.  A new use for the 'wing' of the 'winged' helix-turn-helix motif in the HSF–DNA cocrystal , 1999, Nature Structural Biology.

[34]  Ernest Fraenkel,et al.  Comparison of X-ray and NMR structures for the Antennapedia homeodomain–DNA complex , 1998, Nature Structural &Molecular Biology.

[35]  Robert T. Sauer,et al.  DNA recognition by β-sheets in the Arc represser–operator crystal structure , 1994, Nature.

[36]  R. Kodandapani,et al.  A new pattern for helix–turn–helix recognition revealed by the PU.l ETS–domain–DNA complex , 1996, Nature.

[37]  M. Lazar,et al.  Structural elements of an orphan nuclear receptor-DNA complex. , 1998, Molecular cell.

[38]  R Lavery,et al.  The definition of generalized helicoidal parameters and of axis curvature for irregular nucleic acids. , 1988, Journal of biomolecular structure & dynamics.

[39]  Y. Kyōgoku,et al.  Crystal structure of PHO4 bHLH domain–DNA complex: flanking base recognition , 1997, The EMBO journal.

[40]  S. Burley,et al.  Crystal structure of a TFIIB–TBP–TATA-element ternary complex , 1995, Nature.

[41]  S. Harrison,et al.  Differing roles for zinc fingers in DNA recognition: structure of a six-finger transcription factor IIIA complex. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Juli D. Klemm,et al.  Crystal structure of the Oct-1 POU domain bound to an octamer site: DNA recognition with tethered DNA-binding modules , 1994, Cell.

[43]  T. Ceska,et al.  The crystal structure of an intact human Max-DNA complex: new insights into mechanisms of transcriptional control. , 1997, Structure.

[44]  M. Schumacher,et al.  The X-ray Structure of the PurR-Guanine-purF Operator Complex Reveals the Contributions of Complementary Electrostatic Surfaces and a Water-mediated Hydrogen Bond to Corepressor Specificity and Binding Affinity* , 1997, The Journal of Biological Chemistry.

[45]  N. Seeman,et al.  Sequence-specific Recognition of Double Helical Nucleic Acids by Proteins (base Pairs/hydrogen Bonding/recognition Fidelity/ion Binding) , 2022 .

[46]  Peter König,et al.  The Crystal Structure of the DNA-Binding Domain of Yeast RAP1 in Complex with Telomeric DNA , 1996, Cell.

[47]  Cynthia Wolberger,et al.  The Structure of GABPα/β: An ETS Domain- Ankyrin Repeat Heterodimer Bound to DNA , 1998 .

[48]  F. Winkler,et al.  Mg2+ binding to the active site of EcoRV endonuclease: a crystallographic study of complexes with substrate and product DNA at 2 A resolution. , 1995, Biochemistry.

[49]  Robert G. Martin,et al.  A novel DNA-binding motif in MarA: the first structure for an AraC family transcriptional activator. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[50]  A Klug,et al.  Toward a code for the interactions of zinc fingers with DNA: selection of randomized fingers displayed on phage. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Aneel K. Aggarwal,et al.  Structure of a DNA-bound Ultrabithorax–Extradenticle homeodomain complex , 1999, Nature.

[52]  R. Dickerson,et al.  Helix bending as a factor in protein/DNA recognition , 1997, Biopolymers.

[53]  P. Sigler,et al.  Structure of NF-κB p50 homodimer bound to a κB site , 1998, Nature.

[54]  P. Jeffrey,et al.  Crystal structure of a p53 tumor suppressor-DNA complex: understanding tumorigenic mutations. , 1994, Science.

[55]  P. Cramer,et al.  Structure of the specificity domain of the Dorsal homologue Gambif1 bound to DNA. , 1999, Structure.

[56]  F. Guo,et al.  Asymmetric DNA bending in the Cre-loxP site-specific recombination synapse. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[57]  M. Gerstein,et al.  Binding geometry of α‐helices that recognize DNA , 1995, Proteins.

[58]  S. Harrison,et al.  Crystal structure of a PPR1-DNA complex: DNA recognition by proteins containing a Zn2Cys6 binuclear cluster. , 1994, Genes & development.

[59]  S. Harrison,et al.  Crystal structure of transcription factor E47: E-box recognition by a basic region helix-loop-helix dimer. , 1994, Genes & development.

[60]  P. Sharp,et al.  Homeodomain determinants of major groove recognition. , 1994, Biochemistry.

[61]  P B Sigler,et al.  The 2.1-A crystal structure of an archaeal preinitiation complex: TATA-box-binding protein/transcription factor (II)B core/TATA-box. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[62]  A. Aggarwal,et al.  Structure of the even‐skipped homeodomain complexed to AT‐rich DNA: new perspectives on homeodomain specificity. , 1995, The EMBO journal.

[63]  Michael Carey,et al.  DNA recognition by GAL4: structure of a protein-DNA complex , 1992, Nature.

[64]  Pei-ji Chen,et al.  An exceptionally well-preserved theropod dinosaur from the Yixian Formation of China , 1998, Nature.

[65]  Harold Abelson,et al.  Turtle geometry : the computer as a medium for exploring mathematics , 1983 .

[66]  Carl O. Pabo,et al.  Crystal structure of MyoD bHLH domain-DNA complex: Perspectives on DNA recognition and implications for transcriptional activation , 1994, Cell.

[67]  Cynthia Wolberger,et al.  Crystal structure of a MAT alpha 2 homeodomain-operator complex suggests a general model for homeodomain-DNA interactions. , 1991, Cell.

[68]  C. Pabo,et al.  Crystal structure of a five-finger GLI-DNA complex: new perspectives on zinc fingers. , 1993, Science.

[69]  C. Pabo,et al.  Zif268 protein-DNA complex refined at 1.6 A: a model system for understanding zinc finger-DNA interactions. , 1996, Structure.

[70]  Gregory L. Verdine,et al.  Structure of the NF-κB p50 homodimer bound to DNA , 1995, Nature.

[71]  S K Burley,et al.  Crystal structure of a human TATA box-binding protein/TATA element complex. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[72]  A. Vershon,et al.  Crystal structure of the MATa1/MATalpha2 homeodomain heterodimer in complex with DNA containing an A-tract. , 1998, Nucleic acids research.

[73]  M Suzuki,et al.  DNA recognition code of transcription factors in the helix-turn-helix, probe helix, hormone receptor, and zinc finger families. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[74]  K. Struhl,et al.  The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted α Helices: Crystal structure of the protein-DNA complex , 1992, Cell.

[75]  B. Matthews,et al.  Crystal structure of lambda-Cro bound to a consensus operator at 3.0 A resolution. , 1998, Journal of molecular biology.

[76]  Alexander D. Johnson,et al.  Crystallization and preliminary X‐ray diffraction studies of an a1/α2/DNA ternary complex , 1995 .

[77]  R. Sauer,et al.  DNA recognition by beta-sheets in the Arc repressor-operator crystal structure. , 1994, Nature.

[78]  R. Sauer,et al.  Origins of DNA-binding specificity: role of protein contacts with the DNA backbone. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[79]  C R Calladine,et al.  The assessment of the geometry of dinucleotide steps in double-helical DNA; a new local calculation scheme. , 1995, Journal of molecular biology.

[80]  A. Joachimiak,et al.  Crystal structure of trp represser/operator complex at atomic resolution , 1988, Nature.

[81]  A. Aggarwal,et al.  Structure of IRF-1 with bound DNA reveals determinants of interferon regulation , 1998, Nature.

[82]  H. Kono,et al.  Structure‐based prediction of DNA target sites by regulatory proteins , 1999, Proteins.

[83]  S. Harrison,et al.  Structure of the DNA-binding domains from NFAT, Fos and Jun bound specifically to DNA , 1998, Nature.

[84]  M. Schumacher,et al.  Crystal structure of LacI member, PurR, bound to DNA: minor groove binding by alpha helices. , 1994, Science.

[85]  E. Fraenkel,et al.  Structural basis of DNA recognition by the heterodimeric cell cycle transcription factor E2F-DP. , 1999, Genes & development.

[86]  T. Sixma,et al.  Crystal structure of the specific DNA‐binding domain of Tc3 transposase of C.elegans in complex with transposon DNA , 1997, The EMBO journal.

[87]  Stephen K. Burley,et al.  Co-crystal structure of TBP recognizing the minor groove of a TATA element , 1993, Nature.

[88]  S. Harrison,et al.  The phage 434 OR2/R1-69 complex at 2.5 A resolution. , 1993, Journal of molecular biology.

[89]  L. Beamer,et al.  Refined 1.8 p crystal structure of the ? repressor-operator complex*1 , 1992 .

[90]  K. Yamamoto,et al.  Crystallographic analysis of the interaction of the glucocorticoid receptor with DNA , 2003, Nature.

[91]  M. Suzuki Common features in DNA recognition helices of eukaryotic transcription factors. , 1993, The EMBO journal.

[92]  S K Burley,et al.  Cocrystal structure of YY1 bound to the adeno-associated virus P5 initiator. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[93]  C. Lawson,et al.  Cocrystals of Escherichia coli trp repressor bound to an alternative operator DNA sequence. , 1993, Journal of molecular biology.

[94]  Carl O. Pabo,et al.  Crystal structure of an engrailed homeodomain-DNA complex at 2.8 Å resolution: A framework for understanding homeodomain-DNA interactions , 1990, Cell.

[95]  R. Lavery,et al.  Defining the structure of irregular nucleic acids: conventions and principles. , 1989, Journal of biomolecular structure & dynamics.

[96]  Ronen Marmorstein,et al.  Crystal structure of a PUT3–DNA complex reveals a novel mechanism for DMA recognition by a protein containing a Zn2Cys6 binuclear cluster , 1997, Nature Structural Biology.

[97]  A. Aggarwal,et al.  Structure of the multimodular endonuclease FokI bound to DNA , 1997, Nature.

[98]  John W. R. Schwabe,et al.  The crystal structure of a two zinc-finger peptide reveals an extension to the rules for zinc-finger/DNA recognition , 1993, Nature.

[99]  C. Pabo,et al.  Computer-aided model-building strategies for protein design. , 1986, Biochemistry.

[100]  M Suzuki,et al.  A framework for the DNA-protein recognition code of the probe helix in transcription factors: the chemical and stereochemical rules. , 1994, Structure.

[101]  John W. R. Schwabe,et al.  The crystal structure of the estrogen receptor DNA-binding domain bound to DNA: How receptors discriminate between their response elements , 1993, Cell.

[102]  J. Michael McCarthy,et al.  Introduction to theoretical kinematics , 1990 .

[103]  Sarah E. Ades,et al.  Differential DNA-binding specificity of the engrailed homeodomain: the role of residue 50. , 1994, Biochemistry.

[104]  Claude Desplan,et al.  Crystal structure of a paired domain-DNA complex at 2.5 å resolution reveals structural basis for pax developmental mutations , 1995, Cell.

[105]  S K Burley,et al.  Co-crystal structure of sterol regulatory element binding protein 1a at 2.3 A resolution. , 1998, Structure.

[106]  Stephen K. Burley,et al.  Recognition by Max of its cognate DNA through a dimeric b/HLH/Z domain , 1993, Nature.

[107]  T. Steitz,et al.  The structure of a CAP-DNA complex having two cAMP molecules bound to each monomer. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[108]  Phoebe A Rice,et al.  Crystal Structure of an IHF-DNA Complex: A Protein-Induced DNA U-Turn , 1996, Cell.

[109]  S. Phillips,et al.  Crystal structure of the met represser–operator complex at 2.8 Å resolution reveals DNA recognition by β-strands , 1992, Nature.

[110]  C. Müller,et al.  Crystallographic structure of the T domain–DNA complex of the Brachyury transcription factor , 1997, Nature.

[111]  Helen M. Berman,et al.  Structure of the CAP-DNA Complex at 2.5 Å Resolution: A Complete Picture of the Protein-DNA Interface , 1996 .

[112]  R. Marmorstein,et al.  Structures of SAP-1 bound to DNA targets from the E74 and c-fos promoters: insights into DNA sequence discrimination by Ets proteins. , 1998, Molecular cell.

[113]  S. Burley,et al.  Co-crystal structure of the HNF-3/fork head DNA-recognition motif resembles histone H5 , 1993, Nature.

[114]  G. Ghosh,et al.  Crystal structure of p50/p65 heterodimer of transcription factor NF-κB bound to DNA , 1998, Nature.

[115]  N. Pavletich,et al.  Zinc finger-DNA recognition: crystal structure of a Zif268-DNA complex at 2.1 A , 1991, Science.

[116]  S. Diekmann,et al.  Definitions and nomenclature of nucleic acid structure parameters. , 1989, The EMBO journal.

[117]  E. Fraenkel,et al.  Engrailed homeodomain-DNA complex at 2.2 A resolution: a detailed view of the interface and comparison with other engrailed structures. , 1998, Journal of molecular biology.

[118]  Steven Hahn,et al.  Crystal structure of a yeast TBP/TATA-box complex , 1993, Nature.

[119]  A Klug,et al.  Selection of DNA binding sites for zinc fingers using rationally randomized DNA reveals coded interactions. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[120]  C. Pabo,et al.  High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition. , 1998, Structure.

[121]  K Nadassy,et al.  Structural features of protein-nucleic acid recognition sites. , 1999, Biochemistry.

[122]  H M Berman,et al.  Protein-DNA interactions: A structural analysis. , 1999, Journal of molecular biology.

[123]  J. Berg,et al.  A 2.2 Å resolution crystal structure of a designed zinc finger protein bound to DNA , 1996, Nature Structural Biology.

[124]  M. A. El Hassan,et al.  Structure and conformation of helical nucleic acids: analysis program (SCHNAaP). , 1997, Journal of molecular biology.

[125]  P. Sigler,et al.  The basis for half-site specificity explored through a non-cognate steroid receptor-DNA complex , 1995, Nature Structural Biology.

[126]  M. Rosenfeld,et al.  Structure of Pit-1 POU domain bound to DNA as a dimer: unexpected arrangement and flexibility. , 1997, Genes & development.

[127]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[128]  C. Wolberger,et al.  Crystal Structure of the MATa1/MATα2 Homeodomain Heterodimer Bound to DNA , 1995, Science.

[129]  J A Epstein,et al.  Crystal structure of the human Pax6 paired domain-DNA complex reveals specific roles for the linker region and carboxy-terminal subdomain in DNA binding. , 1999, Genes & development.

[130]  C. Pabo,et al.  DNA recognition by Cys2His2 zinc finger proteins. , 2000, Annual review of biophysics and biomolecular structure.

[131]  M Ptashne,et al.  Recognition of a DNA operator by the repressor of phage 434: a view at high resolution , 1988, Science.

[132]  P. Cramer,et al.  Engineering of diffraction‐quality crystals of the NF‐κB P52 homodimer:DNA complex , 1997, FEBS letters.

[133]  J. N. Mark Glover,et al.  Crystal structure of the heterodimeric bZIP transcription factor c-Fos–c-Jun bound to DNA , 1995, Nature.

[134]  Sarah E. Ades,et al.  Engrailed (Gln50-->Lys) homeodomain-DNA complex at 1.9 A resolution: structural basis for enhanced affinity and altered specificity. , 1997, Structure.

[135]  J M Rosenberg,et al.  Refinement of Eco RI endonuclease crystal structure: a revised protein chain tracing. , 1990, Science.

[136]  R. Dickerson,et al.  Hin recombinase bound to DNA: the origin of specificity in major and minor groove interactions. , 1994, Science.

[137]  Alfonso Mondragón,et al.  The phage 434 Cro/OR1 complex at 2.5 A resolution. , 1991, Journal of molecular biology.

[138]  D. Ringe,et al.  Structure of the metal-ion-activated diphtheria toxin repressor/ tox operator complex , 1998, Nature.

[139]  Wei Yang,et al.  Crystal structure of the site-specific recombinase γδ resolvase complexed with a 34 by cleavage site , 1995, Cell.

[140]  M Gerstein,et al.  Stereochemical basis of DNA recognition by Zn fingers. , 1994, Nucleic acids research.