Amino acid-base interactions: a three-dimensional analysis of protein-DNA interactions at an atomic level

To assess whether there are universal rules that govern amino acid-base recognition, we investigate hydrogen bonds, van der Waals contacts and water-mediated bonds in 129 protein-DNA complex structures. Interactions with the DNA backbone are the most numerous, providing stability rather than specificity. For base interactions, there are significant correlations between base type and amino acid type, which can be rationalised by considering the stereochemistry of protein side chains and the base edges exposed in the DNA structure. Nearly two-thirds of the direct read-out of DNA sequences involves complex networks of hydrogen bonds, which enhance specificity. Two-thirds of all protein-DNA interactions comprise van der Waals contacts, compared to about one-sixth each of hydrogen bonds and water-mediated bonds. This highlights the central importance for complex formation of van der Waals contacts, which have previously been relegated to a secondary role. Although common, water-mediated bonds are usually non-specific, acting as space-fillers at the protein-DNA interface. In conclusion, the majority of the amino acid-base interactions observed follow general principles that apply across all protein-DNA complexes, although there are individual exceptions. We therefore distinguish between interactions whose specificities are 'universal' and those whose specificities are 'context-dependent'. An interactive Web-based atlas of side chain-base contacts provides access to the collected data, including analyses and visualisation of the three-dimensional geometry of the interactions.
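As a rough illustration of how atom-pair contacts at a protein-DNA interface might be sorted into hydrogen bonds and van der Waals contacts and then tallied into fractions like those quoted above, the following minimal sketch uses a purely distance-based rule. It is not the procedure used in the study. The cutoff values (`HBOND_MAX`, `VDW_MAX`), the restriction of hydrogen bonds to N/O pairs, and the function names are all assumptions made for illustration. A realistic analysis would also check donor/acceptor roles and bond angles, and would treat water-mediated bonds separately.

```python
from collections import Counter
from math import dist

# Illustrative cutoffs in angstroms. These are assumptions for this sketch,
# not values taken from the abstract.
HBOND_MAX = 3.35
VDW_MAX = 3.9
POLAR = {"N", "O"}  # simplification: only N/O pairs may form hydrogen bonds


def classify_contact(protein_atom, dna_atom):
    """Classify one protein-DNA atom pair as 'hbond', 'vdw', or None.

    Each atom is an (element, (x, y, z)) tuple with coordinates in angstroms.
    """
    (elem_p, xyz_p), (elem_d, xyz_d) = protein_atom, dna_atom
    d = dist(xyz_p, xyz_d)
    if d <= HBOND_MAX and elem_p in POLAR and elem_d in POLAR:
        return "hbond"
    if d <= VDW_MAX:
        return "vdw"
    return None  # too far apart to count as a contact


def contact_fractions(pairs):
    """Return the fraction of classified contacts that fall in each class."""
    counts = Counter()
    for protein_atom, dna_atom in pairs:
        kind = classify_contact(protein_atom, dna_atom)
        if kind is not None:
            counts[kind] += 1
    total = sum(counts.values())
    return {kind: n / total for kind, n in counts.items()} if total else {}


# Hypothetical atom pairs, invented purely to exercise the functions.
example_pairs = [
    (("N", (0.0, 0.0, 0.0)), ("O", (2.9, 0.0, 0.0))),  # short polar pair
    (("C", (0.0, 0.0, 0.0)), ("C", (3.7, 0.0, 0.0))),  # close non-polar pair
    (("C", (0.0, 0.0, 0.0)), ("N", (3.0, 0.0, 0.0))),  # close, but not both polar
    (("O", (0.0, 0.0, 0.0)), ("N", (6.0, 0.0, 0.0))),  # beyond both cutoffs
]
fractions = contact_fractions(example_pairs)
```

With real coordinates, the atom tuples would come from a structure file parser. The distance-only rule is chosen here for brevity and will overcount hydrogen bonds relative to a geometry-aware method.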
