A database method for automated map interpretation in protein crystallography

A significant portion of new protein structures contain folds that are related to those seen before. During the development of a computer program that can accurately position, in electron density maps, large protein domains with large structural deviations, it became apparent that the redundancy in protein folds could be used in a non trivial manner during a protein structure determination. As a result a computational procedure, Database Assisted Density Interpretation (DADI), was developed and tested to aid in the building of models in protein crystallography and to assist in interpreting electron density maps. The initial tests of the DADI procedure using a small database of protein domains are described. The philosophy is to first work with entire domains then with the secondary structure elements of these domains and finally with individual residues of the secondary structure elements via Monte Carlo, “chopping” and “clipping” procedures, respectively. The first test case was a traceable 3.2 Å multiple isomorphous replacement with anomalous scattering (MIRAS) electron density map of a human topoisomerase I‐DNA complex. The second test case uses poor electron density for the third domain of the diphtheria toxin repressor resulting from a molecular replacement solution with the first two domains. Despite the fact that a fairly small database was employed in these test cases, the DADI procedure was able to find a large portion of the protein backbone with very few errors. In the first case nearly 45% of the backbone and more than 80% of the secondary structure was placed automatically. In the second test case nearly 50% of the third domain was automatically detected. A particular encouraging result was that in both cases more than 75% of the beta sheet secondary structure was found automatically by the DADI procedure. Clearly, the procedures employed are promising avenues to exploit the current explosion of protein structures for the determination of future structures. Proteins 1999;36:526–541. © 1999 Wiley‐Liss, Inc.

[1]  D. McRee 1 – LABORATORY TECHNIQUES , 1993 .

[2]  B C Finzel LORE: exploiting database of known structures. , 1997, Methods in enzymology.

[3]  W G Hol,et al.  High-resolution structure of the diphtheria toxin repressor complexed with cobalt and manganese reveals an SH3-like third domain and suggests a possible role of phosphate as co-corepressor. , 1996, Biochemistry.

[4]  V. Ramakrishnan,et al.  Ribosomal protein structures: insights into the architecture, machinery and evolution of the ribosome. , 1998, Trends in biochemical sciences.

[5]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[6]  J. Berger,et al.  Structure and mechanism of DNA topoisomerase II , 1996, Nature.

[7]  R M Esnouf,et al.  An extensively modified version of MolScript that includes greatly enhanced coloring capabilities. , 1997, Journal of molecular graphics & modelling.

[8]  B. Stec,et al.  Refinement of purothionins reveals solute particles important for lattice formation and toxicity. Part 1: alpha1-purothionin revisited. , 1995, Acta crystallographica. Section D, Biological crystallography.

[9]  D. McRee Practical Protein Crystallography , 1993 .

[10]  Chris Sander,et al.  Dali/FSSP classification of three-dimensional protein folds , 1997, Nucleic Acids Res..

[11]  J L Sussman,et al.  Protein Data Bank archives of three-dimensional macromolecular structures. , 1997, Methods in enzymology.

[12]  E. Pohl,et al.  Comparison of high‐resolution structures of the diphtheria toxin repressor in complex with cobalt and zinc at the cation‐anion binding site , 1997, Protein science : a publication of the Protein Society.

[13]  C. Chothia One thousand families for the molecular biologist , 1992, Nature.

[14]  G. N. Ramachandran,et al.  Stereochemical criteria for polypeptide and protein chain conformations. II. Allowed conformations for a pair of peptide units. , 1965, Biophysical journal.

[15]  W G Hol,et al.  A rapid method for positioning small flexible molecules, nucleic acids, and large protein fragments in experimental electron density maps , 1999, Proteins.

[16]  A. Liljas,et al.  Structure of the C-terminal domain of the ribosomal protein L7/L12 from Escherichia coli at 1.7 A. , 1987, Journal of molecular biology.

[17]  C. Verlinde,et al.  Three-dimensional structure of the diphtheria toxin repressor in complex with divalent cation co-repressors. , 1995, Structure.

[18]  A. Lesk,et al.  Structural principles of alpha/beta barrel proteins: the packing of the interior of the sheet. , 1989, Proteins.

[19]  B. Stec,et al.  Refinement of purothionins reveals solute particles important for lattice formation and toxicity. Part 2: structure of beta-purothionin at 1.7 A resolution. , 1995, Acta crystallographica. Section D, Biological crystallography.

[20]  J. Thornton,et al.  PROMOTIF—A program to identify and analyze structural motifs in proteins , 1996, Protein science : a publication of the Protein Society.

[21]  Cyrus Chothia,et al.  Structural principles of α/β barrel proteins: The packing of the interior of the sheet , 1989 .

[22]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[23]  M. Nilges,et al.  Structure of the pleckstrin homology domain from beta-spectrin. , 1994, Nature.

[24]  J. Champoux,et al.  Crystal structures of human topoisomerase I in covalent and noncovalent complexes with DNA. , 1998, Science.

[25]  W G Hol,et al.  A model for the mechanism of human topoisomerase I. , 1998, Science.

[26]  M Bolognesi,et al.  Three-dimensional structure of the complex between pancreatic secretory trypsin inhibitor (Kazal type) and trypsinogen at 1.8 A resolution. Structure solution, crystallographic refinement and preliminary structural interpretation. , 1982, Journal of molecular biology.

[27]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[28]  P. Kraulis A program to produce both detailed and schematic plots of protein structures , 1991 .

[29]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[30]  Michael Nilges,et al.  Structure of the pleckstrin homology domain from β-spectrin , 1994, Nature.

[31]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[32]  F. Allen,et al.  The Cambridge Crystallographic Data Centre: computer-based search, retrieval, analysis and display of information , 1979 .