Use of techniques derived from graph theory to compare secondary structure motifs in proteins.

A substructure matching algorithm is described that can be used for the automatic identification of secondary structural motifs in three-dimensional protein structures from the Protein Data Bank. The proteins and motifs are stored for searching as labelled graphs, in which the nodes correspond to linear representations of helices and strands, and the edges to the inter-line angles and distances. A modification of Ullmann's subgraph isomorphism algorithm is described that can be used to search these graph representations. Tests with patterns from the protein structure literature demonstrate both the efficiency and the effectiveness of the search procedure, which has been implemented in FORTRAN 77 on a MicroVAX-II system and coupled to the molecular fitting program FRODO on an Evans and Sutherland PS300 graphics system.
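
The graph representation and search can be illustrated with a minimal Python sketch. It is not the FORTRAN 77 implementation described above, and it omits the refinement step of Ullmann's algorithm, using a plain backtracking search instead. The names (`make_graph`, `find_motif`), the node labels `'H'` and `'E'`, the units (degrees and ångströms), the tolerance values, and the assumption that every pair of secondary structure elements carries an edge are all hypothetical choices made for illustration.

```python
def make_graph(types, edges):
    """Build a labelled graph of secondary structure elements.

    types: list of node labels, 'H' (helix) or 'E' (strand).
    edges: dict mapping (i, j) with i < j to (angle_deg, distance_A).
    Assumption: the graph is complete, i.e. every node pair has an edge.
    """
    return {"types": types, "edges": edges}


def edge(graph, i, j):
    """Return the (angle, distance) label of the edge between nodes i and j."""
    return graph["edges"][(min(i, j), max(i, j))]


def compatible(p_edge, t_edge, angle_tol, dist_tol):
    """Two edges match if angle and distance agree within the tolerances."""
    return (abs(p_edge[0] - t_edge[0]) <= angle_tol
            and abs(p_edge[1] - t_edge[1]) <= dist_tol)


def find_motif(pattern, target, angle_tol=20.0, dist_tol=2.0):
    """Return all mappings of pattern nodes onto target nodes.

    Each mapping is a tuple whose k-th entry is the target node
    assigned to pattern node k. Tolerances are illustrative defaults.
    """
    n_p = len(pattern["types"])
    n_t = len(target["types"])
    # Initial candidate lists: a pattern node may only map to a
    # target node of the same type (helix to helix, strand to strand).
    candidates = [
        [t for t in range(n_t) if target["types"][t] == pattern["types"][p]]
        for p in range(n_p)
    ]
    matches = []

    def extend(assignment):
        p = len(assignment)          # next pattern node to place
        if p == n_p:
            matches.append(tuple(assignment))
            return
        for t in candidates[p]:
            if t in assignment:      # each target node used at most once
                continue
            # Check the new node against every node placed so far.
            if all(compatible(edge(pattern, q, p),
                              edge(target, assignment[q], t),
                              angle_tol, dist_tol)
                   for q in range(p)):
                assignment.append(t)
                extend(assignment)
                assignment.pop()

    extend([])
    return matches


# Hypothetical motif: two roughly antiparallel strands packed against a helix.
pattern = make_graph(
    ["E", "E", "H"],
    {(0, 1): (180.0, 4.8), (0, 2): (90.0, 10.0), (1, 2): (90.0, 11.0)},
)

# Hypothetical protein with four secondary structure elements.
target = make_graph(
    ["H", "E", "E", "H"],
    {(0, 1): (90.0, 10.0), (0, 2): (85.0, 11.5), (0, 3): (30.0, 20.0),
     (1, 2): (170.0, 5.0), (1, 3): (60.0, 15.0), (2, 3): (40.0, 18.0)},
)

matches = find_motif(pattern, target)
print(matches)
```

With these made-up values the search reports two mappings, `(1, 2, 0)` and `(2, 1, 0)`, because the two strands of the pattern are interchangeable within the tolerances. Tightening `angle_tol` to 1.0 rejects both, since the strand pair in the target differs from the pattern by 10 degrees.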
