Parameterization and classification of the protein universe via geometric techniques.

We present a scheme for the classification of 3487 non-redundant protein structures into 1207 non-hierarchical clusters by using recurring structural patterns of three to six amino acids as keys of classification. This results in several signature patterns, which seem to decide membership of a protein in a functional category. The patterns provide clues to the key residues involved in functional sites as well as in protein-protein interaction. The discovered patterns include a "glutamate double bridge" of superoxide dismutase, the functional interface of the serine protease and inhibitor, interface of homo/hetero dimers, and functional sites of several enzyme families. We use geometric invariants to decide superimposability of structural patterns. This allows the parameterization of patterns and discovery of recurring patterns via clustering. The geometric invariant-based approach eliminates the computationally explosive step of pair-wise comparison of structures. The results provide a vast resource for the biologists for experimental validation of the proposed functional sites, and for the design of synthetic enzymes, inhibitors and drugs.

[1]  B. Dijkstra,et al.  Three-dimensional structure of Endo-1,4-beta-xylanase I from Aspergillus niger: molecular basis for its low pH optimum. , 1996, Journal of molecular biology.

[2]  D. Hilbert,et al.  Ueber die vollen Invariantensysteme , 1893 .

[3]  J M Thornton,et al.  Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipases , 1996, Protein science : a publication of the Protein Society.

[4]  Amos Bairoch,et al.  The PROSITE dictionary of sites and patterns in proteins, its current status , 1993, Nucleic Acids Res..

[5]  R J Fletterick,et al.  The crystal structure of cruzain: a therapeutic target for Chagas' disease. , 1995, Journal of molecular biology.

[6]  David M. Blow,et al.  The tortuous story of Asp…His…Ser: Structural analysis of α-chymotrypsin , 1997 .

[7]  H. Weyl The Classical Groups , 1940 .

[8]  S. Suh,et al.  Crystal structures of thermostable xylose isomerases from Thermus caldophilus and Thermus thermophilus: possible structural determinants of thermostability. , 1999, Journal of molecular biology.

[9]  S. B. Needleman,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 1989 .

[10]  S M Swanson,et al.  Structure of recombinant mouse collagenase-3 (MMP-13). , 1999, Journal of molecular biology.

[11]  R. Russell,et al.  Detection of protein three-dimensional side-chain patterns: new examples of convergent evolution. , 1998, Journal of molecular biology.

[12]  S. Zucker,et al.  Localization of Tissue Inhibitor of Matrix Metalloproteinases in Alzheimer's Disease and Normal Brain , 1995, Journal of neuropathology and experimental neurology.

[13]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[14]  M. Cockett,et al.  The C-terminal domain of 72 kDa gelatinase A is not required for catalysis, but is essential for membrane activation and modulates interactions with tissue inhibitors of metalloproteinases. , 1992, The Biochemical journal.

[15]  Alexey G. Murzin Familiar Strangers , 2019, Twelve Weeks to Change a Life.

[16]  John Kuriyan,et al.  Intramolecular Regulatory Interactions in the Src Family Kinase Hck Probed by Mutagenesis of a Conserved Tryptophan Residue* , 1998, The Journal of Biological Chemistry.

[17]  Chris Sander,et al.  Touring protein fold space with Dali/FSSP , 1998, Nucleic Acids Res..

[18]  B. Atanasov,et al.  Protonation of the beta-lactam nitrogen is the trigger event in the catalytic action of class A beta-lactamases. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R Huber,et al.  Cytochrome c Nitrite Reductase from Wolinella succinogenes , 2000, The Journal of Biological Chemistry.

[20]  M. Soriano-garcia,et al.  Differences in the intersubunit contacts in triosephosphate isomerase from two closely related pathogenic trypanosomes. , 1998, Journal of molecular biology.

[21]  N. Hakulinen,et al.  The crystal structure of beta-glucosidase from Bacillus circulans sp. alkalophilus: ability to form long polymeric assemblies. , 2000, Journal of structural biology.

[22]  Karen N. Allen,et al.  X-ray crystallographic structures of D-xylose isomerase-substrate complexes position the substrate and provide evidence for metal movement during catalysis. , 1994, Biochemistry.

[23]  J. R. Somoza,et al.  The crystal structure of human cathepsin F and its implications for the development of novel immunomodulators. , 2002, Journal of molecular biology.

[24]  T J Oldfield,et al.  Data mining the protein data bank: Residue interactions , 2002, Proteins.

[25]  H. Birkedal‐Hansen,et al.  Matrix metalloproteinases: a review. , 1993, Critical reviews in oral biology and medicine : an official publication of the American Association of Oral Biologists.

[26]  N. Guex,et al.  SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modeling , 1997, Electrophoresis.

[27]  G. Siuzdak,et al.  Antiviral agent blocks breathing of the common cold virus. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[28]  M. Yaguchi,et al.  Mutational and crystallographic analyses of the active site residues of the bacillus circulans xylanase , 1994, Protein science : a publication of the Protein Society.

[29]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[30]  James E. Bray,et al.  Assigning genomic sequences to CATH , 2000, Nucleic Acids Res..

[31]  J. Hermoso,et al.  Crystal structure of beta-glucosidase A from Bacillus polymyxa: insights into the catalytic activity in family 1 glycosyl hydrolases. , 1998, Journal of molecular biology.

[32]  K. Okuyama,et al.  Crystal structure of the E166A mutant of extended-spectrum beta-lactamase Toho-1 at 1.8 A resolution. , 1999, Journal of molecular biology.

[33]  S J Ferguson,et al.  Cytochrome cd1 structure: unusual haem environments in a nitrite reductase and analysis of factors contributing to beta-propeller folds. , 1997, Journal of molecular biology.

[34]  Ferdinando Febbraio,et al.  SDS-resistant Active and Thermostable Dimers Are Obtained from the Dissociation of Homotetrameric β-Glycosidase from Hyperthermophilic Sulfolobus solfataricus in SDS , 2002, The Journal of Biological Chemistry.

[35]  N. Verdaguer,et al.  Structure of human rhinovirus serotype 2 (HRV2). , 2000, Journal of molecular biology.

[36]  Karen N. Allen,et al.  Design, synthesis, and characterization of a potent xylose isomerase inhibitor, D-threonohydroxamic acid, and high-resolution X-ray crystallographic structure of the enzyme-inhibitor complex. , 1995, Biochemistry.

[37]  L. M. Saraiva,et al.  Sequencing the gene encoding desulfovibrio desulfuricans ATCC 27774 nine-heme cytochrome c. , 1999, Biochemical and biophysical research communications.

[38]  J M Thornton,et al.  Three-dimensional structure analysis of PROSITE patterns. , 1999, Journal of molecular biology.

[39]  T. Attwood,et al.  PRINTS--a database of protein motif fingerprints. , 1994, Nucleic acids research.

[40]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[41]  O. Lichtarge,et al.  Evolutionary predictions of binding surfaces and interactions. , 2002, Current opinion in structural biology.

[42]  D. Mumford,et al.  Geometric Invariant Theory , 2011 .

[43]  J. W. Whittaker,et al.  A Glutamate Bridge Is Essential for Dimer Stability and Metal Selectivity in Manganese Superoxide Dismutase* , 1998, The Journal of Biological Chemistry.

[44]  Ashish V. Tendulkar,et al.  Functional sites in protein families uncovered via an objective and automated graph theoretic approach. , 2003, Journal of molecular biology.

[45]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[46]  S. Al-Karadaghi,et al.  Iron superoxide dismutase from the archaeon Sulfolobus solfataricus: analysis of structure and thermostability. , 1999, Journal of molecular biology.

[47]  R. Huber,et al.  Thermus thermophilus cytochrome-c552: A new highly thermostable cytochrome-c structure obtained by MAD phasing. , 1997, Journal of molecular biology.

[48]  G. S. Bell,et al.  Tiny TIM: a small, tetrameric, hyperthermostable triosephosphate isomerase. , 2001, Journal of molecular biology.

[49]  S. Hubbard,et al.  Crystallographic and Solution Studies of an Activation Loop Mutant of the Insulin Receptor Tyrosine Kinase , 2001, The Journal of Biological Chemistry.

[50]  P. Willett,et al.  A graph-theoretic approach to the identification of three-dimensional patterns of amino acid side-chains in protein structures. , 1994, Journal of molecular biology.