Active Site Prediction for Comparative Model Structures with THEMATICS

THEMATICS (Theoretical Microscopic Titration Curves) is a simple, reliable computational predictor of enzyme active sites from structure. The method, based on well-established finite-difference Poisson-Boltzmann techniques, identifies ionizable residues with anomalous predicted titration behavior. A cluster of two or more such perturbed residues is a very reliable predictor of the active site. The protein need not resemble any previously characterized protein in sequence or structure, but the method does require its three-dimensional structure. We now present evidence that THEMATICS can also locate the active site in structures built by comparative modeling from similar structures. Results are given for 21 sets of proteins, comprising 21 templates and 83 comparative model structures. Detailed results are presented for three sets of orthologous proteins (triosephosphate isomerase, 6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase, and aspartate aminotransferase) and for one set of human homologues of aldose reductase with different functions. THEMATICS correctly locates the active site in the model structures. With a few exceptions, the predicted active sites in the comparative model structures are similar to those of the corresponding template structures. This suggests that the method may be applicable to a much larger set of proteins for which no experimentally determined structure is available.
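
The final selection step described above, keeping only clusters of two or more perturbed residues, can be illustrated with a minimal sketch. It assumes the residues have already been flagged as perturbed by some upstream titration analysis and that each is represented by a single 3D coordinate. The function name `cluster_perturbed`, the single-linkage grouping, and the 9.0 Å distance cutoff are illustrative assumptions, not details taken from the paper.

```python
from itertools import combinations
from math import dist


def cluster_perturbed(residues, cutoff=9.0):
    """Group flagged residues into spatial clusters by single linkage.

    residues: dict mapping a residue label to an (x, y, z) coordinate in
              angstroms (hypothetical input format).
    cutoff:   maximum distance for two residues to be linked; the value
              9.0 is an assumption chosen for illustration only.
    Returns only clusters with two or more members, sorted for stability.
    """
    names = list(residues)
    parent = {n: n for n in names}

    def find(n):
        # Union-find root lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    # Link every pair of residues closer than the cutoff.
    for a, b in combinations(names, 2):
        if dist(residues[a], residues[b]) <= cutoff:
            parent[find(a)] = find(b)

    # Collect members by root and discard isolated residues.
    groups = {}
    for n in names:
        groups.setdefault(find(n), []).append(n)
    return sorted(sorted(g) for g in groups.values() if len(g) >= 2)


if __name__ == "__main__":
    # Made-up coordinates: three residues close together, one far away.
    flagged = {
        "RES_A": (0.0, 0.0, 0.0),
        "RES_B": (4.0, 0.0, 0.0),
        "RES_C": (7.0, 0.0, 0.0),
        "RES_D": (40.0, 40.0, 40.0),
    }
    print(cluster_perturbed(flagged))
```

In this toy input the three nearby residues form one candidate site, while the isolated residue is dropped because a single perturbed residue does not meet the two-or-more criterion.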
