Recognizing Complex, Asymmetric Functional Sites in Protein Structures Using a Bayesian Scoring Function

The increase in known three-dimensional protein structures enables us to build statistical profiles of important functional sites in protein molecules. These profiles can then be used to recognize sites in large-scale automated annotations of new protein structures. We report an improved FEATURE system which recognizes functional sites in protein structures. FEATURE defines multi-level physico-chemical properties and recognizes sites based on the spatial distribution of these properties in the sites' microenvironments. It uses a Bayesian scoring function to compare a query region with the statistical profile built from known examples of sites and control nonsites. We have previously shown that FEATURE can accurately recognize calcium-binding sites and have reported interesting results scanning for calcium-binding sites in the entire Protein Data Bank. Here we report the ability of the improved FEATURE to characterize and recognize geometrically complex and asymmetric sites such as ATP-binding sites and disulfide bond-forming sites. FEATURE does not rely on conserved residues or conserved residue geometry of the sites. We also demonstrate that, in the absence of a statistical profile of the sites, FEATURE can use an artificially constructed profile based on a priori knowledge to recognize the sites in new structures, using redoxin active sites as an example.

[1]  Shmuel Pietrokovski,et al.  New features of the Blocks Database servers , 1999, Nucleic Acids Res..

[2]  J M Thornton,et al.  Three-dimensional structure analysis of PROSITE patterns. , 1999, Journal of molecular biology.

[3]  Douglas L. Brutlag,et al.  Enumerating and Ranking Discrete Motifs , 1997, ISMB.

[4]  D C Richardson,et al.  Kinemages--simple macromolecular graphics for interactive teaching and publication. , 1994, Trends in biochemical sciences.

[5]  Robert D. Finn,et al.  Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins , 1999, Nucleic Acids Res..

[6]  S. Henikoff,et al.  Automated assembly of protein blocks for database searching. , 1991, Nucleic acids research.

[7]  W. W. Wells,et al.  Interactions of platinum complexes with thioltransferase(glutaredoxin), in vitro. , 1991, Biochemical and Biophysical Research Communications - BBRC.

[8]  W. W. Wells,et al.  Identification and characterization of the functional amino acids at the active center of pig liver thioltransferase by site-directed mutagenesis. , 1991, The Journal of biological chemistry.

[9]  M. Nayal,et al.  Predicting Ca(2+)-binding sites in proteins. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[10]  J. Skolnick,et al.  Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases. , 1998, Journal of molecular biology.

[11]  D Eisenberg,et al.  Where metal ions bind in proteins. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Michael G. Rossmann,et al.  Chemical and biological evolution of a nucleotide-binding protein , 1974, Nature.

[13]  R. Altman,et al.  Characterizing the microenvironment surrounding protein sites , 1995, Protein science : a publication of the Protein Society.

[14]  S Karlin,et al.  Clusters of charged residues in protein three-dimensional structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[15]  S Karlin,et al.  Characterizations of diverse residue clusters in protein three-dimensional structures. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[16]  H. Kubinyi QSAR : Hansch analysis and related approaches , 1993 .

[17]  J. Walker,et al.  Distantly related sequences in the alpha‐ and beta‐subunits of ATP synthase, myosin, kinases and other ATP‐requiring enzymes and a common nucleotide binding fold. , 1982, The EMBO journal.

[18]  M Nayal,et al.  Valence screening of water in protein crystals reveals potential Na+ binding sites. , 1996, Journal of molecular biology.

[19]  W. W. Wells,et al.  Catalytic mechanism of thioltransferase. , 1991, The Journal of biological chemistry.

[20]  R. Stevens,et al.  Global Efforts in Structural Genomics , 2001, Science.

[21]  J Skolnick,et al.  Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity. , 1998, Journal of molecular biology.

[22]  Hugo Kubinyi,et al.  3D QSAR in drug design : theory, methods and applications , 2000 .

[23]  Amos Bairoch,et al.  The PROSITE database, its status in 1997 , 1997, Nucleic Acids Res..

[24]  J. Thornton,et al.  Tess: A geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites , 1997, Protein science : a publication of the Protein Society.

[25]  C. Frömmel,et al.  The automatic search for ligand binding sites in proteins of known three-dimensional structure using only geometric criteria. , 1996, Journal of molecular biology.

[26]  Terri K. Attwood,et al.  PRINTS prepares for the new millennium , 1999, Nucleic Acids Res..

[27]  P. R. Sibbald,et al.  The P-loop--a common motif in ATP- and GTP-binding proteins. , 1990, Trends in biochemical sciences.

[28]  D C Richardson,et al.  The kinemage: A tool for scientific communication , 1992, Protein science : a publication of the Protein Society.