Identification of Functional Subclasses in the DJ-1 Superfamily Proteins

Genomics has posed the challenge of determining protein function from sequence and/or 3-D structure. Functional assignment from sequence relationships can be misleading, and structural similarity does not necessarily imply functional similarity. Proteins in the DJ-1 family, many of which are of unknown function, are examples of proteins that share both sequence and fold similarity yet span multiple functional classes. THEMATICS (theoretical microscopic titration curves), an electrostatics-based computational approach to functional site prediction, is used here to sort proteins in the DJ-1 family into different functional classes. Active site residues are predicted for the eight distinct DJ-1 proteins with available 3-D structures. Placing the predicted residues onto a structural alignment of six of these proteins reveals three distinct types of active sites. Each type overlaps only partially with the others, and only one residue is common to all six sets of predicted residues. Human DJ-1 and YajL from Escherichia coli have very similar predicted active sites and belong to the same probable functional group. Protease I, a known cysteine protease from Pyrococcus horikoshii, and PfpI/YhbO from E. coli, a hypothetical protein of unknown function, belong to a separate class. For Protease I, THEMATICS predicts a set of residues typical of a cysteine protease; the prediction for PfpI/YhbO bears some similarity to it. YDR533Cp from Saccharomyces cerevisiae, of unknown function, and the known chaperone Hsp31 from E. coli constitute a third group with nearly identical predicted active sites. Whereas the first four proteins have predicted active sites at dimer interfaces, YDR533Cp and Hsp31 both have predicted sites contained within each subunit. Although YDR533Cp and Hsp31 form different dimers with different orientations between the subunits, their predicted active sites are superimposable within the monomer structures. Thus, the three predicted functional classes form four different types of quaternary structures. The computational prediction of functional sites for protein structures of unknown function provides valuable clues for functional classification.
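
The classification step described above, comparing predicted residue sets after mapping them onto a structural alignment, can be illustrated with a minimal sketch. This is not the THEMATICS method itself, and it is not the procedure used in the study; it only shows one plausible way to group proteins by the overlap of their predicted sites. The protein labels, alignment-column indices, the Jaccard similarity measure, and the 0.5 threshold are all hypothetical choices made for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets (defined as 1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def group_by_overlap(sites, threshold=0.5):
    """Single-linkage grouping: two proteins are linked when the Jaccard
    similarity of their predicted-site sets is >= threshold (an assumed cutoff)."""
    parent = {name: name for name in sites}

    def find(x):
        # Follow parent links to the group representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(sites, 2):
        if jaccard(sites[a], sites[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in sites:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(members) for members in groups.values())


# Hypothetical data: each value is the set of structural-alignment columns
# at which a residue was predicted. These are NOT the paper's predictions.
toy_sites = {
    "protein_A": {10, 11, 12, 40},
    "protein_B": {10, 11, 12, 41},
    "protein_C": {10, 55, 56, 57},
    "protein_D": {10, 55, 56, 58},
    "protein_E": {10, 80, 81, 82},
    "protein_F": {10, 80, 81, 82},
}

groups = group_by_overlap(toy_sites)
# Alignment columns predicted in every protein.
common = set.intersection(*toy_sites.values())

for members in groups:
    print(members)
print("shared by all:", sorted(common))
```

With this toy input the six placeholder proteins fall into three pairs that share a single common column, mirroring the pattern the abstract reports. Real use would require the actual predicted residues and an alignment-derived residue-to-column mapping, neither of which is given here.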
