Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center

Advances in genome sequencing have produced a vast number of protein sequences. However, the functions of a large fraction of these proteins cannot be inferred by current methods that detect sequence homology to proteins of known function. A three-dimensional structure can provide important clues to the molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have determined many 3-D structures of proteins of unknown function, and have inferred possible molecular functions from those structures. Combined with bioinformatics and enzymatic assay tools, the acceleration of protein structure determination through high-throughput pipelines enables rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.
