Confronting the catalytic dark matter encoded by sequenced genomes

Abstract The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim to determine how many enzymes of uncertain or unknown function remain in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a hidden Markov model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the ‘unknown enzyme problem’. We extensively review classical biochemical approaches as well as more recent systematic experimental and computational approaches that can support enzyme function discovery research. Finally, we discuss the possible roles of these elusive catalysts in light of recent developments in enzymology and metabolism, and the significance of the unknown enzyme problem for metabolic modeling, metabolic engineering and rare disease research.
