The COMBREX Project: Design, Methodology, and Initial Results

Experimental data exist for only a vanishingly small fraction of sequenced microbial genes. This community page reports the progress of the COMBREX project in addressing this gap using both computational and experimental resources.
