A method for increasing expressivity of Gene Ontology annotations using a compositional approach

Background: The Gene Ontology project integrates data about the function of gene products across a diverse range of organisms, allowing the transfer of knowledge from model organisms to humans, and enabling computational analyses for interpretation of high-throughput experimental and clinical data. The core data structure is the annotation, an association between a gene product and a term from one of the three ontologies comprising the GO. Historically, it has not been possible to provide additional information about the context of a GO term, such as the target gene or the location of a molecular function. This has limited the specificity of knowledge that can be expressed by GO annotations.

Results: The GO Consortium has introduced annotation extensions that enable manually curated GO annotations to capture additional contextual details. Extensions represent effector–target relationships, such as localization dependencies, substrates of protein modifiers, and regulation targets of signaling pathways and transcription factors, as well as spatial and temporal aspects of processes, such as cell or tissue type or developmental stage. We describe the content and structure of annotation extensions, provide examples, and summarize the current usage of annotation extensions.

Conclusions: The additional contextual information captured by annotation extensions improves the utility of functional annotation by representing dependencies between annotations to terms in the different ontologies of GO, external ontologies, or an organism’s gene products. These enhanced annotations can also support sophisticated queries and reasoning, and will provide curated, directional links between many gene products to support pathway and network reconstruction.
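
To make the structure of an annotation extension concrete, the following is a minimal sketch that parses an extension field into relation and target pairs. It assumes the `relation(ID)` notation used in the annotation extension column of GO annotation files, where a comma joins extensions that apply together and a pipe separates independent alternatives. The function name `parse_extensions`, the `Extension` type, and the identifiers in the example string are hypothetical placeholders chosen for illustration and are not taken from the paper.

```python
import re
from typing import List, NamedTuple


class Extension(NamedTuple):
    """One contextual detail attached to an annotation (hypothetical type)."""
    relation: str  # e.g. has_regulation_target, occurs_in
    target: str    # identifier of a gene product or ontology term


# Matches a single 'relation(ID)' expression.
_EXT = re.compile(r"^\s*([A-Za-z_]+)\(([^()]+)\)\s*$")


def parse_extensions(field: str) -> List[List[Extension]]:
    """Split an extension field into groups of (relation, target) pairs.

    Assumed syntax: '|' separates independent groups, and ',' joins
    extensions that must be read together within one group.
    """
    groups: List[List[Extension]] = []
    for alternative in field.split("|"):
        if not alternative.strip():
            continue  # skip empty alternatives, e.g. an empty field
        group: List[Extension] = []
        for part in alternative.split(","):
            match = _EXT.match(part)
            if match is None:
                raise ValueError(f"not a relation(ID) expression: {part!r}")
            group.append(Extension(match.group(1), match.group(2).strip()))
        groups.append(group)
    return groups


# Placeholder identifiers, for illustration only.
example = "has_regulation_target(UniProtKB:P00000),occurs_in(CL:0000000)"
for group in parse_extensions(example):
    for ext in group:
        print(ext.relation, "->", ext.target)
```

Keeping each comma-joined group as its own list preserves the distinction between details that jointly qualify one annotation and details that describe separate contexts, which matters for the queries and reasoning mentioned in the abstract.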
