OGG: a Biological Ontology for Representing Genes and Genomes in Specific Organisms

In this report, we present the development of the Ontology of Genes and Genomes (OGG), a biological ontology in the domain of genes and genomes. To integrate with other ontologies, OGG is aligned with the Basic Formal Ontology (BFO). OGGspecific term IDs and annotations are designed by mapping to NCBI Taxonomy IDs and NCBI Entrez Gene IDs. Each gene in OGG has over 10 annotation items, includes gene-associated Gene Ontology (GO) and PubMed article information. OGG has represented genes in human, two viruses, and four bacteria. Additionally, 7 OGG subsets are developed to represent genes and genomes of 7 model systems including mouse, fruit fly, zebrafish, yeast, A. thaliana, C. elegans, P. falciparum. An ontology URI dereferencing approach was designed and implemented in Ontobee to resolve the issue of dereferencing OGG terms from different OGG subset documents. OGG can be used in different cases, including SPARQL query of gene information within OGG or in combination with other ontologies, and the OGG gene term reuse in other ontologies (e.g., Vaccine Ontology). The OGG project website is: https://code.google.com/p/ogg/. Keywords—ontology; Ontology of Genes and Genomes (OGG); gene; genome; organism; vaccine; Gene Ontology (GO)

[1]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[2]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[3]  Jian Zhang,et al.  The Protein Ontology: a structured representation of protein forms and complexes , 2010, Nucleic Acids Res..

[4]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[5]  Tanya Z. Berardini,et al.  Cross-product extensions of the Gene Ontology , 2009, J. Biomed. Informatics.

[6]  Yu Lin,et al.  Ontology representation and analysis of vaccine formulation and administration and their effects on vaccine immune responses , 2012, J. Biomed. Semant..

[7]  Bjoern Peters,et al.  VO: Vaccine Ontology , 2009 .

[8]  Olivier Danos,et al.  Nucleotide sequence of the AIDS virus, LAV , 1985, Cell.

[9]  Jessica A. Turner,et al.  Modeling biomedical experimental processes with OBI , 2010, J. Biomed. Semant..

[10]  Michel Schneider,et al.  UniProtKB/Swiss-Prot. , 2007, Methods in molecular biology.

[11]  Jian Zhang,et al.  Protein Ontology: a controlled structured network of protein entities , 2013, Nucleic Acids Res..

[12]  Barry Smith,et al.  SNAP and SPAN: Towards Dynamic Spatial Ontology , 2004, Spatial Cogn. Comput..

[13]  Daniel R. Zerbino,et al.  Ensembl 2014 , 2013, Nucleic Acids Res..

[14]  J. Gern The Sequence of the Human Genome , 2001, Science.

[15]  H. Lodish Molecular Cell Biology , 1986 .

[16]  Barry Smith,et al.  Infectious Disease Ontology , 2010 .

[17]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[18]  Ryan R Brinkman,et al.  OntoFox: web-based support for ontology reuse , 2010, BMC Research Notes.

[19]  N. W. Davis,et al.  The complete genome sequence of Escherichia coli K-12. , 1997, Science.

[20]  Robert T. Chen,et al.  Emerging Vaccine Informatics , 2011, Journal of biomedicine & biotechnology.

[21]  D. Mccormick Sequence the Human Genome , 1986, Bio/Technology.

[22]  E. Caler,et al.  A Nonessential African Swine Fever Virus Gene UK Is a Significant Virulence Determinant in Domestic Swine , 1998, Journal of Virology.

[23]  Riichiro Mizoguchi,et al.  An Ontology of Gene , 2012, ICBO.

[24]  Yu Lin,et al.  Brucellosis Ontology (IDOBRU) as an extension of the Infectious Disease Ontology , 2011, J. Biomed. Semant..

[25]  Karen Eilbeck,et al.  Evolution of the Sequence Ontology terms and relationships , 2009, J. Biomed. Informatics.

[26]  Alan Ruttenberg,et al.  Ontobee: A Linked Data Server and Browser for Ontology Terms , 2011, ICBO.