Reasoning with bio-ontologies: using relational closure rules to enable practical querying

MOTIVATION Ontologies have become indispensable in the Life Sciences for managing large amounts of knowledge. The use of logics in ontologies ranges from sound modelling to practical querying of that knowledge, thus adding a considerable value. We conceive reasoning on bio-ontologies as a semi-automated process in three steps: (i) defining a logic-based representation language; (ii) building a consistent ontology using that language; and (iii) exploiting the ontology through querying. RESULTS Here, we report on how we have implemented this approach to reasoning on the OBO Foundry ontologies within BioGateway, a biological Resource Description Framework knowledge base. By separating the three steps in a manual curation effort on Metarel, a vocabulary that specifies relation semantics, we were able to apply reasoning on a large scale. Starting from an initial 401 million triples, we inferred about 158 million knowledge statements that allow for a myriad of prospective queries, potentially leading to new hypotheses about for instance gene products, processes, interactions or diseases. AVAILABILITY SPARUL code, a query end point and curated relation types in OBO Format, RDF and OWL 2 DL are freely available at http://www.semantic-systems-biology.org/metarel.

[1]  Baris E. Suzek,et al.  The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..

[2]  María Martín,et al.  The Universal Protein Resource (UniProt) in 2010 , 2010 .

[3]  Yin Chen,et al.  OBO Explorer: an editor for open biomedical ontologies in OWL , 2008, Bioinform..

[4]  Barry Smith,et al.  Biodynamic ontology: applying BFO in the biomedical domain. , 2004, Studies in health technology and informatics.

[5]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[6]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[7]  A. Rector,et al.  Relations in biomedical ontologies , 2005, Genome Biology.

[8]  Nigel W. Hardy,et al.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project , 2008, Nature Biotechnology.

[9]  Kei-Hoi Cheung,et al.  Using semantic web rules to reason on an ontology of pseudogenes , 2010, Bioinform..

[10]  David Lee The semantics of just , 1987 .

[11]  Benjamin M. Good,et al.  The Life Sciences Semantic Web is Full of Creeps! , 2006, Briefings Bioinform..

[12]  Lennart Martens,et al.  The Ontology Lookup Service: more data and better tools for controlled vocabulary queries , 2008, Nucleic Acids Res..

[13]  Martin Kuiper,et al.  Biological knowledge management: the emerging role of the Semantic Web technologies , 2009, Briefings Bioinform..

[14]  Wendy Hall,et al.  The Semantic Web Revisited , 2006, IEEE Intelligent Systems.

[15]  Ian Horrocks,et al.  From SHIQ and RDF to OWL: the making of a Web Ontology Language , 2003, J. Web Semant..

[16]  Bernard De Baets,et al.  Metarel : an Ontology to support the inferencing of Semantic Web relations within Biomedical Ontologies , 2009 .

[17]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[18]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology , 2003, Nucleic Acids Res..

[19]  José L. V. Mejino,et al.  A reference ontology for biomedical informatics: the Foundational Model of Anatomy , 2003, J. Biomed. Informatics.

[20]  Rachael P. Huntley,et al.  The GOA database in 2009—an integrated Gene Ontology Annotation resource , 2008, Nucleic Acids Res..

[21]  Christopher G. Chute,et al.  BioPortal: ontologies and integrated data resources at the click of a mouse , 2009, Nucleic Acids Res..

[22]  Robert Stevens,et al.  The Cell Cycle Ontology: an application ontology for the representation and integrated analysis of the cell cycle process , 2009, Genome Biology.

[23]  Charles J. Petrie The Semantics of "Semantics" , 2009, IEEE Internet Computing.

[24]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[25]  Bernard De Baets,et al.  ONTO-PERL: An API for supporting the development and analysis of bio-ontologies , 2008, Bioinform..

[26]  Bernard De Baets,et al.  BioGateway: a semantic systems biology tool for the life sciences , 2009, BMC Bioinformatics.

[27]  Kerry Innes,et al.  SNOMED CT and its Place in Health Information Management Practice , 2010, Health information management : journal of the Health Information Management Association of Australia.