Inferring ontology graph structures using OWL reasoning

Background: Ontologies are representations of a conceptualization of a domain. Traditionally, ontologies in biology were represented as directed acyclic graphs (DAGs) that capture the backbone taxonomy and additional relations between classes. These graphs are widely exploited for data analysis in the form of ontology enrichment or computation of semantic similarity. More recently, ontologies have been developed in formal languages such as the Web Ontology Language (OWL) and consist of a set of axioms through which classes are defined or constrained. While the taxonomy of an ontology can be inferred directly from its axioms as one of the standard OWL reasoning tasks, creating general graph structures from OWL ontologies that exploit the ontologies’ semantic content remains a challenge.

Results: We developed a method to transform ontologies into graphs using an automated reasoner while taking into account all relations between classes. By searching for (existential) patterns in the deductive closure of an ontology, we can identify relations between classes that are implied but not asserted, and generate graph structures that encode a large part of the ontology's semantic content. We demonstrate the advantages of our method by applying it to the inference of protein-protein interactions through semantic similarity over the Gene Ontology, and show that performance increases when graph structures are inferred using deductive inference according to our method. Our software and experiment results are available at http://github.com/bio-ontology-research-group/Onto2Graph.

Conclusions: Onto2Graph is a method to generate graph structures from OWL ontologies using automated reasoning. The resulting graphs can be used for improved ontology visualization and ontology-based data analysis.
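The search for existential patterns in the deductive closure can be illustrated with a small Python sketch. This is not the Onto2Graph implementation: the class names, the relation name, and the two hand-written inference rules (inheritance of a restriction down the subclass hierarchy, and generalization of its filler up the hierarchy) are assumptions made for illustration. An actual tool would pose each query "A SubClassOf (R some B)" to an OWL reasoner instead, which covers far more of the language than this toy does.

```python
from itertools import product

# Toy asserted axioms; all class and relation names are hypothetical.
# (A, B) stands for the axiom: A SubClassOf B
SUBCLASS_OF = {
    ("Nucleus", "Organelle"),
    ("Organelle", "CellPart"),
    ("Nucleolus", "NuclearPart"),
}
# (A, R, B) stands for the axiom: A SubClassOf (R some B)
EXISTENTIAL = {
    ("NuclearPart", "part_of", "Nucleus"),
}


def superclasses(cls, subclass_of):
    """Return cls together with all of its transitive superclasses."""
    seen = {cls}
    stack = [cls]
    while stack:
        current = stack.pop()
        for sub, sup in subclass_of:
            if sub == current and sup not in seen:
                seen.add(sup)
                stack.append(sup)
    return seen


def inferred_edges(classes, relations, subclass_of, existential):
    """Collect every (A, R, B) for which A SubClassOf (R some B) follows.

    Simplified entailment check (an assumption of this sketch): the
    pattern holds if some superclass A' of A has an asserted axiom
    A' SubClassOf (R some B') where B' is a subclass of (or equal to) B.
    """
    edges = set()
    for a, r, b in product(classes, relations, classes):
        for a2, r2, b2 in existential:
            if (r2 == r
                    and a2 in superclasses(a, subclass_of)
                    and b in superclasses(b2, subclass_of)):
                edges.add((a, r, b))
    return edges


CLASSES = sorted(
    {c for pair in SUBCLASS_OF for c in pair}
    | {c for a, _, b in EXISTENTIAL for c in (a, b)}
)
RELATIONS = sorted({r for _, r, _ in EXISTENTIAL})

EDGES = inferred_edges(CLASSES, RELATIONS, SUBCLASS_OF, EXISTENTIAL)
# Edges that are implied by the axioms but were never asserted.
IMPLIED_ONLY = EDGES - EXISTENTIAL

for edge in sorted(IMPLIED_ONLY):
    print(edge)
```

In this toy ontology the edge from Nucleolus to Nucleus via part_of is never asserted, yet it appears in the result because Nucleolus is a subclass of NuclearPart. The sketch enumerates every class pair per relation, which is quadratic in the number of classes; it also keeps redundant edges to more general fillers, which a practical graph export would likely prune.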
