Simplifying Description Logic Ontologies

We study the problem of minimizing TBoxes expressed in the lightweight description logic $\mathcal{EL}$, which forms the basis of several large ontologies such as SNOMED, the Gene Ontology, NCI, and GALEN. We show that TBox minimization is intractable (NP-complete). Despite this negative result, we provide a heuristic technique for minimizing TBoxes. We prove that the heuristic is correct and show that it yields optimal results for a class of ontologies, which we define through an acyclicity constraint over a reference relation between equivalence classes of concepts. To establish the feasibility of our approach, we have implemented the algorithm and evaluated its effectiveness on a small suite of benchmarks.
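
To make the notion of redundancy concrete, the following is a minimal sketch, not the heuristic described above. It assumes a drastically simplified fragment in which every axiom is an atomic inclusion $A \sqsubseteq B$ between concept names, so that entailment reduces to reachability. The function names `entails` and `remove_redundant` are hypothetical and chosen only for illustration.

```python
def entails(axioms, sub, sup):
    """Return True if sub ⊑ sup follows from the atomic inclusions in
    `axioms` by reflexivity and transitivity (a reachability search)."""
    seen, stack = {sub}, [sub]
    while stack:
        node = stack.pop()
        if node == sup:
            return True
        for lhs, rhs in axioms:
            if lhs == node and rhs not in seen:
                seen.add(rhs)
                stack.append(rhs)
    return False


def remove_redundant(axioms):
    """Greedily drop every axiom that the remaining axioms still entail.
    Assumption: axioms are pairs (A, B) standing for A ⊑ B."""
    kept = list(axioms)
    for axiom in list(axioms):
        rest = [other for other in kept if other != axiom]
        if entails(rest, *axiom):
            kept = rest
    return kept


tbox = [("A", "B"), ("B", "C"), ("A", "C")]
reduced = remove_redundant(tbox)  # ("A", "C") follows from the other two
```

In this restricted setting, an acyclic hierarchy has a unique irredundant subset (its transitive reduction), so the greedy pass is optimal there. Once cycles are present, the result depends on the order in which axioms are visited and need not be of minimum size.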
