A Corpus of OWL DL Ontologies

Tool development for and empirical experimentation in OWL ontology engineering require a wide variety of suitable ontologies as input for testing and evaluation purposes. Empirical activities often resort to (somewhat arbitrarily) hand curated corpora available on the web, such as the NCBO BioPortal and the TONES Repository, or manually select a set of well-known ontologies. Results may be biased, even heavily, towards these datasets. Sampling from a large corpus of ontologies, on the other hand, may lead to more representative results. Current large scale repositories/web crawls are mostly uncurated, suffer from duplication and contain large numbers of ontology versions, variants, and facets, and therefore do not lend themselves to random sampling. In this paper, we describe the creation of a corpus of OWL DL ontologies using strategies such as web crawling, various forms of de-duplications and manual cleaning, which allows random sampling of ontologies for a variety of empirical applications.

[1]  Shonali Krishnaswamy,et al.  Predicting Reasoning Performance Using Ontology Metrics , 2012, SEMWEB.

[2]  Axel Polleres,et al.  OWL: Yet to arrive on the Web of Data? , 2012, LDOW.

[3]  Boris Motik,et al.  The HermiT OWL Reasoner , 2012, ORE.

[4]  Dieter Pfoser Indexing the Trajectories of Moving Objects , 2002 .

[5]  Allan Third "Hidden semantics": what can we learn from the names in an ontology? , 2012, INLG.

[6]  Bijan Parsia,et al.  Extracting Justifications from BioPortal Ontologies , 2012, International Semantic Web Conference.

[7]  James A. Hendler,et al.  Debugging unsatisfiable classes in OWL ontologies , 2005, J. Web Semant..

[8]  Jeff Z. Pan,et al.  Finding Maximally Satisfiable Terminologies for the Description Logic ALC , 2006, AAAI.

[9]  James A. Hendler,et al.  A Survey of the Web Ontology Landscape , 2006, SEMWEB.

[10]  Franz Baader,et al.  Pushing the EL Envelope , 2005, IJCAI.

[11]  Robert Stevens,et al.  Analysing Syntactic Regularities in Ontologies , 2012, OWLED.

[12]  Gerhard Friedrich,et al.  On computing minimal conflicts for ontology debugging , 2008 .

[13]  Guilin Qi,et al.  A Modularization-Based Approach to Finding All Justifications for OWL DL Entailments , 2008, ASWC.

[14]  Bijan Parsia,et al.  Concept-Based Semantic Difference in Expressive Description Logics , 2012, Description Logics.

[15]  Richard Power,et al.  Measuring the Understandability of Deduction Rules for OWL , 2012, WoDOOM@EKAW.

[16]  Jeff Z. Pan,et al.  Approximating OWL-DL Ontologies , 2007, AAAI.

[17]  C. Maria Keet Detecting and Revising Flaws in OWL Object Property Expressions , 2012, EKAW.

[18]  Ian Horrocks,et al.  The Even More Irresistible SROIQ , 2006, KR.

[19]  Li Ding,et al.  Characterizing the Semantic Web on the Web , 2006, SEMWEB.

[20]  Jianfeng Du,et al.  Decomposition-Based Optimization for Debugging of Inconsistent OWL DL Ontologies , 2010, KSEM.

[21]  Ladislav Hluchý,et al.  A Testing Framework for OWL-DL Reasoning , 2008, 2008 Fourth International Conference on Semantics, Knowledge and Grid.

[22]  R. Volz,et al.  Benchmarking OWL Reasoners , 2007 .

[23]  Enrico Motta,et al.  Watson: supporting next generation semantic web applications , 2007 .

[24]  Christopher G. Chute,et al.  BioPortal: ontologies and integrated data resources at the click of a mouse , 2009, Nucleic Acids Res..

[25]  Pascal Hitzler,et al.  Reconciling OWL and Rules , 2011 .

[26]  Diego Calvanese,et al.  The DL-Lite Family and Relations , 2009, J. Artif. Intell. Res..

[27]  Rafael Peñaloza,et al.  Pinpointing in the Description Logic EL , 2007, Description Logics.

[28]  Guilin Qi,et al.  Measuring Incoherence in Description Logic-Based Ontologies , 2007, ISWC/ASWC.

[29]  Sean Bechhofer,et al.  The OWL API: A Java API for OWL ontologies , 2011, Semantic Web.

[30]  Timothy W. Finin,et al.  Swoogle: a search and metadata engine for the semantic web , 2004, CIKM '04.