Checking Class Labels against Naming Conventions: First experiences with the OntoCheck Protégé plugin

Background: Although ontology naming conventions have been proposed by policy makers, the lack of tool support for testing and enforcing naming practices has hindered widespread compliance. We have developed OntoCheck, a Protege plugin, which allows testing labels in an ontology on naming inconsistencies. Objective: We report on initial experience in applying the tool in different settings and show that OntoCheck contributes to quality assurance in a test-set of ontologies. Methods: We apply OntoCheck in four different ontology engineering efforts and test a variety of different ontologies on prevalence of naming issues. For each, we analyze the percentages of class names and labels violating outlined conventions and correlate the check types to the set of OBO Foundry naming conventions. Results: Application of OntoCheck revealed that heterogeneity in class labels is still a common feature, even in release versions of ontologies, and that many of these could be detected and rectified by tool support. Nearly half of the OBO Foundry naming conventions could be assisted by OntoCheck, the remaining fraction relying on more complicated parsing and availability of lexica. Besides requirements drawn from naming conventions themselves, mismatches in string-based ontology alignment algorithms are identified as sanity check on the impact of labelling consistency. Analysing the prevalence of false positive and negative ontology alignment mismatches could prove valuable in deriving new naming conventions and test their effects in cross ontology harmonization efforts. Conclusion: Our results show that typographical and syntactical labelling heterogeneity can be improved by tool support. The application of OntoCheck supports the verification of naming conventions and will ultimately ease string based ontology alignment.

[1]  Marc Ehrig,et al.  State of the art on ontology alignment , 2013 .

[2]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems 2005: CoopIS, DOA, and ODBASE, OTM Confederated International Conferences CoopIS, DOA, and ODBASE 2005, Agia Napa, Cyprus, October 31 - November 4, 2005, Proceedings, Part I , 2005, OTM Conferences.

[3]  M. Ashburner,et al.  The OBO Foundry: coordinated evolution of ontologies to support biomedical data integration , 2007, Nature Biotechnology.

[4]  Martin Hepp,et al.  GoodRelations: An Ontology for Describing Products and Services Offers on the Web , 2008, EKAW.

[5]  Erhard Rahm,et al.  Evolution of the COMA match system , 2011, OM.

[6]  Jan Nößner,et al.  CODI: Combinatorial Optimization for Data Integration: results for OAEI 2011 , 2010, OM.

[7]  Bernardo Cuenca Grau,et al.  LogMap: Logic-Based and Scalable Ontology Matching , 2011, SEMWEB.

[8]  Dietrich Rebholz-Schuhmann,et al.  Reuse of terminological resources for efficient ontological engineering in Life Sciences , 2009, BMC Bioinformatics.

[9]  Christopher J. Mungall,et al.  Obol: Integrating Language and Meaning in Bio-Ontologies , 2004, Comparative and functional genomics.

[10]  Robert Stevens,et al.  Building Ontologies in DAML + OIL , 2003, Comparative and functional genomics.

[11]  Mark A. Musen,et al.  Anchor-PROMPT: Using Non-Local Context for Semantic Matching , 2001, OIS@IJCAI.

[12]  Heiner Stuckenschmidt,et al.  Ontology Alignment Evaluation Initiative: Six Years of Experience , 2011, J. Data Semant..

[13]  Olivier Bodenreider,et al.  Of Mice and Men: Aligning Mouse and Human Anatomies , 2005, AMIA.

[14]  Cosmin Stroe,et al.  AgreementMaker: Efficient Matching for Large Real-World Schemas and Ontologies , 2009, Proc. VLDB Endow..

[15]  Christel Daniel-Le Bozec,et al.  The DebugIT Core Ontology: semantic integration of antibiotics resistance patterns , 2010, MedInfo.

[16]  Jérôme Euzenat,et al.  Ten Challenges for Ontology Matching , 2008, OTM Conferences.

[17]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems: OTM 2008 , 2008, Lecture Notes in Computer Science.

[18]  Johanna Völker,et al.  Linguistic analysis for complex ontology matching , 2010, OM.

[19]  Chris F. Taylor,et al.  Survey-based naming conventions for use in OBO Foundry ontology development , 2009, BMC Bioinformatics.

[20]  Paul Buitelaar,et al.  A Protégé Plug-In for Ontology Extraction from Text Based on Linguistic Analysis , 2004, ESWS.

[21]  Tharam S. Dillon,et al.  On the Move to Meaningful Internet Systems, OTM 2010 , 2010, Lecture Notes in Computer Science.

[22]  Vojtech Svátek,et al.  Analysing Ontological Structures through Name Pattern Tracking , 2008, EKAW.