Intuitive ontology authoring using controlled natural language

Ontologies have been proposed and studied over the last couple of decades as a way to capture and share people's knowledge about the world in a form that computer systems can process. They have the potential to serve as a bridge between the human conceptual understanding of the world and the data produced, processed and stored in computer systems. However, ontologies have so far failed to gain widespread adoption, and so have not realised the original vision of the semantic web as a next generation of the world wide web, where everyone would be able to contribute and interlink their data and knowledge as easily as they can contribute and interlink their websites. One of the main reasons for this lack of adoption is the steep learning curve for authoring ontologies: most people find it too difficult to learn the syntax and formal semantics of ontology languages. Most research has tried to alleviate this problem by finding ways to help people collaborate with knowledge engineers when building ontologies; this approach, however, requires the wide availability of knowledge engineers, who in practice are scarce. In the context of the semantic web, recent research has started looking at ways to capture knowledge directly from domain experts as ontologies. One such approach advocates the use of Controlled Natural Languages (CNL) as a promising way to alleviate the syntactical impediment to writing ontological constructs. However, not much is yet known about the capabilities and limitations of CNL-based ontology authoring by domain experts. It is also unknown what type of automatic tool support can and should be provided to novice ontology authors, although such intelligent tool support is becoming possible due to advances in reasoning with existing ontologies and in related areas such as natural language processing.
This PhD investigates how CNL-based ontology authoring systems can make ontology authoring more accessible to domain experts by providing intelligent tool support. In particular, this thesis iteratively investigates the impact of providing various types of intelligent tool support for authoring ontologies using the Web Ontology Language (OWL) and a controlled natural language called Rabbit. After each iteration of added tool support, we evaluate how it affects the ontology authoring process and what the main limitations of the resulting ontology authoring system are. Based on the limitations found, we decide which further tool support would be most beneficial to novice ontology authors. This methodology resulted in iteratively providing support for (i) understanding the syntactic capabilities and limitations of the chosen controlled natural language; (ii) following appropriate ontology engineering methodologies; (iii) fostering awareness of the logical consequences of adding new knowledge to an ontology; and (iv) interacting with the ontology authoring system via dialogues. The main contributions of this PhD are (i) showing that domain experts benefit from guidance about the ontology authoring process and from understandable syntax error messages that help them find the correct CNL syntax; (ii) the definition of a framework to integrate the syntactical and semantic analyses of ontology authors' inputs; (iii) showing that intuitive feedback about the integration of ontology authors' inputs into an existing ontology benefits ontology authors, as they become aware of potential ontology defects; and (iv) the definition of a framework to analyse and describe ontology authoring in terms of dialogue moves and their discourse structure.
