Ontology-Based Interpretation of Natural Language

For humans, understanding a natural language sentence or discourse is so effortless that we hardly ever think about it. For machines, however, the task of interpreting natural language, especially grasping meaning beyond the literal content, has proven extremely difficult and requires a large amount of background knowledge. This book focuses on the interpretation of natural language with respect to specific domain knowledge captured in ontologies. The main contribution is an approach that puts ontologies at the center of the interpretation process. This means that ontologies not only provide a formalization of domain knowledge necessary for interpretation but also support and guide the construction of meaning representations. We start with an introduction to ontologies and demonstrate how linguistic information can be attached to them by means of the ontology lexicon model lemon. These lexica then serve as basis for the automatic generation of grammars, which we use to compositionally construct meaning representations that conform with the vocabulary of an underlying ontology. As a result, the level of representational granularity is not driven by language but by the semantic distinctions made in the underlying ontology and thus by distinctions that are relevant in the context of a particular domain. We highlight some of the challenges involved in the construction of ontology-based meaning representations, and show how ontologies can be exploited for ambiguity resolution and the interpretation of temporal expressions. Finally, we present a question answering system that combines all tools and techniques introduced throughout the book in a real-world application, and sketch how the presented approach can scale to larger, multi-domain scenarios in the context of the Semantic Web.

[1]  Jean Carletta,et al.  The NITE XML Toolkit: Data Model and Query Language , 2005, Lang. Resour. Evaluation.

[2]  Paul Dekker,et al.  A Guide to Dynamic Semantics , 2008 .

[3]  Jan van Eijck,et al.  Computational Semantics with Functional Programming , 2010 .

[4]  Jens Lehmann,et al.  Template-based question answering over RDF data , 2012, WWW.

[5]  Peter Thanisch,et al.  Natural language interfaces to databases – an introduction , 1995, Natural Language Engineering.

[6]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[7]  Nicola Guarino,et al.  Ontologies and Knowledge Bases. Towards a Terminological Clarification , 1995 .

[8]  Aravind K. Joshi,et al.  Tree Adjunct Grammars , 1975, J. Comput. Syst. Sci..

[9]  Nicola Guarino,et al.  An Overview of OntoClean , 2004, Handbook on Ontologies.

[10]  Anne Abeillé,et al.  FTAG : current status and parsing scheme , 1999 .

[11]  Johan Bos,et al.  Linguistically Motivated Large-Scale NLP with C&C and Boxer , 2007, ACL.

[12]  Philipp Cimiano,et al.  Integrating WordNet and Wiktionary with lemon , 2012, Linked Data in Linguistics.

[13]  Nicola Guarino,et al.  Understanding and building, using ontologies , 1997, Int. J. Hum. Comput. Stud..

[14]  James Pustejovsky,et al.  The Semantics of Lexical Underspecification , 1998 .

[15]  Laura Kallmeyer,et al.  Developing a TT-MCTAG for German with an RCG-based Parser , 2008, LREC.

[16]  Philipp Cimiano,et al.  Pythia: Compositional Meaning Construction for Ontology-Based Question Answering on the Semantic Web , 2011, NLDB.

[17]  Menzo Windhouwer,et al.  Linking to Linguistic Data Categories in ISOcat , 2012, Linked Data in Linguistics.

[18]  Mats Rooth,et al.  Structural Ambiguity and Lexical Relations , 1991, ACL.

[19]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[20]  Dov M. Gabbay,et al.  Handbook of Philosophical Logic , 2002 .

[21]  Eduard Hovy,et al.  New Trends of Research in Ontologies and Lexical Resources , 2013, Theory and Applications of Natural Language Processing.

[22]  Mark Liberman,et al.  A formal framework for linguistic annotation , 1999, Speech Commun..

[23]  Michael Moortgat,et al.  Categorial Type Logics , 1997, Handbook of Logic and Language.

[24]  James Pustejovsky,et al.  Annotating and Reasoning about Time and Events , 2005, The Language of Time - A Reader.

[25]  Asunción Gómez-Pérez,et al.  METHONTOLOGY: From Ontological Art Towards Ontological Engineering , 1997, AAAI 1997.

[26]  Steffen Staab,et al.  International Handbooks on Information Systems , 2013 .

[27]  Rudolf Carnap,et al.  Meaning and Necessity , 1947 .

[28]  Remko Scha,et al.  The Interpretation of Relational Nouns , 1988, ACL.

[29]  Juntae Yoon,et al.  A Feature-Based Lexicalized Tree Adjoining Grammar for Korean , 2000 .

[30]  Luke S. Zettlemoyer,et al.  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.

[31]  Enrico Motta,et al.  Is Question Answering fit for the Semantic Web?: A survey , 2011, Semantic Web.

[32]  Aldo Gangemi,et al.  Ontology Design Patterns , 2005 .

[33]  Hans Kamp,et al.  Discourse Representation Theory: What it is and Where it Ought to Go , 1988, Natural Language at the Computer.

[34]  Philipp Cimiano,et al.  Collaborative semantic editing of linked data lexica , 2012, LREC.

[35]  Aravind K. Joshi,et al.  Tree-adjoining grammars and lexicalized grammars , 1992, Tree Automata and Languages.

[36]  Gert Smolka,et al.  Attributive Concept Descriptions with Complements , 1991, Artif. Intell..

[37]  Philipp Cimiano,et al.  Flexible semantic composition with DUDES , 2009 .

[38]  Philipp Cimiano,et al.  A Corpus-Based Approach for the Induction of Ontology Lexica , 2013, NLDB.

[39]  Raymond Reiter,et al.  The Frame Problem in the Situation Calculus: A Simple Solution (Sometimes) and a Completeness Result for Goal Regression , 1991, Artificial and Mathematical Theory of Computation.

[40]  Rohit J. Kate,et al.  Using String-Kernels for Learning Semantic Parsers , 2006, ACL.

[41]  Laura Kallmeyer,et al.  Semantic construction in feature-based TAG , 2003 .

[42]  Marc Kemps-Snijders,et al.  ISOcat: Corralling Data Categories in the Wild , 2008, LREC.

[43]  Johan Bos,et al.  Predicate logic unplugged , 1996 .

[44]  Michael R. Genesereth,et al.  Introduction to Logic, Second Edition , 2013, Introduction to Logic.

[45]  Eduardo H. Galeano,et al.  Soccer in Sun and Shadow , 1998 .

[46]  Uwe Reyle,et al.  UDRT-based Semantics Construction for LTAG - and what it tells us about the role of adjunction in LTAG , 2007 .

[47]  Raymond J. Mooney,et al.  Learning Synchronous Grammars for Semantic Parsing with Lambda Calculus , 2007, ACL.

[48]  Aarne Ranta Grammatical Framework - Programming with Multilingual Grammars , 2011, CSLI Studies in Computational Linguistics.

[49]  Steffen Staab,et al.  What Is an Ontology? , 2009, Handbook on Ontologies.

[50]  Katrin Erk,et al.  Measuring semantic relatedness with vector space models and random walks , 2009, Graph-based Methods for Natural Language Processing.

[51]  John P. McCrae,et al.  Design Patterns for Engineering the Ontology-Lexicon Interface , 2014, Towards the Multilingual Semantic Web.

[52]  James Pustejovsky,et al.  Temporal and Event Information in Natural Language Text , 2005, Lang. Resour. Evaluation.

[53]  Aravind K. Joshi,et al.  An Earley-Type Parsing Algorithm for Tree Adjoining Grammars , 1988, ACL.

[54]  N. F. Noy,et al.  Ontology Development 101: A Guide to Creating Your First Ontology , 2001 .

[55]  Tomek Strzalkowski,et al.  From Discourse to Logic , 1991 .

[56]  Jan van Eijck,et al.  Representing Discourse in Context , 1997, Handbook of Logic and Language.

[57]  Ido Dagan,et al.  Recognizing textual entailment: Rational, evaluation and approaches , 2009 .

[58]  A. Joshi,et al.  Factoring Predicate Argument and Scope Semantics: Underspecified Semantics with LTAG , 2003 .

[59]  Nancy Ide,et al.  Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art , 1998, Comput. Linguistics.

[60]  Richard A. Frost,et al.  Adjectives: A Uniform Semantic Approach , 2005, Canadian AI.

[61]  Masaki Murata,et al.  Hypernym Discovery Based on Distributional Similarity and Hierarchical Structures , 2009, EMNLP.

[62]  Uwe Reyle,et al.  From Discourse to Logic - Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory , 1993, Studies in linguistics and philosophy.

[63]  Manfred Pinkal,et al.  Vagueness, Ambiguity, and Underspecification , 1996 .

[64]  Mark Steedman,et al.  Surface structure and interpretation , 1996, Linguistic inquiry.

[65]  H. Kamp A Theory of Truth and Semantic Representation , 2008 .

[66]  Joseph Le Roux,et al.  XMG: eXtensible MetaGrammar , 2013, Computational Linguistics.

[67]  Mirella Lapata,et al.  Vector-based Models of Semantic Composition , 2008, ACL.

[68]  Nicola Guarino,et al.  A Formal Ontology of Properties , 2000, EKAW.

[69]  Harry Bunt,et al.  Underspecification in Semantic Representations : Which Technique for What Purpose? , 2003 .

[70]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[71]  G. Cinque Adverbs and Functional Heads: A Cross-Linguistic Perspective , 1999 .

[72]  Uwe Reyle,et al.  Dealing with Ambiguities by Underspecification: Construction, Representation and Deduction , 1993, J. Semant..

[73]  Asunción Gómez-Pérez,et al.  Interchanging lexical resources on the Semantic Web , 2012, Language Resources and Evaluation.

[74]  Graeme Hirst,et al.  Semantic Interpretation and the Resolution of Ambiguity , 1987, Studies in natural language processing.

[75]  Christian Chiarcos,et al.  The Open Linguistics Working Group , 2012, LREC.

[76]  Raymond J. Mooney,et al.  Discriminative Reranking for Semantic Parsing , 2006, ACL.

[77]  Sebastian Rudolph,et al.  Foundations of Semantic Web Technologies , 2009 .

[78]  Asunción Gómez-Pérez,et al.  A Pattern-Based Method for Re-Engineering Non-Ontological Resources into Ontologies , 2010, Int. J. Semantic Web Inf. Syst..

[79]  Katrin Erk,et al.  Vector Space Models of Word Meaning and Phrase Meaning: A Survey , 2012, Lang. Linguistics Compass.

[80]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[81]  Enrico Motta,et al.  Evaluating question answering over linked data , 2013, J. Web Semant..

[82]  Thomas R. Gruber,et al.  Toward principles for the design of ontologies used for knowledge sharing? , 1995, Int. J. Hum. Comput. Stud..

[83]  Philipp Cimiano,et al.  Representing and resolving ambiguities in ontology-based question answering , 2011, TextInfer@EMNLP.

[84]  Csongor Nyulas,et al.  WebProtégé: A collaborative ontology editor and knowledge acquisition tool for the Web , 2013, Semantic Web.

[85]  Philipp Cimiano,et al.  Linking Lexical Resources and Ontologies on the Semantic Web with Lemon , 2011, ESWC.

[86]  P. Blackburn,et al.  Book Reviews: The Proper Treatment of Events, by Michiel van Lambalgen and Fritz Hamm , 2005, CL.

[87]  James Pustejovsky,et al.  ISO-TimeML: An International Standard for Semantic Annotation , 2010, LREC.

[88]  Sergei Nirenburg,et al.  Book Review: Ontological Semantics, by Sergei Nirenburg and Victor Raskin , 2004, CL.

[89]  James F. Allen Towards a General Theory of Action and Time , 1984, Artif. Intell..

[90]  Dieter Fensel,et al.  Knowledge Engineering: Principles and Methods , 1998, Data Knowl. Eng..

[91]  Joachim Niehren,et al.  Bridging the gap between underspecification formalisms: hole semantics as dominance constraints , 2002 .

[92]  James F. Allen,et al.  Extracting Events and Temporal Expressions from Text , 2010, 2010 IEEE Fourth International Conference on Semantic Computing.

[93]  Nicola Guarino,et al.  Some Ontological Principles for Designing Upper Level Lexical Resources , 1998, LREC.

[94]  Paul Buitelaar,et al.  LexInfo: A declarative model for the lexicon-ontology interface , 2011, J. Web Semant..

[95]  Walt Detmar Meurers,et al.  Head-driven phrase structure grammar: linguistic approach, formal foundations, and computational realization , 2006 .

[96]  Jerry R. Hobbs,et al.  An ontology of time for the semantic web , 2004, TALIP.

[97]  Uwe Reyle,et al.  Ontology-based semantic construction underspecification and disambiguation , 2003 .

[98]  Johan Bos,et al.  Wide-Coverage Semantic Analysis with Boxer , 2008, STEP.

[99]  Deborah L. McGuinness,et al.  OWL Web ontology language overview , 2004 .

[100]  Mark Steedman,et al.  Temporal Ontology and Temporal Reference , 1988, CL.

[101]  James F. Allen,et al.  Actions and Events in Interval Temporal Logic , 1994 .

[102]  Aravind K. Joshi,et al.  Mathematical and computational aspects of lexicalized grammars , 1990 .

[103]  Jerry R. Hobbs,et al.  Interpretation as Abduction , 1993, Artif. Intell..

[104]  Harry Bunt,et al.  Semantic Underspecification: Which Technique For What Purpose? , 2008 .

[105]  Francis Jeffry Pelletier,et al.  Representation and Inference for Natural Language: A First Course in Computational Semantics , 2005, Computational Linguistics.

[106]  Uwe Reyle,et al.  On Reasoning with Ambiguities , 1995, EACL.

[107]  Gregory Grefenstette,et al.  SEXTANT: Exploring Unexplored Contexts for Semantic Extraction from Syntactic Analysis , 1992, ACL.

[108]  Christoph Tempich,et al.  Argumentation-Based Ontology Engineering , 2007, IEEE Intelligent Systems.

[109]  Mark Steedman,et al.  Dynamic Semantics for Tense and Aspect , 1995, IJCAI.