Towards Better Ontological Support for Recognizing Textual Entailment

Many applications in modern information technology utilize ontological knowledge to increase their performance, precision, and success rate. However, the integration of ontological sources is in general a difficult task since the semantics of all concepts, individuals, and relations must be preserved across the various sources. In this paper we discuss the importance of combined background knowledge for recognizing textual entailment (RTE). We present and analyze formally a new graph-based procedure for integration of concepts and individuals from ontologies based on the hierarchy of WordNet. We embed it in our experimental RTE framework where a deep-shallow semantic text analysis combined with logical inference is used to identify the logical relations between two English texts. Our results show that fine-grained and consistent knowledge coming from diverse sources is a necessary condition determining the correctness and traceability of results. The RTE application performs significantly better when a substantial amount of problem-relevant knowledge has been integrated into its inference process.

[1]  Johan Bos,et al.  Linguistically Motivated Large-Scale NLP with C&C and Boxer , 2007, ACL.

[2]  George Boolos,et al.  Computability and logic , 1974 .

[3]  Ido Dagan,et al.  Recognizing textual entailment: Rational, evaluation and approaches , 2009 .

[4]  Ulrich Callmeier,et al.  PET – a platform for experimentation with efficient HPSG processing techniques , 2000, Natural Language Engineering.

[5]  William McCune,et al.  Mace4 Reference Manual and Guide , 2003, ArXiv.

[6]  Christopher D. Manning,et al.  An extended model of natural logic , 2009, IWCS.

[7]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[8]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[9]  Ian H. Witten,et al.  A knowledge-based search engine powered by wikipedia , 2007, CIKM '07.

[10]  Johan Bos,et al.  Recognising Textual Entailment with Logical Inference , 2005, HLT.

[11]  Gerhard Weikum,et al.  YAGO: A Large Ontology from Wikipedia and WordNet , 2008, J. Web Semant..

[12]  Ulrich Schäfer,et al.  Integrating deep and shallow natural language processing components: representations and hybrid architectures , 2006 .

[13]  Dan I. Moldovan,et al.  A Logic-Based Semantic Approach to Recognizing Textual Entailment , 2006, ACL.

[14]  Ulrich Schäfer,et al.  Shallow Processing with Unification and Typed Feature Structures - Foundations and Applications , 2004, Künstliche Intell..

[15]  Gerhard Weikum,et al.  Transductive Learning for Text Classification Using Explicit Knowledge Models , 2006, PKDD.

[16]  Dan Flickinger,et al.  On building a more effcient grammar by exploiting types , 2000, Natural Language Engineering.

[17]  Barbara H. Partee,et al.  Properties, types and meaning , 1988 .

[18]  Francis Jeffry Pelletier,et al.  Representation and Inference for Natural Language: A First Course in Computational Semantics , 2005, Computational Linguistics.

[19]  Uwe Reyle,et al.  From discourse to logic , 1993 .

[20]  Alon Y. Halevy,et al.  Semantic Integration , 2005, AI Mag..

[21]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[22]  David R. Dowty On the Semantic Content of the Notion of ‘Thematic Role’ , 1989 .

[23]  M. Felisa Verdejo,et al.  Techniques for Recognizing Textual Entailment and Semantic Equivalence , 2005, CAEPIA.

[24]  Ido Dagan,et al.  The Fourth PASCAL Recognizing Textual Entailment Challenge | NIST , 2009 .

[25]  Fabian M. Suchanek,et al.  Integrating YAGO into the Suggested Upper Merged Ontology , 2008, 2008 20th IEEE International Conference on Tools with Artificial Intelligence.

[26]  Alberto Bugarín,et al.  Current Topics in Artificial Intelligence, 11th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2005, Santiago de Compostela, Spain, November 16-18, 2005, Revised Selected Papers , 2006, CAEPIA.

[27]  Oren Etzioni,et al.  Structured Querying of Web Text Data: A Technical Challenge , 2007, CIDR.

[28]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[29]  Rob A. van der Sandt,et al.  Presupposition Projection as Anaphora Resolution , 1992, J. Semant..

[30]  Ted Pedersen,et al.  An Adapted Lesk Algorithm for Word Sense Disambiguation Using WordNet , 2002, CICLing.

[31]  Peter Clark,et al.  The Seventh PASCAL Recognizing Textual Entailment Challenge , 2011, TAC.

[32]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[33]  Michael J. Witbrock,et al.  An Introduction to the Syntax and Content of Cyc , 2006, AAAI Spring Symposium: Formalizing and Compiling Background Knowledge and Its Applications to Knowledge Representation and Question Answering.