Augmenting WordNet for Deep Understanding of Text

One of the big challenges in understanding text, i.e., constructing an overall coherent representation of the text, is that much information needed in that representation is unstated (implicit). Thus, in order to "fill in the gaps" and create an overall representation, language processing systems need a large amount of world knowledge, and creating those knowledge resources remains a fundamental challenge. In our current work, we are seeking to augment WordNet as a knowledge resource for language understanding in several ways: adding in formal versions of its word sense definitions (glosses); classifying the morphosemantic links between nouns and verbs; encoding a small number of "core theories" about WordNet's most commonly used terms; and adding in simple representations of scripts. Although this is still work in progress, we describe our experiences so far with what we hope will be a significantly improved resource for the deep understanding of language.

[1]  Ramanathan V. Guha,et al.  Building large knowledge-based systems , 1989 .

[2]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[3]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[4]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[5]  Eric Yeh,et al.  Learning Alignments and Leveraging Natural Logic , 2007, ACL-PASCAL@ACL.

[6]  Patrick Pantel,et al.  Discovery of inference rules for question-answering , 2001, Natural Language Engineering.

[7]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[8]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[9]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[10]  Jean Véronis,et al.  EXTRACTING KNOWLEDGE BASES FROM MACHINE- READABLE DICTIONARIES : HAVE WE WASTED OUR TIME? , 1999 .

[11]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[12]  Valeria de Paiva,et al.  Deverbal Nouns in Knowledge Representation , 2006, FLAIRS Conference.

[13]  Nancy Ide,et al.  Knowledge Extraction from Machine-Readable Dictionaries: An Evaluation , 1993, EAMT Workshop.

[14]  Sanda M. Harabagiu,et al.  Textual Entailment Through Extended Lexical Overlap and Lexico-Semantic Matching , 2007, ACL-PASCAL@ACL.

[15]  Christiane Fellbaum,et al.  Putting Semantics into WordNet's "Morphosemantic" Links , 2009, LTC.

[16]  Timothy Chklovski,et al.  Collecting paraphrase corpora from volunteer contributors , 2005, K-CAP '05.

[17]  Jerry R. Hobbs Toward a Useful Concept of Causality for Lexical Semantics , 2005, J. Semant..

[18]  Peter Clark,et al.  Boeing’s NLP System and the Challenges of Semantic Representation , 2008, STEP.

[19]  A. Graesser Prose Comprehension Beyond the Word , 1981 .