Layers of Interpretation: On Grammar and Compositionality

With the recent resurgence of interest in semantic annotation of corpora for improved semantic parsing, we observe a tendency, which we view as ill-advised, to conflate sentence meaning and speaker meaning into a single mapping, whether done by annotators or by a parser. We argue instead for the more traditional hypothesis that sentence meaning, but not speaker meaning, is compositional, and accordingly that NLP systems would benefit from reusable, automatically derivable, task-independent semantic representations which target sentence meaning, in order to capture exactly the information in the linguistic signal itself. We further argue that compositional construction of such sentence meaning representations affords better consistency, more comprehensiveness, greater scalability, and less duplication of effort for each new NLP application. For concreteness, we describe one well-tested grammar-based method for producing sentence meaning representations which is efficient for annotators, and which exhibits many of the above benefits. We then report on a small inter-annotator agreement study to quantify the consistency of semantic representations produced via this grammar-based method.
