Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar

We motivate and describe the design and development of an emerging encyclopedia of compositional semantics, pursuing three objectives. We first seek to compile a comprehensive catalogue of interoperable semantic analyses, i.e., a precise characterization of meaning representations for a broad range of common semantic phenomena. Second, we operationalize the discovery of semantic phenomena and their definition in terms of what we call their semantic fingerprint, a formal account of the building blocks of meaning representation involved and their configuration. Third, we ground our work in a carefully constructed semantic test suite of minimal exemplars for each phenomenon, along with a `target’ fingerprint that enables automated regression testing. We work towards these objectives by codifying and documenting the body of knowledge that has been constructed in a long-term collaborative effort, the development of the LinGO English Resource Grammar. Documentation of its semantic interface is a prerequisite to use by non-experts of the grammar and the analyses it produces, but this effort also advances our own understanding of relevant interactions among phenomena, as well as of areas for future work in the grammar.

[1]  Stephan Oepen,et al.  WikiWoods: Syntacto-Semantic Annotation for English Wikipedia , 2010, LREC.

[2]  Emily M. Bender,et al.  Parser Evaluation over Local and Non-Local Deep Dependencies in a Large Corpus , 2011, EMNLP.

[3]  Josef Ruppenhofer,et al.  FrameNet II: Extended theory and practice , 2006 .

[4]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[5]  Christopher D. Manning,et al.  LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[6]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[7]  Geoffrey K. Pullum,et al.  Processing English with a Generalized Phrase Structure Grammar , 1982, ACL.

[8]  Ann A. Copestake,et al.  Invited Talk: Slacker Semantics: Why Superficiality, Dependency and Avoidance of Commitment can be the Right Way to Go , 2009, EACL.

[9]  Stephan Oepen,et al.  Discriminant-Based MRS Banking , 2006, LREC.

[10]  Eunsol Choi,et al.  Scaling Semantic Parsers with On-the-Fly Ontology Matching , 2013, EMNLP.

[11]  Berthold Crysmann,et al.  Some Fine Points of Hybrid Natural Language Parsing , 2008, LREC.

[12]  Gerhard Weikum,et al.  Natural Language Questions for the Web of Data , 2012, EMNLP.

[13]  Stephan Oepen,et al.  Semantic Technologies for Querying Linguistic Annotations: An Experiment Focusing on Graph-Structured Data , 2014, LREC.

[14]  Alexander Yates,et al.  Large-scale Semantic Parsing via Schema Matching and Lexicon Extension , 2013, ACL.

[15]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[16]  Claire Bonial,et al.  English PropBank Annotation Guidelines , 2012 .