Semantic Parsing for English as a Second Language

This paper is concerned with semantic parsing for English as a second language (ESL). Motivated by the theoretical emphasis on the learning challenges that occur at the syntax-semantics interface during second language acquisition, we formulate the task based on the divergence between literal and intended meanings. We combine the complementary strengths of English Resource Grammar, a linguistically-precise hand-crafted deep grammar, and TLE, an existing manually annotated ESL UD-TreeBank with a novel reranking model. Experiments demonstrate that in comparison to human annotations, our method can obtain a very promising SemBanking quality. By means of the newly created corpus, we evaluate state-of-the-art semantic parsing as well as grammatical error correction models. The evaluation profiles the performance of neural NLP techniques for handling ESL data and suggests some research directions.

[1]  Emily M. Bender,et al.  Layers of Interpretation: On Grammar and Compositionality , 2015, IWCS.

[2]  Timothy Dozat,et al.  Simpler but More Accurate Semantic Dependency Parsing , 2018, ACL.

[3]  Kevin Duh,et al.  AMR Parsing as Sequence-to-Graph Transduction , 2019, ACL.

[4]  H. Ng,et al.  A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction , 2018, AAAI.

[5]  John Blitzer,et al.  Domain adaptation of natural language processing systems , 2008 .

[6]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[7]  Anna Feldman,et al.  Evaluating and automating the annotation of a learner corpus , 2013, Language Resources and Evaluation.

[8]  Luke S. Zettlemoyer,et al.  Deep Semantic Role Labeling: What Works and What’s Next , 2017, ACL.

[9]  Phil Blunsom,et al.  The Role of Syntax in Vector Space Models of Compositional Semantics , 2013, ACL.

[10]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[11]  Markus Dickinson,et al.  Dependency Annotation for Learner Corpora , 2009 .

[12]  Alexander Koller,et al.  Compositional Semantic Parsing across Graphbanks , 2019, ACL.

[13]  Stefano Rastelli Learner Corpora without Error Tagging , 2013 .

[14]  Stephan Oepen,et al.  Parser Evaluation Using Elementary Dependency Matching , 2011, IWPT.

[15]  Lydia White,et al.  Second language acquisition at the interfaces , 2011 .

[16]  Weiwei Sun,et al.  Semantic Role Labeling for Learner Chinese: the Importance of Syntactic Parsing and L2-L1 Parallel Data , 2018, EMNLP.

[17]  Philipp Koehn,et al.  Abstract Meaning Representation for Sembanking , 2013, LAW@ACL.

[18]  Ari Rappoport,et al.  Universal Conceptual Cognitive Annotation (UCCA) , 2013, ACL.

[19]  Kevin Knight,et al.  Smatch: an Evaluation Metric for Semantic Feature Structures , 2013, ACL.

[20]  Raymond Hendy Susanto,et al.  The CoNLL-2014 Shared Task on Grammatical Error Correction , 2014 .

[21]  Emily M. Bender,et al.  English Resource Semantics , 2016, NAACL.

[22]  Matthias Gallé,et al.  To Annotate or Not? Predicting Performance Drop under Domain Shift , 2019, EMNLP.

[23]  Yan Xiao,et al.  Second Language Acquisition: An Introductory Course , 2014 .

[24]  Weiwei Sun,et al.  Accurate SHRG-Based Semantic Parsing , 2018, ACL.

[25]  Dan Roth,et al.  Annotating ESL Errors: Challenges and Rewards , 2010 .

[26]  James Fleming,et al.  English as a Global language , 1998, Crossings: A Journal of English Studies.

[27]  Sampo Pyysalo,et al.  Universal Dependencies v1: A Multilingual Treebank Collection , 2016, LREC.

[28]  Baobao Chang,et al.  Syntax Aware LSTM model for Semantic Role Labeling , 2017, SPNLP@EMNLP.

[29]  Walt Detmar Meurers,et al.  Towards interlanguage POS annotation for effective learner corpora in SLA and FLT , 2009 .

[30]  A. Sorace Gradience and optionality in mature and developing grammars , 2006 .

[31]  Weiwei Sun,et al.  Neural Maximum Subgraph Parsing for Cross-Domain Semantic Dependency Analysis , 2018, CoNLL.

[32]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[33]  Ah Chung Tsoi,et al.  The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[34]  Emily M. Bender,et al.  Towards an Encyclopedia of Compositional Semantics: Documenting the Interface of the English Resource Grammar , 2014, LREC.

[35]  D. Flickinger Accuracy vs. Robustness in Grammar Engineering , 2010 .

[36]  Weiwei Sun,et al.  Peking at MRP 2019: Factorization- and Composition-Based Parsing for Elementary Dependency Structures , 2019, CoNLL.

[37]  Wei Zhao,et al.  Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data , 2019, NAACL.

[38]  Keisuke Sakaguchi,et al.  Phrase Structure Annotation and Parsing for Learner English , 2016, ACL.

[39]  Boris Katz,et al.  Universal Dependencies for Learner English , 2016, ACL.

[40]  Edward W. D. Whittaker,et al.  Creating a manually error-tagged and shallow-parsed learner corpus , 2011, ACL.

[41]  Weiwei Sun,et al.  Graph-Based Meaning Representations: Design and Processing , 2019, ACL.

[42]  Noam Chomsky,et al.  वाक्यविन्यास का सैद्धान्तिक पक्ष = Aspects of the theory of syntax , 1965 .

[43]  Stephan Oepen,et al.  Discriminant-Based MRS Banking , 2006, LREC.

[44]  Markus Dickinson,et al.  Defining Syntax for Learner Language Annotation , 2012, COLING.

[45]  Kevin Knight,et al.  Generation that Exploits Corpus-Based Statistical Knowledge , 1998, ACL.

[46]  Anke Lüdeling,et al.  Syntactic annotation of non-canonical linguistic structures , 2007 .

[47]  Dan Flickinger,et al.  On building a more effcient grammar by exploiting types , 2000, Natural Language Engineering.