Evaluating Dependency Parsing Performance on German Learner Language

We present an experiment on dependency parsing of German learner language. Ultimately aiming at evaluating the meaning of learner answers to German reading comprehension questions, we are interested in how reliable a parser trained on native language can identify the main argument relations. To that end, we manually annotated a small set of learner answers and parsed it using MaltParser (Nivre et al., 2007) trained on TuBa-D/Z (Telljohann et al., 2004). The evaluation of the results shows that semantically salient relations such as SUBJ and OBJ can generally be found reliably. Qualitative analysis indicates that the omission of syntactically central material, such as the finite verb, yields incorrect parses while other errors, e.g. in agreement or word order, can still be parsed robustly.

[1]  Walt Detmar Meurers Diagnosing Meaning Errors in Short Answers to Reading Comprehension Questions , 2008 .

[2]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[3]  Patrick Grommes,et al.  Mehrdeutigkeiten und Kategorisierung: Probleme bei der Annotation von Lernerkorpora , 2008 .

[4]  Joakim Nivre,et al.  A Transition-Based Parser for 2-Planar Dependency Structures , 2010, ACL.

[5]  Rodney D. Nielsen,et al.  Recognizing entailment in intelligent tutoring systems* , 2009, Natural Language Engineering.

[6]  Erhard W. Hinrichs,et al.  The Tüba-D/Z Treebank: Annotating German with a Context-Free Backbone , 2004, LREC.

[7]  Wolfgang Menzel,et al.  Error Diagnosis for Language Learning Systems , 1999 .

[8]  Markus Dickinson,et al.  Dependency Annotation for Learner Corpora , 2009 .

[9]  Walt Detmar Meurers,et al.  On using intelligent computer-assisted language learning in real-life foreign language teaching and learning , 2011, ReCALL.

[10]  Anke Lüdeling,et al.  Multi-level error annotation in learner corpora , 2005 .

[11]  Yannick Versley Parser evaluation across Text Types , 2005 .

[12]  Walt Detmar Meurers Compiling a Task-Based Corpus for the Analysis of Learner Language in Context , 2009 .

[13]  Kilian A. Foth Eine umfassende Constraint-Dependenz-Grammatik des Deutschen , 2006 .

[14]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[15]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[16]  Sabine Brants,et al.  The TIGER Treebank , 2001 .