Comparing Rule-Based and Data-Driven Dependency Parsing of Learner Language

We explore the performance of two dependency parsing approaches, the rulebased WCDG approach (Foth and Menzel 2006) and the data-driven dependency parser MaltParser (Nivre et al. 2007) on texts written by language learners. We show that WCDG outperforms MaltParser in identifying the main functorargument relations, whereas MaltParser is more successful than WCDG in establishing optional, adjunct dependency relations. This can be interpreted as a tradeoff between the rich, hand-crafted lexical resources capturing obligatory argument relations in WCDG and the ability of a datadriven parser to identify optional, adjunct relations based on the linguistic and world knowledge encoded in the gold-standard training corpora.

[1]  Niels Ott,et al.  Evaluating Dependency Parsing Performance on German Learner Language , 2010 .

[2]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[3]  Ines Rehbein,et al.  Syntactic Misuse, Overuse and Underuse: A Study of a Parsed Learner Corpus and its Target Hypothesis , 2010 .

[4]  Koenraad De Smedt,et al.  Syntactic Annotation of Learner Corpora , 2010 .

[5]  Detmar Meurers,et al.  Integrating parallel analysis modules to evaluate the meaning of answers to reading comprehension questions , 2011 .

[6]  Yannick Versley Parser evaluation across Text Types , 2005 .

[7]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[8]  Markus Dickinson,et al.  Dependency Annotation for Learner Corpora , 2009 .

[9]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[10]  Erhard W. Hinrichs,et al.  The Tüba-D/Z Treebank: Annotating German with a Context-Free Backbone , 2004, LREC.

[11]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[12]  Lilja Øvrelid,et al.  Improving data-driven dependency parsing using large-scale LFG grammars , 2009, ACL/IJCNLP.

[13]  Wolfgang Menzel,et al.  Co-Parsing with Competitive Models , 2009, RANLP.

[14]  Kilian A. Foth Eine umfassende Constraint-Dependenz-Grammatik des Deutschen , 2006 .

[15]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[16]  Wolfgang Menzel,et al.  Hybrid Parsing: Using Probabilistic Models as Predictors for a Symbolic Parser , 2006, ACL.

[17]  Walt Detmar Meurers Compiling a Task-Based Corpus for the Analysis of Learner Language in Context , 2009 .

[18]  Joakim Nivre,et al.  A Transition-Based Parser for 2-Planar Dependency Structures , 2010, ACL.