Underuse of syntactic categories in Falko: a case study on modification

This paper shows how the automatic syntactic analysis of a corpus of advanced learners of German as a foreign language helps in understanding the acquisition of modification. In previous corpus research, modification has been studied only by comparing the distributions of single words (or groups of words) in learner and native speaker data. We argue that studying modification as a syntactic category requires syntactically analyzed corpora. To this end, we sketch our approach to parsing learner language and conduct two contrastive interlanguage studies on modification in the syntactically annotated corpus. These studies show that not only can lexical modifiers be underused (as many other studies have shown), but modification as a whole category (including multi-word modifiers such as prepositional phrases and clausal modifiers such as relative clauses) is also underused in our learner corpus data.
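The kind of category-level comparison described above can be illustrated with a minimal sketch. It assumes each corpus is available as a list of (word form, dependency label) pairs taken from parser output, and it counts every token whose label belongs to a set of modifier labels. The label set (`MO`, `MNR`, `RC`, in the style of TIGER function labels), the function name `modifier_rate`, and the toy sentences are assumptions made for illustration only; they are not the paper's actual annotation scheme, data, or procedure.

```python
from collections import Counter

# Assumed modifier labels (TIGER-style); the paper's actual label set may differ.
MODIFIER_LABELS = {"MO", "MNR", "RC"}


def modifier_rate(tokens, labels=MODIFIER_LABELS, per=1000):
    """Return the number of modifier-labelled tokens per `per` tokens.

    `tokens` is a list of (form, dependency_label) pairs.
    """
    if not tokens:
        return 0.0
    counts = Counter(label for _, label in tokens)
    hits = sum(counts[label] for label in labels)
    return per * hits / len(tokens)


# Toy data, invented for illustration (not taken from Falko).
learner = [("Ich", "SB"), ("lese", "ROOT"), ("Bücher", "OA"), ("gern", "MO")]
native = [
    ("Ich", "SB"), ("lese", "ROOT"), ("oft", "MO"), ("alte", "NK"),
    ("Bücher", "OA"), ("im", "MO"), ("Zug", "NK"), ("gern", "MO"),
]

learner_rate = modifier_rate(learner)
native_rate = modifier_rate(native)

# A ratio below 1 would point towards underuse in the learner data.
print(learner_rate, native_rate, learner_rate / native_rate)
```

Because the count is taken over dependency labels rather than word forms, prepositional phrases and relative clauses contribute to the rate in the same way as single-word modifiers. A real study would also need a significance test and normalization choices that this sketch leaves out.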
