Are the existing training corpora unnecessarily large?

This research is funded by the Spanish Ministry of Education and Science (TIN2009-14659-C03-01 Project), Universidad Complutense de Madrid and Banco Santander Central Hispano (GR58/08 Research Group Grant).
