Dependency Parsing by Inference over High-recall Dependency Predictions

As more and more syntactically-annotated corpora become available for a wide variety of languages, machine learning approaches to parsing gain interest as a means of developing parsers without having to repeat some of the labor-intensive and language-specific activities required for traditional parser development, such as manual grammar engineering, for each new language. The CoNLL-X shared task on multi-lingual dependency parsing (Buchholz et al., 2006) aims to evaluate and advance the state-of-the-art in machine learning-based dependency parsing by providing a standard benchmark set comprising thirteen languages. In this paper, we describe two different machine learning approaches to the CoNLL-X shared task.

[1]  P. Resnik Treebanks : Building and Using Parsed Corpora , 2022 .

[2]  Eckhard Bick,et al.  Floresta Sintá(c)tica: A treebank for Portuguese , 2002, LREC.

[3]  Joakim Nivre,et al.  MAMBA Meets TIGER: Reconstructing a Swedish Treebank from Antiquity , 2005 .

[4]  Sabine Brants,et al.  The TIGER Treebank , 2001 .

[5]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[6]  Montserrat Civit Torruella,et al.  Design Principles for a Spanish Treebank , 2002 .

[7]  Jan Hajic,et al.  Prague Arabic Dependency Treebank: Development in Data and Tools , 2004 .

[8]  Kiril Ivanov Simov,et al.  Practical Annotation Scheme for an HPSG Treebank of Bulgarian , 2003, LINC@EACL.

[9]  Erik F. Tjong Kim Sang,et al.  Memory-Based Shallow Parsing , 2002, J. Mach. Learn. Res..

[10]  Petya Osenova,et al.  Design and Implementation of the Bulgarian HPSG-based Treebank , 2004 .

[11]  Chu-Ren Huang,et al.  Sinica Treebank: Design Criteria, Representational Issues and Implementation , 2004 .

[12]  Yuji Matsumoto,et al.  Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[13]  Kemal Oflazer,et al.  The Annotation Process in the Turkish Treebank , 2003, LINC@EACL.

[14]  Gertjan van Noord,et al.  The Alpino Dependency Treebank , 2001, CLIN.

[15]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[16]  Anne Abeillé,et al.  Treebanks: Building and Using Parsed Corpora , 2003 .

[17]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .

[18]  Saso Dzeroski,et al.  Towards a Slovene Dependency Treebank , 2006, LREC.