Probabilistic Parsing Action Models for Multi-Lingual Dependency Parsing

Deterministic dependency parsers use parsing actions to construct dependencies. These parsers do not compute the probability of the whole dependency tree. They only determine parsing actions stepwisely by a trained classifier. To globally model parsing actions of all steps that are taken on the input sentence, we propose two kinds of probabilistic parsing action models that can compute the probability of the whole dependency tree. The tree with the maximal probability is outputted. The experiments are carried on 10 languages, and the results show that our probabilistic parsing action models outperform the original deterministic dependency parser.

[1]  Sabine Buchholz,et al.  CoNLL-X Shared Task on Multilingual Dependency Parsing , 2006, CoNLL.

[2]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[3]  Díaz de Ilarraza Construction of a Basque Dependency Treebank , 2003 .

[4]  Lluís Màrquez i Villodre,et al.  Anotación semiautomática con papeles temáticos de los corpus CESS-ECE , 2007, Proces. del Leng. Natural.

[5]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[6]  Chu-Ren Huang,et al.  Sinica Treebank: Design Criteria, Representational Issues and Implementation , 2004 .

[7]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[8]  Yuji Matsumoto,et al.  Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[9]  Jan Hajic,et al.  Prague Arabic Dependency Treebank: Development in Data and Tools , 2004 .

[10]  Stelios Piperidis,et al.  Theoretical and Practical Issues in the Construction of a Greek Dependency Treebank , 2005 .

[11]  Joakim Nivre,et al.  Labeled Pseudo-Projective Dependency Parsing with Support Vector Machines , 2006, CoNLL.

[12]  Roberto Basili,et al.  Building the Italian Syntactic-Semantic Treebank , 2003 .

[13]  János Csirik,et al.  The Szeged Treebank , 2005, TSD.

[14]  Joakim Nivre,et al.  Pseudo-Projective Dependency Parsing , 2005, ACL.

[15]  Richard Johansson,et al.  Extended Constituent-to-Dependency Conversion for English , 2007, NODALIDA.

[16]  Dilek Z. Hakkani-Tür,et al.  Building a Turkish Treebank , 2003 .