Apprentissage d’analyseur en dépendances cross-lingue par projection partielle de dépendances (Cross-lingual learning of dependency parsers from partially projected dependencies )[In French]

Cross-lingual learning of dependency parsers from partially projected dependencies This paper presents a simple strategy for transferring dependency parsers across languages. We first show that learning transition-based parser from partially annotated data is possible and effective. Then we propose to build large partially annotated dataset for several target languages via the projection of annotations through unambiguous word alignments. Based on the results obtained with such methodology, we show that our method is therefore easy to implement and compete with recent algorithmically costly methods at a much cheaper computational cost. MOTS-CLÉS : Transfert cross-lingue, Analyse en dépendances, Annotations partielles.

[1]  Fei Xia,et al.  Unsupervised Dependency Parsing with Transferring Distribution via Parallel Guidance and Entropy Regularization , 2014, ACL.

[2]  Joakim Nivre,et al.  A Dynamic Oracle for Arc-Eager Dependency Parsing , 2012, COLING.

[3]  David Marecek Combining Diverse Word-Alignment Symmetrizations Improves Dependency Tree Projection , 2011, CICLing.

[4]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[5]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[6]  Joakim Nivre,et al.  Target Language Adaptation of Discriminative Transfer Parsers , 2013, NAACL.

[7]  Jörg Tiedemann,et al.  Treebank Translation for Cross-Lingual Parser Induction , 2014, CoNLL.

[8]  Slav Petrov,et al.  A Universal Part-of-Speech Tagset , 2011, LREC.

[9]  Jörg Tiedemann Rediscovering Annotation Projection for Cross-Lingual Parser Induction , 2014, COLING.

[10]  Noah A. Smith,et al.  Unsupervised Structure Prediction with Non-Parallel Multilingual Guidance , 2011, EMNLP.

[11]  Joakim Nivre,et al.  A Transition-Based System for Joint Part-of-Speech Tagging and Labeled Non-Projective Dependency Parsing , 2012, EMNLP.

[12]  Joakim Nivre,et al.  Transition-based Dependency Parsing with Rich Non-local Features , 2011, ACL.

[13]  Anders Søgaard Data point selection for cross-language adaptation of dependency parsers , 2011, ACL.

[14]  Mark Johnson,et al.  Using Universal Linguistic Knowledge to Guide Grammar Induction , 2010, EMNLP.

[15]  Joakim Nivre,et al.  Universal Dependency Annotation for Multilingual Parsing , 2013, ACL.

[16]  Noah A. Smith,et al.  A Simple, Fast, and Effective Reparameterization of IBM Model 2 , 2013, NAACL.

[17]  Philip Resnik,et al.  Bootstrapping parsers via syntactic projection across parallel texts , 2005, Natural Language Engineering.

[18]  Sylwia Ozdowska Projecting POS tags and syntactic dependencies from English and French to Polish in aligned corpora , 2006 .

[19]  Miles Osborne,et al.  Statistical Machine Translation , 2010, Encyclopedia of Machine Learning and Data Mining.

[20]  Jonas Kuhn,et al.  Data-Driven Dependency Parsing of New Languages Using Incomplete and Noisy Training Data , 2009, CoNLL.

[21]  Min Zhang,et al.  Soft Cross-lingual Syntax Projection for Dependency Parsing , 2014, COLING.

[22]  Mohammad Sadegh Rasooli,et al.  Density-Driven Cross-Lingual Transfer of Dependency Parsers , 2015, EMNLP.

[23]  Philip Resnik,et al.  Cross-Language Parser Adaptation between Related Languages , 2008, IJCNLP.

[24]  Slav Petrov,et al.  Multi-Source Transfer of Delexicalized Dependency Parsers , 2011, EMNLP.

[25]  James Parker,et al.  on Knowledge and Data Engineering, , 1990 .