Conversion et améliorations de corpus du français annotés en Universal Dependencies [Conversion and Improvement of Universal Dependencies French corpora]

Cet article decrit l'effort d'amelioration de deux corpus du francais annotes en dependances syntaxiques, qui s'inscrit dans le cadre du projet Universal Dependencies (UD) qui vise a elaborer un schema d'annotation syntaxique permettant d'analyser de facon similaire plusieurs langues differentes. Nous avons cherche a rendre plus conformes au schema UD ces deux corpus du francais, et nous avons evalue l'impact des modifications apportees aux corpus sur la conformite avec le schema UD et la coherence interne de leur annotation.

[1]  Timothy Osborne,et al.  The status of function words in dependency grammar: A critique of Universal Dependencies (UD) , 2019, Glossa: a journal of general linguistics.

[2]  Guy Perrier,et al.  SUD or Surface-Syntactic Universal Dependencies: An annotation scheme near-isomorphic to UD , 2018, UDW@EMNLP.

[3]  Joakim Nivre,et al.  Expletives in Universal Dependency Treebanks , 2018, UDW@EMNLP.

[4]  Amit Seker,et al.  The Hebrew Universal Dependency Treebank: Past Present and Future , 2018, UDW@EMNLP.

[5]  Daniel Zeman,et al.  Challenges in Converting the Index Thomisticus Treebank into Universal Dependencies , 2018, UDW@EMNLP.

[6]  Chiara Alzetta,et al.  Assessing the Impact of Incremental Error Detection and Correction. A Case Study on the Italian Universal Dependency Treebank , 2018, UDW@EMNLP.

[7]  Alina Wróblewska,et al.  Extended and Enhanced Polish Dependency Bank in Universal Dependencies Format , 2018, UDW@EMNLP.

[8]  Martin Potthast,et al.  CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies , 2018, CoNLL.

[9]  Adam Przepiórkowski,et al.  From Lexical Functional Grammar to enhanced Universal Dependencies , 2018, Language Resources and Evaluation.

[10]  Guy Perrier,et al.  Application of Graph Rewriting to Natural Language Processing , 2018 .

[11]  Benoît Sagot,et al.  Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer , 2018, LREC.

[12]  Guillaume Wisniewski,et al.  Errator: a Tool to Help Detect Annotation Errors in the Universal Dependencies Project , 2018, LREC.

[13]  Filip Ginter,et al.  Assessing the Annotation Consistency of the Universal Dependencies Corpora , 2017, DepLing.

[14]  Sylvain Kahane,et al.  Trois schémas d’annotation syntaxique en dépendance pour un même corpus de français oral : le cas de la macrosyntaxe , 2017 .

[15]  Gertjan van Noord,et al.  Increasing Return on Annotation Investment: The Automatic Construction of a Universal Dependency Treebank for Dutch , 2017, UDW@NoDaLiDa.

[16]  Nizar Habash,et al.  Universal Dependencies for Arabic , 2017, WANLP@EACL.

[17]  Marie Candito,et al.  Hard Time Parsing Questions: Building a QuestionBank for French , 2016, LREC.

[18]  Sampo Pyysalo,et al.  Universal Dependencies v1: A Multilingual Treebank Collection , 2016, LREC.

[19]  Corentin Ribeyre Méthodes d'analyse supervisée pour l'interface syntaxe-sémantique , 2016 .

[20]  Bruno Guillaume,et al.  Dependency Parsing with Graph Rewriting , 2015, IWPT.

[21]  Veronika Laippala,et al.  Universal Dependencies for Finnish , 2015, NODALIDA.

[22]  Marie Candito,et al.  Strategies for Contiguous Multiword Expression Analysis and Dependency Parsing , 2014, ACL.

[23]  Eric P. Xing,et al.  Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , 2014, ACL 2014.

[24]  Éric Villemonte de la Clergerie,et al.  Deep Syntax Annotation of the Sequoia French Treebank , 2014, LREC.

[25]  Joakim Nivre,et al.  Universal Dependency Annotation for Multilingual Parsing , 2013, ACL.

[26]  Felice Dell'Orletta,et al.  Linguistically-driven Selection of Correct Arcs for Dependency Parsing , 2013 .

[27]  Marie Candito,et al.  Effectively long-distance dependencies in French : annotation and parsing evaluation , 2012 .

[28]  Marie Candito,et al.  Le corpus Sequoia : annotation syntaxique et exploitation pour l’adaptation d’analyseur par pont lexical (The Sequoia Corpus : Syntactic Annotation and Use for a Parser Lexical Domain Adaptation Method) [in French] , 2012, JEP/TALN/RECITAL.

[29]  Sylvain Kahane,et al.  Une approche paresseuse de l’analyse sémantique ou comment construire une interface syntaxe-sémantique à partir d’exemples , 2010, JEPTALNRECITAL.

[30]  Benoît Sagot,et al.  The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French , 2010, LREC.

[31]  Claire Gardent,et al.  Semantic Normalisation : a Framework and an Experiment , 2009, IWCS.

[32]  Walt Detmar Meurers,et al.  On Detecting Errors in Dependency Treebanks , 2008 .

[33]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[34]  Walt Detmar Meurers,et al.  Detecting Errors in Discontinuous Structural Annotation , 2005, ACL.

[35]  Anne Abeillé,et al.  Enriching a French Treebank , 2004, LREC.

[36]  Leo Wanner,et al.  On Using a Parallel Graph Rewriting Formalism in Generation , 2001, EWNLG@ACL.

[37]  Margaret E. Winters,et al.  Pronom et syntaxe: L'approche pronominale et son application au français@@@Pronom et syntaxe: L'approche pronominale et son application au francais , 1987 .

[38]  Chiara Alzetta,et al.  Dangerous Relations in Dependency Treebanks , 2018, TLT.

[39]  Emily Pitler,et al.  CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies , 2017, CoNLL.

[40]  Milan Straka,et al.  Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe , 2017, CoNLL.

[41]  Marie Mikulová,et al.  Prague Dependency Treebank , 2017 .

[42]  Cristina Bosco,et al.  PartTUT: The Turin University Parallel Treebank , 2015, Italian Natural Language Processing within the PARLI Project.

[43]  Maria Simi,et al.  Evolution of Italian Treebank and Dependency Parsing towards Universal Dependencies , 2015 .

[44]  Daniel Zeman,et al.  Slavic Languages in Universal Dependencies , 2015 .

[45]  Valentin Jijkoun,et al.  Learning to Transform Linguistic Graphs , 2007 .

[46]  Walt Detmar Meurers,et al.  Detecting Inconsistencies in Treebanks , 2003 .

[47]  Eero Hyvonen,et al.  Semantic Parsing as Graph Language Transformation - A Multidimensional Approach to Parsing Highly Inflectional Languages , 1984, ACL.