Accurate role-semantic annotation of French through cross-lingual projection

This paper addresses the task of automatically inducing role-semantic annotations for new languages with high precision. To this end, we test the generalisability of the language-independent, projection-based annotation framework introduced by Padó and Lapata (2005, 2006) by (a) applying it to a new, more distant language pair (English-French), and (b) using automatic, and thus noisy, input annotation. We show that even under these conditions, high-quality role annotations can be obtained for French that rival existing results for German. We conclude that the framework has considerable potential for reducing the manual effort involved in creating role-semantic resources for a wider range of languages.

Keywords: multilingual, FrameNet, automatic semantic annotation, lexical semantics, role annotation projection, semantic roles

[1] Katrin Erk, et al. SALTO - A Versatile Multi-Level Annotation Tool, 2006, LREC.

[2] Philip Resnik, et al. Evaluating Translational Correspondence using Annotation Projection, 2002, ACL.

[3] Cécile Fabre, et al. Apprentissage de ressources lexicales pour l'extension de requêtes, 2000.

[4] D. Bourigault, et al. Syntex, analyseur syntaxique de corpus, 2005.

[5] Philipp Koehn, et al. Europarl: A Parallel Corpus for Statistical Machine Translation, 2005, MTSUMMIT.

[6] Richard Johansson, et al. A FrameNet-Based Semantic Role Labeler for Swedish, 2006, ACL.

[7] Hans Christian Boas, et al. Bilingual FrameNet Dictionaries for Machine Translation, 2002, LREC.

[8] Hermann Ney, et al. A Systematic Comparison of Various Statistical Alignment Models, 2003, CL.

[9] Sanda M. Harabagiu, et al. Question Answering Based on Semantic Structures, 2004, COLING.

[10] David Yarowsky, et al. Inducing Multilingual Text Analysis Tools via Robust Projection across Aligned Corpora, 2001, HLT.

[11] Alessandro Moschitti, et al. Semantic Role Labeling via FrameNet, VerbNet and PropBank, 2006, ACL.

[12] Pascale Fung, et al. BiFrameNet: Bilingual Frame Semantics Resource Construction by Cross-lingual Induction, 2004, COLING.

[13] Mirella Lapata, et al. Optimal Constituent Alignment with Edge Covers for Semantic Projection, 2006, ACL.

[14] Daniel Gildea, et al. Automatic Labeling of Semantic Roles, 2000, ACL.

[15] Sanda M. Harabagiu, et al. Using Predicate-Argument Structures for Information Extraction, 2003, ACL.

[16] Christopher R. Johnson, et al. Background to FrameNet, 2003.

[17] Daniel Jurafsky, et al. Automatic Labeling of Semantic Roles, 2002, CL.