Création d'un corpus annoté pour le traitement des descriptions définies

The training and the evaluation of anaphora resolvers require the development of annotated corpora. In this paper, we propose a methodology for annotating definite descriptions that supports the consistent annotation of a large corpus (roughly 5 000 definite descriptions). We then present the results of the annotation work and discuss the implications for the automatic resolution of definite descriptions. MOTS-CLES : Descriptions definies,annotation de corpus,resolution automatique d'anaphores

[1]  Manfred Pinkal,et al.  Towards a Resource for Lexical Semantics: A Large German Corpus with Extensive Semantic Annotation , 2003, ACL.

[2]  Thorsten Brants,et al.  Inter-annotator Agreement for a German Newspaper Corpus , 2000, LREC.

[3]  Massimo Poesio,et al.  Learning to Resolve Bridging References , 2004, ACL.

[4]  Olga Uryupina,et al.  High-precision Identification of Discourse New and Unique Noun Phrases , 2003, ACL.

[5]  Susanne Salmon-Alt,et al.  Résolution automatique d’anaphores infidèles en français : Quelles ressources pour quels apports ? , 2004, JEPTALNRECITAL.

[6]  G. Kleiber,et al.  L'anaphore associative , 2001 .

[7]  Michael Strube,et al.  Annotating Anaphoric and Bridging Relations with MMAX , 2001, SIGDIAL Workshop.

[8]  Renata Vieira,et al.  An Empirically-based System for Processing Definite Descriptions , 2000, CL.

[9]  V SXW6Y,et al.  Resolving Bridging References in Unrestricted Text , 1997 .

[10]  Claire Cardie,et al.  Identifying Anaphoric and Non-Anaphoric Noun Phrases to Improve Coreference Resolution , 2002, COLING.

[11]  Eric Gaussier,et al.  Annotating a large corpus with anaphoric links , 2000 .

[12]  P. Christophersen The articles : A study of their theory and use in English , 1940 .

[13]  André Chervel,et al.  Recherches sur le syntagme substantif , 1966 .

[14]  Massimo Poesio,et al.  The MATE/GNOME Proposals for Anaphoric Annotation, Revisited , 2004, SIGDIAL Workshop.

[15]  Ellen F. Prince,et al.  Toward a taxonomy of given-new information , 1981 .

[16]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[17]  Nordine Fourour,et al.  Identification et catégorisation automatiques des anthroponymes du Français , 2001, JEPTALNRECITAL.

[18]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[19]  Robert Dale,et al.  Mining a Corpus to Support Associative Anaphora Resolution , 2002 .

[20]  Nikiforos Karamanis,et al.  Entity coherence for descriptive text structuring , 2004 .

[21]  J. Groenendijk,et al.  Coreference and modality , 1996 .

[22]  Saul A. Kripke,et al.  Naming and Necessity , 1980 .

[23]  Massimo Poesio,et al.  A General-Purpose, Off-the-shelf Anaphora Resolution Module: Implementation and Preliminary Evaluation , 2004, LREC.

[24]  Kees van Deemter,et al.  On Coreferring: Coreference in MUC and Related Annotation Schemes , 2000, CL.

[25]  C. Fellbaum An Electronic Lexical Database , 1998 .

[26]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[27]  Gustave Guillaume,et al.  Le problème de l'article et sa solution dans la langue française , 1975 .

[28]  Massimo Poesio,et al.  Discourse Annotation and Semantic Annotation in the GNOME corpus , 2004, Proceedings of the 2004 ACL Workshop on Discourse Annotation - DiscAnnotation '04.

[29]  G. Kleiber,et al.  Des anaphores associatives méronymiques aux anaphores associatives locatives , 1997 .

[30]  Claire Gardent,et al.  Which bridges for bridging definite descriptions? , 2003, LINC@EACL.

[31]  Renata Vieira,et al.  A Corpus-based Investigation of Definite Description Use , 1997, CL.

[32]  Chris Mellish,et al.  Corpus-based NP Modifier Generation , 2001, NAACL.

[33]  Dag Westerståhl,et al.  Determiners and Context Sets , 1985 .

[34]  Constantin Orasan,et al.  Annotation of Anaphoric Expressions in an Aligned Bilingual Corpus , 2004, LREC.

[35]  Francis Corblin Chaînes référentielles et communautés épistémiques , 2002 .

[36]  Hua Cheng,et al.  Modelling aggregation motivated interactions in descriptive text generation , 2002 .

[37]  Razvan C. Bunescu,et al.  Associative Anaphora Resolution: A Web-Based Approach , 2003 .

[38]  Massimo Poesio,et al.  Acquiring Lexical Knowledge for Anaphora Resolution , 2002, LREC.

[39]  Susanne Salmon-Alt,et al.  Nominal Expressions in Multilingual Corpora: Definites and Demonstratives , 2002, LREC.

[40]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[41]  Renata Vieira,et al.  Discourse-New Detectors for Definite Description Resolution: A Survey and a Preliminary Proposal , 2004 .

[42]  Hélène Manuélian,et al.  Descriptions définies et démonstratives : Analyses de Corpus pour la Génération de Textes , 2003 .

[43]  Michael Strube,et al.  MMAX: A Tool for the Annotation of Multi-modal Corpora , 2001, IJCAI 2001.

[44]  Frank Keller,et al.  Finding Syntactic Structure in Unparsed Corpora The Gsearch Corpus Query System , 2001, Comput. Humanit..

[45]  Kari Fraurud,et al.  Definiteness and the Processing of Noun Phrases in Natural Discourse , 1990, J. Semant..

[46]  B. Fradin,et al.  Anaphorisation et stereotypes nominaux , 1984 .