Pronominal Anaphora Generation in an English-Spanish MT Approach

This paper presents the pronominal anaphora generation module of a Machine Translation (MT) system. The interlingua MT approach, AGIR (Anaphora Generation with an Interlingua Representation), generates anaphoric expressions in the target language from the interlingua representation of the source text. AGIR uses several kinds of knowledge (lexical, syntactic, morphological and semantic) to solve the Natural Language Processing (NLP) problems of the source text. An interlingua representation of the whole text is then obtained, which allows the correct generation of anaphoric expressions. In this paper we evaluate the generation into the target language of third-person personal pronouns in English and Spanish (including zero pronouns). We obtained a precision of 80.39% in the generation of Spanish pronominal anaphora and 84.77% in the generation of English pronominal anaphora.
