Bilingual alignment of anaphoric expressions

In this paper we present an automatic mechanism for bilingual (Spanish-English) alignment of anaphoric expressions. For this purpose, two anaphora resolution systems were used. Both are based on linguistic preferences and constraints, for Spanish (SUPPAR) and for English (MARS). These systems have been independently developed and each of them is presented individually with their evaluation results. The majority of the paper presents an automatic alignment method for pronominal anaphora in Spanish and English. Once an anaphor has been solved (in both languages) this method matches anaphoric expressions and antecedents from both texts. A bitext map method has been used for the alignment with a set of bilingual texts for the evaluation. These texts have been extracted from several European Community Official documents (EUR-lex database). The alignment mechanism can be applied to different tasks related to Machine Translation such us pattern learning for translation or evaluation for automatic generation of multilingual anaphora.

[1]  EstimationPeter,et al.  The Mathematics of Machine Translation : Parameter , 2004 .

[2]  Xiaobo Ren,et al.  Translation Analysis and Translation Automation , 1993, TMI.

[3]  Elliott Macklovitch Using Bi-textual Alignment for Translation Validation: the TransCheck system , 1994, AMTA.

[4]  Evelyne Tzoukermann,et al.  Combining corpus and machine-readable dictionary data for building bilingual lexicons , 1995, Machine Translation.

[5]  Richard Evans,et al.  A New, Fully Automatic Version of Mitkov's Knowledge-Poor Pronoun Resolution Method , 2002, CICLing.

[6]  Jean Véronis,et al.  Methods and Practical Issues in Evaluating Alignment Techniques , 1998, COLING-ACL.

[7]  Antonio Ferrández Rodríguez,et al.  A Computational Approach to Zero-pronouns in Spanish , 2000, ACL.

[8]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[9]  Richard Evans,et al.  Applying Machine Learning Toward an Automatic Classification of It , 2001, Lit. Linguistic Comput..

[10]  Kenneth Ward Church,et al.  Robust Bilingual Word Alignment for Machine Aided Translation , 1993, VLC@ACL.

[11]  T. Reinhart Anaphora and semantic interpretation , 1983 .

[12]  Charles F. Goldfarb,et al.  SGML handbook , 1990 .

[13]  Elliott Macklovitch,et al.  Line ‘Em Up: Advances in Alignment Technology and their Impact on Translation Support Tools , 2004, Machine Translation.

[14]  Michel Simard,et al.  Bilingual Sentence Alignment: Balancing Robustness and Accuracy , 2004, Machine Translation.

[15]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[16]  I. Dan Melamed,et al.  A Geometric Approach to Mapping Bitext Correspondence , 1996, EMNLP.

[17]  Manuel Palomar,et al.  Coreference-oriented Interlingual Slot Structure & Machine Translation , 1999, COREF@ACL.

[18]  Richard Evans,et al.  Enhancing Preference-Based Anaphora Resolution with Genetic Algorithms , 2000, Natural Language Processing.

[19]  Martin Kay,et al.  Text-Translation Alignment , 1993, Comput. Linguistics.

[20]  Jerry R. Hobbs Resolving pronoun references , 1986 .

[21]  Branimir Boguraev,et al.  Anaphora for Everyone: Pronominal Anaphora Resolution without a Parser , 1996, COLING.

[22]  Ruslan Mitkov,et al.  Robust Pronoun Resolution with Limited Knowledge , 1998, ACL.

[23]  Manuel Palomar,et al.  Anaphor Resolution In Unrestricted Texts With Partial Parsing , 1998, ACL.

[24]  Kenneth Ward Church,et al.  A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[25]  Jian-Yun Nie,et al.  Using a Probabilistic Translation Model for Cross-Language Information Retrieval , 1998, VLC@COLING/ACL.

[26]  M. Walker,et al.  A bilateral approach to givenness : A hearer-status algorithm and a centering algorithm , 1996 .

[27]  Robert Gaizauskas,et al.  Quantitative evaluation of coreference algorithms in an information extraction system , 2000 .

[28]  Ferran Plà,et al.  Tagging and Chunking with Bigrams , 2000, COLING.

[29]  Robert L. Mercer,et al.  Aligning Sentences in Parallel Corpora , 1991, ACL.

[30]  Timo Järvinen,et al.  A non-projective dependency parser , 1997, ANLP.