Incorporating pronoun function into statistical machine translation

Pronouns are used frequently in language, and perform a range of functions. Some pronouns are used to express coreference, and others are not. Languages and genres differ in how and when they use pronouns and this poses a problem for Statistical Machine Translation (SMT) systems (Le Nagard and Koehn, 2010; Hardmeier and Federico, 2010; Novák, 2011; Guillou, 2012; Weiner, 2014; Hardmeier, 2014). Attention to date has focussed on coreferential (anaphoric) pronouns with NP antecedents, which when translated from English into a language with grammatical gender, must agree with the translation of the head of the antecedent. Despite growing attention to this problem, little progress has been made, and little attention has been given to other pronouns. The central claim of this thesis is that pronouns performing different functions in text should be handled differently by SMT systems and when evaluating pronoun translation. This motivates the introduction of a new framework to categorise pronouns according to their function: Anaphoric/cataphoric reference, event reference, extra-textual reference, pleonastic, addressee reference, speaker reference, generic reference, or other function. Labelling pronouns according to their function also helps to resolve instances of functional ambiguity arising from the same pronoun in the source language having multiple functions, each with different translation requirements in the target language. The categorisation framework is used in corpus annotation, corpus analysis, SMT system development and evaluation. I have directed the annotation and conducted analyses of a parallel corpus of English-German texts called ParCor (Guillou et al., 2014), in which pronouns are manually annotated according to their function. This provides a first step toward understanding the problems that SMT systems face when translating pronouns. In the thesis, I show how analysis of manual translation can prove useful in identifying and understanding systematic differences in pronoun use between two languages and can help inform the design of SMT systems. In particular, the analysis revealed that the German translations in ParCor contain more anaphoric and pleonastic pronouns than their English originals, reflecting differences in pronoun use. This raises a particular problem for the evaluation of pronoun translation. Automatic evaluation methods that rely on reference translations to assess pronoun translation, will not be able to provide an adequate evaluation when the

[1]  Alon Lavie,et al.  METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[2]  R. Mitkov ANAPHORA RESOLUTION: THE STATE OF THE ART , 2007 .

[3]  Ondrej Bojar,et al.  Failures in English-Czech Phrase-Based MT ∗ , 2010 .

[4]  Andrei Popescu-Belis,et al.  How Comparable are Parallel Corpora? Measuring the Distribution of General Vocabulary and Connectives , 2011, BUCC@ACL.

[5]  Jörg Tiedemann To Cache or Not To Cache? Experiments with Adaptive Models in Statistical Machine Translation , 2010, WMT@ACL.

[6]  Pascale Fung,et al.  Semantic Roles for SMT: A Hybrid Two-Pass Model , 2009, NAACL.

[7]  Karin Naumann,et al.  Manual for the Annotation of in-document Referential Relations , 2006 .

[8]  Preslav Nakov,et al.  Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation , 2015, DiscoMT@EMNLP.

[9]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[10]  Christian Hardmeier On Statistical Machine Translation and Translation Theory , 2015, DiscoMT@EMNLP.

[11]  Dan Klein,et al.  Learning Accurate, Compact, and Interpretable Tree Annotation , 2006, ACL.

[12]  Benoît Sagot,et al.  The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French , 2010, LREC.

[13]  P. Kiparsky Disjoint reference and the typology of pronouns , 2002 .

[14]  Kun Yu,et al.  Example-based machine translation based on deeper NLP , 2006, IWSLT.

[15]  Chris Callison-Burch,et al.  Modality and Negation in SIMT Use of Modality and Negation in Semantically-Informed Syntactic MT , 2012, CL.

[16]  Erich Steiner,et al.  Cross-Linguistic Corpora for the Study of Translations: Insights from the Language Pair English-German , 2012 .

[17]  Petr Pajas,et al.  TectoMT: Highly Modular MT System with Tectogrammatics Used as Transfer Layer , 2008, WMT@ACL.

[18]  Yannick Versley,et al.  Extending BART to Provide a Coreference Resolution System for German , 2010, LREC.

[19]  Sharid Loáiciga,et al.  Rule-Based Pronominal Anaphora Treatment for Machine Translation , 2015, DiscoMT@EMNLP.

[20]  Annie Louis,et al.  Structured and Unstructured Cache Models for SMT Domain Adaptation , 2014, EACL.

[21]  Carla Umbach,et al.  Anaphora Resolution in Machine Translation , 1992 .

[22]  Nicholas Asher,et al.  Reference to abstract objects in discourse , 1993, Studies in linguistics and philosophy.

[23]  Bonnie Webber A Formal Approach to Discourse Anaphora , 2016 .

[24]  Viktor Becher,et al.  Explicitation and implicitation in translation. A corpus-based study of English-German and German-English translations of business texts , 2011 .

[25]  Thomas Meyer,et al.  Machine Translation with Many Manually Labeled Discourse Connectives , 2013, DiscoMT@ACL.

[26]  Joel Tetreault,et al.  A Corpus-Based Evaluation of Centering and Pronoun Resolution , 2001, Computational Linguistics.

[27]  Ian Michael,et al.  English Grammatical Categories: and the Tradition to 1800 , 1971 .

[28]  Michael Halliday,et al.  An Introduction to Functional Grammar , 1985 .

[29]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[30]  M. N ovak Utilization of Anaphora in Machine Translation , 2011 .

[31]  Michal Novák,et al.  Translation of “It” in a Deep Syntax Framework , 2013, DiscoMT@ACL.

[32]  Noah A. Smith,et al.  Rich Source-Side Context for Statistical Machine Translation , 2008, WMT@ACL.

[33]  Valentin Shevchuk,et al.  Corpus-based translation studies: theory, findings, applications , 2009 .

[34]  Andrei Popescu-Belis,et al.  Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns , 2012, LREC.

[35]  Geoffrey Leech,et al.  Change in Contemporary English: A Grammatical Study , 2009 .

[36]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[37]  Surabhi Gupta,et al.  Disambiguating Between Generic and Referential “You” in Dialog , 2007, ACL.

[38]  Kenneth Ward Church,et al.  A Program for Aligning Sentences in Bilingual Corpora , 1993, CL.

[39]  David Chiang,et al.  A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[40]  Adrien Barbaresi German Political Speeches - Corpus and Visualization , 2012 .

[41]  Jian Su,et al.  A Twin-Candidate Based Approach for Event Pronoun Resolution using Composite Kernel , 2010, COLING.

[42]  Scott Weinstein,et al.  Providing a Unified Account of Definite Noun Phrases in Discourse , 1983, ACL.

[43]  Jingbo Zhu,et al.  Document-level Consistency Verification in Machine Translation , 2011, MTSUMMIT.

[44]  Liane Guillou Automatic Post-Editing for the DiscoMT Pronoun Translation Task , 2015, DiscoMT@EMNLP.

[45]  Philipp Koehn,et al.  Factored Translation Models , 2007, EMNLP.

[46]  Marcello Federico,et al.  Modelling pronominal anaphora in statistical machine translation , 2010, IWSLT.

[47]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[48]  Andrei Popescu-Belis,et al.  Using Sense-labeled Discourse Connectives for Statistical Machine Translation , 2012, ESIRMT/HyTra@EACL.

[49]  Mona Baker,et al.  'Corpus Linguistics and Translation Studies: Implications and Applications' , 1993 .

[50]  Nadir Durrani,et al.  Edinburgh SLT and MT System Description for the IWSLT 2014 Evaluation , 2013 .

[51]  Jörg Tiedemann,et al.  Document-Wide Decoding for Phrase-Based Statistical Machine Translation , 2012, EMNLP.

[52]  Marie Mikulová,et al.  Announcing Prague Czech-English Dependency Treebank 2.0 , 2012, LREC.

[53]  Alexander Gode,et al.  Interlingua;: A grammar of the international language, , 1971 .

[54]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[55]  Sharid Loáiciga,et al.  Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective , 2012, LREC.

[56]  Sharid Loáiciga,et al.  Improving machine translation of null subjects in Italian and Spanish , 2012, EACL.

[57]  Yannick Versley,et al.  BART: A Modular Toolkit for Coreference Resolution , 2008, ACL.

[58]  Christoph Müller,et al.  Multi-level annotation of linguistic data with MMAX 2 , 2006 .

[59]  Kenneth Heafield,et al.  KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.

[60]  David Yarowsky,et al.  NADA: A Robust System for Non-referential Pronoun Detection , 2011, DAARC.

[61]  Joseph H. Greenberg,et al.  Some Universals of Grammar with Particular Reference to the Order of Meaningful Elements , 1990, On Language.

[62]  Rudolf Rosa Depfix, a Tool for Automatic Rule-based Post-editing of SMT , 2014, Prague Bull. Math. Linguistics.

[63]  George R. Doddington,et al.  Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[64]  Bonnie L. Webber,et al.  Analysing ParCor and its Translations by State-of-the-art SMT Systems , 2015, DiscoMT@EMNLP.

[65]  Dan Klein,et al.  Easy Victories and Uphill Battles in Coreference Resolution , 2013, EMNLP.

[66]  Philipp Koehn,et al.  Abstract Meaning Representation for Sembanking , 2013, LAW@ACL.

[67]  Andrei Popescu-Belis,et al.  Pronoun Translation and Prediction with or without Coreference Links , 2015, DiscoMT@EMNLP.

[68]  Liane Guillou,et al.  PROTEST: A Test Suite for Evaluating Pronouns in Machine Translation , 2016, LREC.

[69]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[70]  Andrei Popescu-Belis,et al.  Assessing the Accuracy of Discourse Connective Translations: Validation of an Automatic Metric , 2013, CICLing.

[71]  Ruslan Mitkov,et al.  Introduction: Special Issue on Anaphora Resolution in Machine Translation and Multilingual NLP , 1999, Machine Translation.

[72]  Heeyoung Lee,et al.  Stanford’s Multi-Pass Sieve Coreference Resolution System at the CoNLL-2011 Shared Task , 2011, CoNLL Shared Task.

[73]  R. Huddleston Introduction to the Grammar of English: Verbs, nouns and adjectives: the boundaries between them , 1984 .

[74]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[75]  András Kornai,et al.  Parallel corpora for medium density languages , 2007 .

[76]  青木 昌吉 Deutsche Grammatik = 獨逸小文典 , 1921 .

[77]  Whitney Gegg-Harrison,et al.  Identifying Non-Referential it: A Machine Learning Approach Incorporating Linguistically Motivated Patterns , 2005, ACL 2005.

[78]  Satoru Ikehara,et al.  Intrasentential Resolution of Japanese Zero Pronouns in a Machine Translation System using Semantic and Pragmatic Constraints , 1995 .

[79]  Christian Hardmeier,et al.  Discourse in Statistical Machine Translation , 2014 .

[80]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[81]  Ondrej Bojar,et al.  Evaluation of Machine Translation Metrics for Czech as the Target Language , 2009, Prague Bull. Math. Linguistics.

[82]  Rudolf Rosa,et al.  Two-step translation with grammatical post-processing , 2011, WMT@EMNLP.

[83]  Silvia Bernardini,et al.  A New Approach to the Study of Translationese: Machine-learning the Difference between Original and Translated Text , 2005, Lit. Linguistic Comput..

[84]  Daniel Marcu,et al.  HyTER: Meaning-Equivalent Semantics for Translation Evaluation , 2012, NAACL.

[85]  Petr Sgall,et al.  The Meaning Of The Sentence In Its Semantic And Pragmatic Aspects , 1986 .

[86]  Nianwen Xue,et al.  CoNLL-2011 Shared Task: Modeling Unrestricted Coreference in OntoNotes , 2011, CoNLL Shared Task.

[87]  Philipp Koehn,et al.  Aiding Pronoun Translation with Co-Reference Resolution , 2010, WMT@ACL.

[88]  Philipp Koehn,et al.  Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.

[89]  Marcello Federico,et al.  Complexity of spoken versus written language for machine translation , 2014, EAMT.

[90]  Manfred Consten,et al.  Anaphors in text : cognitive, formal and applied approaches to anaphoric reference , 2007 .

[91]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[92]  Jörg Tiedemann,et al.  Context Adaptation in Statistical Machine Translation Using Models with Exponentially Decaying Cache , 2010, ACL 2010.

[93]  Christian Hardmeier A Document-Level SMT System with Integrated Pronoun Prediction , 2015, DiscoMT@EMNLP.

[94]  Jerry R. Hobbs Resolving pronoun references , 1986 .

[95]  K. Kunz,et al.  Cross-linguistic analysis of discourse variation across registers , 2015 .

[96]  Guodong Zhou,et al.  Cache-based Document-level Statistical Machine Translation , 2011, EMNLP.

[97]  Jörg Tiedemann,et al.  ParCor 1.0: A Parallel Pronoun-Coreference Corpus to Support Statistical MT , 2014, LREC.

[98]  Ding Liu,et al.  Semantic Role Features for Machine Translation , 2010, COLING.

[99]  Andreas Eisele,et al.  MultiUN: A Multilingual Corpus from United Nation Documents , 2010, LREC.

[100]  Liane Guillou,et al.  Improving Pronoun Translation for Statistical Machine Translation , 2012, EACL.

[101]  Bonnie L. Webber,et al.  Tense as Discourse Anaphor , 1988, CL.

[102]  Franz Josef Och,et al.  Minimum Error Rate Training in Statistical Machine Translation , 2003, ACL.

[103]  Matthew G. Snover,et al.  A Study of Translation Edit Rate with Targeted Human Annotation , 2006, AMTA.

[104]  G. V. D. Meer,et al.  Personal pronouns in present-day English , 1998 .

[105]  Amr Ahmed,et al.  Syntax-Based Statistical Machine Translation : A review , 2006 .

[106]  Andrei Popescu-Belis,et al.  A Corpus-based Contrastive Analysis for Defining Minimal Semantics of Inter-sentential Dependencies for Machine Translation , 2011 .

[107]  Mona Diab An Unsupervised Approach for Bootstrapping Arabic Sense Tagging , 2004 .

[108]  Anders Bj Data-driven Multilingual Coreference Resolution using Resolver Stacking , 2012 .

[109]  Pushpak Bhattacharyya,et al.  Projecting Parameters for Multilingual Word Sense Disambiguation , 2009, EMNLP.

[110]  Udo Hahn,et al.  Functional Centering - Grounding Referential Coherence in Information Structure , 1999, Comput. Linguistics.

[111]  Jochen Weiner,et al.  Pronominal Anaphora in Machine Translation , 2014 .

[112]  Carl Pollard,et al.  A Centering Approach to Pronouns , 1987, ACL.

[113]  Jörg Tiedemann Baseline Models for Pronoun Prediction and Pronoun-Aware Translation , 2015, DiscoMT@EMNLP.

[114]  Moshe Koppel,et al.  Translationese and Its Dialects , 2011, ACL.