Hierarchical Permutation Complexity for Word Order Evaluation

Existing approaches for evaluating word order in machine translation work with metrics computed directly over a permutation of word positions in system output relative to a reference translation. However, every permutation factorizes into a permutation tree (PET) built of primal permutations, i.e., atomic units that do not factorize any further. In this paper we explore the idea that permutations factorizing into (on average) shorter primal permutations should represent simpler ordering as well. Consequently, we contribute Permutation Complexity, a class of metrics over PETs and their extension to forests, and define tight metrics, a sub-class of metrics implementing this idea. Subsequently we define example tight metrics and empirically test them in word order evaluation. Experiments on the WMT13 data sets for ten language pairs show that a tight metric is more often than not better than the baselines.

[1]  A. Waibel,et al.  Analyzing the potential of source sentence reordering in statistical machine translation , 2013, IWSLT.

[2]  Hiroshi Ichikawa,et al.  A Lightweight Evaluation Framework for Machine Translation Reordering , 2011, WMT@EMNLP.

[3]  Daniel Gildea,et al.  Binarization of Synchronous Context-Free Grammars , 2009, CL.

[4]  John DeNero,et al.  Inducing Sentence Structure from Parallel Corpora for Reordering , 2011, EMNLP.

[5]  Alexandra Birch,et al.  Reordering Metrics for MT , 2011, ACL.

[6]  Ondrej Bojar,et al.  Results of the WMT13 Metrics Shared Task , 2015, WMT@EMNLP.

[7]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[8]  Gad M. Landau,et al.  Permutation Pattern Discovery in Biosequences , 2004, J. Comput. Biol..

[9]  Mirella Lapata,et al.  Automatic Evaluation of Information Ordering: Kendall’s Tau , 2006, CL.

[10]  Philipp Koehn,et al.  Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation , 2011, EMNLP.

[11]  Takeaki Uno,et al.  Fast Algorithms to Enumerate All Common Intervals of Two Permutations , 1997, Algorithmica.

[12]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[13]  Philipp Koehn,et al.  Findings of the 2013 Workshop on Statistical Machine Translation , 2013, WMT@ACL.

[14]  Fabienne Braune,et al.  Long-distance reordering during search for hierarchical phrase-based SMT , 2012, EAMT.

[15]  Alexandra Birch,et al.  LRscore for Evaluating Lexical and Reordering Quality in MT , 2010, WMT@ACL.

[16]  Alon Lavie,et al.  Meteor 1.3: Automatic Metric for Reliable Optimization and Evaluation of Machine Translation Systems , 2011, WMT@EMNLP.

[17]  Arianna Bisazza,et al.  Dynamically Shaping the Reordering Search Space of Phrase-Based Statistical Machine Translation , 2013, Transactions of the Association for Computational Linguistics.

[18]  Kevin Duh,et al.  Automatic Evaluation of Translation Quality for Distant Language Pairs , 2010, EMNLP.

[19]  Mike D. Atkinson,et al.  Simple permutations and pattern restricted permutations , 2005, Discret. Math..

[20]  Alexandra Birch,et al.  Metrics for MT evaluation: evaluating reordering , 2010, Machine Translation.

[21]  Taro Watanabe,et al.  Inducing a Discriminative Parser to Optimize Machine Translation Reordering , 2012, EMNLP.

[22]  Khalil Sima'an,et al.  Fitting Sentence Level Translation Evaluation with Many Dense Features , 2014, EMNLP.

[23]  Giorgio Satta,et al.  Factoring Synchronous Grammars by Sorting , 2006, ACL.

[24]  Khalil Sima'an,et al.  Evaluating Word Order Recursively over Permutation-Forests , 2014, SSST@EMNLP.

[25]  Philipp Koehn,et al.  Predicting Success in Machine Translation , 2008, EMNLP.

[26]  Daniel Marcu,et al.  Binarizing Syntax Trees to Improve Syntax-Based Machine Translation Accuracy , 2007, EMNLP.

[27]  Daniel Gildea,et al.  Factorization of Synchronous Context-Free Grammars in Linear Time , 2007, SSST@HLT-NAACL.

[28]  Slav Petrov,et al.  Training a Parser for Machine Translation Reordering , 2011, EMNLP.

[29]  William T. Trotter,et al.  Critically indecomposable partially ordered sets, graphs, tournaments and other binary relational structures , 1993, Discret. Math..

[30]  David Chiang,et al.  Hierarchical Phrase-Based Translation , 2007, CL.

[31]  Bing Xiang,et al.  Improving Reordering for Statistical Machine Translation with Smoothed Priors and Syntactic Features , 2011, SSST@ACL.

[32]  Slav Petrov,et al.  Training Structured Prediction Models with Extrinsic Loss Functions , 2011 .