Investigating the Relationship between Classification Quality and SMT Performance in Discriminative Reordering Models

Reordering is one of the most important factors affecting the quality of the output in statistical machine translation (SMT). A considerable number of approaches that proposed addressing the reordering problem are discriminative reordering models (DRM). The core component of the DRMs is a classifier which tries to predict the correct word order of the sentence. Unfortunately, the relationship between classification quality and ultimate SMT performance has not been investigated to date. Understanding this relationship will allow researchers to select the classifier that results in the best possible MT quality. It might be assumed that there is a monotonic relationship between classification quality and SMT performance, i.e., any improvement in classification performance will be monotonically reflected in overall SMT quality. In this paper, we experimentally show that this assumption does not always hold, i.e., an improvement in classification performance might actually degrade the quality of an SMT system, from the point of view of MT automatic evaluation metrics. However, we show that if the improvement in the classification performance is high enough, we can expect the SMT quality to improve as well. In addition to this, we show that there is a negative relationship between classification accuracy and SMT performance in imbalanced parallel corpora. For these types of corpora, we provide evidence that, for the evaluation of the classifier, macro-averaged metrics such as macro-averaged F-measure are better suited than accuracy, the metric commonly used to date.

[1]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[2]  Kevin Small,et al.  All links are not the same: evaluating word alignments for statistical machine translation , 2007, MTSUMMIT.

[3]  Hermann Ney,et al.  A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[4]  Hermann Ney,et al.  Discriminative Reordering Models for Statistical Machine Translation , 2006, WMT@HLT-NAACL.

[5]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[6]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[7]  Necip Fazil Ayan,et al.  Going Beyond AER: An Extensive Analysis of Word Alignments and Their Impact on MT , 2006, ACL.

[8]  Mahesan Niranjan,et al.  Scalable Reordering Models for SMT based on Multiclass SVM , 2015, Prague Bull. Math. Linguistics.

[9]  王星,et al.  A Topic-based Reordering Model for Statistical Machine Translation , 2014 .

[10]  Philipp Koehn,et al.  Predicting Success in Machine Translation , 2008, EMNLP.

[11]  Andy Way,et al.  Benchmarking SMT Performance for Farsi Using the TEP++ Corpus , 2015, EAMT.

[12]  Antonio Toral,et al.  Using Wordnet to Improve Reordering in Hierarchical Phrase-Based Statistical Machine Translation , 2016, GWC.

[13]  Hao Yu,et al.  Extending the Hierarchical Phrase Based Model with Maximum Entropy Based BTG , 2010, AMTA.

[14]  Arianna Bisazza,et al.  Dynamically Shaping the Reordering Search Space of Phrase-Based Statistical Machine Translation , 2013, Transactions of the Association for Computational Linguistics.

[15]  Christopher D. Manning,et al.  Improved Models of Distortion Cost for Statistical Machine Translation , 2010, NAACL.

[16]  Chris Quirk,et al.  Dependency Treelet Translation: Syntactically Informed Phrasal SMT , 2005, ACL.

[17]  Stephan Vogel,et al.  Integrating Phrase-based Reordering Features into a Chart-based Decoder for Machine Translation , 2013, ACL.

[18]  Philip Resnik,et al.  A Unified Model for Soft Linguistic Reordering Constraints in Statistical Machine Translation , 2014, ACL.

[19]  Dan Klein,et al.  Optimization, Maxent Models, and Conditional Estimation without Magic , 2003, NAACL.

[20]  Alon Lavie,et al.  Better Hypothesis Testing for Statistical Machine Translation: Controlling for Optimizer Instability , 2011, ACL.

[21]  Liang Tian,et al.  A Relationship: Word Alignment, Phrase Table, and Translation Quality , 2014, TheScientificWorldJournal.

[22]  Philipp Koehn,et al.  A unified framework for phrase-based, hierarchical, and syntax-based statistical machine translation , 2009, IWSLT.

[23]  Ralph Weischedel,et al.  A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[24]  Andy Way,et al.  Dependency-based Reordering Model for Constituent Pairs in Hierarchical SMT , 2015, EAMT.

[25]  Andy Way,et al.  Syntax- and semantic-based reordering in hierarchical phrase-based statistical machine translation , 2017, Expert Syst. Appl..

[26]  Qin Gao,et al.  Reassessment of the role of phrase extraction in pbsmt , 2009 .

[27]  Daniel Jurafsky,et al.  Discriminative Reordering with Chinese Grammatical Relations Features , 2009, SSST@HLT-NAACL.

[28]  Philipp Koehn,et al.  Soft Dependency Constraints for Reordering in Hierarchical Phrase-Based Translation , 2011, EMNLP.

[29]  Hermann Ney,et al.  AER: do we need to “improve” our alignments? , 2006, IWSLT.

[30]  Philip Koehn,et al.  Statistical Machine Translation , 2010, EAMT.

[31]  Haizhou Li,et al.  Modeling the Translation of Predicate-Argument Structure for SMT , 2012, ACL.

[32]  Kemal Oflazer Statistical Machine Translation into a Morphologically Complex Language , 2008, CICLing.

[33]  Alexander M. Fraser,et al.  Squibs and Discussions: Measuring Word Alignment Quality for Statistical Machine Translation , 2007, CL.

[34]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[35]  Qun Liu,et al.  Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation , 2006, ACL.

[36]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[37]  George F. Foster,et al.  Batch Tuning Strategies for Statistical Machine Translation , 2012, NAACL.

[38]  Khalil Sima ' an,et al.  Bilingual Markov Reordering Labels for Hierarchical SMT , 2014 .

[39]  Satoshi Nakamura,et al.  Learning local word reorderings for hierarchical phrase-based statistical machine translation , 2016, Machine Translation.

[40]  David Chiang,et al.  A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[41]  Maosong Sun,et al.  A Neural Reordering Model for Phrase-based Translation , 2014, COLING.