论文信息 - Labelled Dependencies in Machine Translation Evaluation

Labelled Dependencies in Machine Translation Evaluation

We present a method for evaluating the quality of Machine Translation (MT) output, using labelled dependencies produced by a Lexical-Functional Grammar (LFG) parser. Our dependency-based method, in contrast to most popular string-based evaluation metrics, does not unfairly penalize perfectly valid syntactic variations in the translation, and the addition of WordNet provides a way to accommodate lexical variation. In comparison with other metrics on 16,800 sentences of Chinese-English newswire text, our method reaches high correlation with human scores.

Andy Way | Josef van Genabith | Karolina Owczarzak

[1] Andy Way,et al. Contextual Bitext-Derived Paraphrases in Automatic MT Evaluation , 2006, WMT@HLT-NAACL.

[2] Alex Kulesza,et al. A learning approach to improving sentence-level MT evaluation , 2004 .

[3] Philipp Koehn,et al. Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models , 2004, AMTA.

[4] Ralph Weischedel,et al. A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .

[5] Ronald M. Kaplan,et al. Lexical Functional Grammar A Formal System for Grammatical Representation , 2004 .

[6] Alon Lavie,et al. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments , 2005, IEEvaluation@ACL.

[7] Regina Barzilay,et al. Paraphrasing for Automatic Evaluation , 2006, NAACL.

[8] Joseph P. Turian,et al. Evaluation of machine translation and its evaluation , 2003, MTSUMMIT.

[9] Ying Zhang,et al. Measuring confidence intervals for the machine translation evaluation metrics , 2004, TMI.

[10] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[11] Philipp Koehn,et al. Europarl: A Parallel Corpus for Statistical Machine Translation , 2005, MTSUMMIT.