论文信息 - Automatic Error Analysis for Morphologically Rich Languages

Automatic Error Analysis for Morphologically Rich Languages

This paper presents AMEANA, an opensource tool for error analysis for natural language processing tasks targeting morphologically rich languages. Unlike standard evaluation metrics such as BLEU or WER, AMEANA automatically provides a detailed error analysis that can help researchers and developers better understand the strengths and weaknesses of their systems. AMEANA is easily adaptable to any language provided the existence of a morphological analyzer. In this paper, we focus on usability in the context of Machine Translation (MT) and demonstrate it specifically for English-to-Arabic MT.

Nizar Habash | Ahmed El Kholy

[1] Otakar Smrž. Functional Arabic Morphology: Formal System and Implementation , 2007 .

[2] D. R. Fulkerson,et al. Maximal Flow Through a Network , 1956 .

[3] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[4] Hermann Ney,et al. Error Analysis of Verb Inflections in Spanish Translation Output , 2006 .

[5] Hermann Ney,et al. Error Analysis of Statistical Machine Translation Output , 2006, LREC.

[6] Kemal Oflazer,et al. BLEU+: a Tool for Fine-Grained BLEU Computation , 2008, LREC.

[7] George R. Doddington,et al. Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics , 2002 .

[8] Philipp Koehn,et al. Agreement Constraints for Statistical Machine Translation into German , 2011, WMT@EMNLP.

[9] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[10] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[11] Nizar Habash,et al. Semi-automatic error analysis for large-scale statistical machine translation , 2007, MTSUMMIT.