论文信息 - IWSLT-06: experiments with commercial MT systems and lessons from subjective evaluations

IWSLT-06: experiments with commercial MT systems and lessons from subjective evaluations

This is a short report of our participation to IWSLT-06. First, we let 2 commercial systems participate as fairly as possible (Systran v5.0 for CE, JE, AE, & IE, Atlas-II for JE), taking care of preprocessing and postprocessing tasks, and tuning as many "pairs" as possible by creating "user dictionaries" and finding a good combination of parameters (such as dictionary priority). Second, we took part in the subjective evaluation of CE results (fluency and adequacy). Details on experiments and methodological remarks are provided, with a perspective to introduce less expensive and more objective humanand task-related evaluation methods.

Christian Boitet | Hervé Blanchon | Mutsuko Tomokiyo | Youcef Bey | Wenjie Cao

[1] Christian Boitet,et al. A Rationale for Using UNL as an Interlingua and More in Various Domains , 2005 .

[2] D. W. Barron. Machine Translation , 1968, Nature.

[3] Philipp Koehn,et al. Re-evaluating the Role of Bleu in Machine Translation Research , 2006, EACL.

[4] Christian Boitet,et al. Towards fairer evaluations of commercial MT systems on basic travel expressions corpora , 2004, IWSLT.

[5] Lynn Nadel,et al. Encyclopedia of Cognitive Science , 2003 .

[6] S. Siegel,et al. Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[7] Eiichiro Sumita,et al. Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World , 2002, LREC.

[8] Christian Boitet,et al. Spoken dialogue translation systems evaluation: results, new trends, problems and proposals , 2004, IWSLT.

[9] Jessie Pinkham,et al. Traduction automatique ancrée dans l’analyse linguistique , 2002, JEPTALNRECITAL.

[10] Francisco Casacuberta,et al. A Quantitative Method for Machine Translation Evaluation , 2003 .

[11] H. Ney,et al. A novel string-to-string distance measure with applications to machine translation evaluation , 2003, MTSUMMIT.

[12] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[13] Joseph P. Turian,et al. Evaluation of machine translation and its evaluation , 2003, MTSUMMIT.