论文信息 - FEMTI: creating and using a framework for MT evaluation

FEMTI: creating and using a framework for MT evaluation

This paper presents FEMTI, a web-based Framework for the Evaluation of Machine Translation in ISLE. FEMTI offers structured descriptions of potential user needs, linked to an overview of technical characteristics of MT systems. The description of possible systems is mainly articulated around the quality characteristics for software product set out in ISO/IEC standard 9126. Following the philosophy set out there and in the related 14598 series of standards, each quality characteristic bottoms out in metrics which may be applied to a particular instance of a system in order to judge how satisfactory the system is with respect to that characteristic. An evaluator can use the description of user needs to help identify the specific needs of his evaluation and the relations between them. He can then follow the pointers to system description to determine what metrics should be applied and how. In the current state of the framework, emphasis is on being exhaustive, including as much as possible of the information available in the literature on machine translation evaluation. Future work will aim at being more analytic, looking at characteristics and metrics to see how they relate to one another, validating metrics and investigating the correlation between particular metrics and human judgement.

M. King | Andrei Popescu-Belis | Eduard H. Hovy | Andrei Popescu-Belis

[1] John S. White,et al. The ARPA MT Evaluation Methodologies: Evolution, Lessons, and Future Approaches , 1994, AMTA.

[2] Hermann Ney,et al. Improved Alignment Models for Statistical Machine Translation , 1999, EMNLP.

[3] Douglas W. Oard. The CLEF 2001 Interactive Track , 2001, CLEF.

[4] John Hutchins. Machine translation and human translation: in competition or in complementation? , 2001 .

[5] Martin Rajman,et al. Automatic Ranking of MT Systems , 2002, LREC.

[6] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[7] Srinivas Bangalore,et al. Head-Transducer Models for Speech Translation and Their Automatic Acquisition from Bilingual Data , 2004, Machine Translation.

[8] Andrei Popescu-Belis,et al. Principles of Context-Based Machine Translation Evaluation , 2002, Machine Translation.