Evaluation techniques applied to domain tuning of MT lexicons

We describe a set of evaluation techniques applied to domain tuning of bilingual lexicons for machine translation. Our overall objective is to translate a domain-specific document from a foreign language (in this case, Chinese) into English. First, we perform an intrinsic evaluation of our domain-tuning techniques by comparing our domain-tuned lexicon to a manually constructed domain-specific bilingual termlist. Our results indicate that we achieve 66% recall and 95% precision with respect to a human-derived gold standard. Next, an extrinsic evaluation demonstrates that our domain-tuned lexicon improves Bleu scores by 50% over a statistical system, with a smaller improvement when the system is trained on a uniformly weighted dictionary.
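To make the intrinsic measure concrete, the following is a minimal sketch of how precision and recall of a lexicon against a gold-standard termlist might be computed. It assumes that an entry counts as correct only when its (source term, English translation) pair matches a gold pair exactly; the abstract does not state the matching criterion. The function name `precision_recall` and the toy term pairs are hypothetical and are not data from the paper.

```python
def precision_recall(lexicon_pairs, gold_pairs):
    """Score lexicon entries against a gold termlist.

    Both arguments are iterables of (source_term, english_term) pairs.
    Exact pair matching is an assumption made for this sketch.
    """
    lexicon = set(lexicon_pairs)
    gold = set(gold_pairs)
    matched = lexicon & gold

    # Precision: fraction of proposed entries that appear in the gold list.
    precision = len(matched) / len(lexicon) if lexicon else 0.0
    # Recall: fraction of gold entries that the lexicon recovered.
    recall = len(matched) / len(gold) if gold else 0.0
    return precision, recall


# Invented toy data, for illustration only.
gold_termlist = [
    ("电脑", "computer"),
    ("软件", "software"),
    ("网络", "network"),
    ("数据库", "database"),
]
tuned_lexicon = [
    ("电脑", "computer"),
    ("软件", "software"),
    ("网络", "internet"),  # differs from the gold translation
]

precision, recall = precision_recall(tuned_lexicon, gold_termlist)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

On this toy data, two of the three proposed entries match the gold list and two of the four gold entries are recovered. A high-precision, lower-recall result like the one reported above would correspond to a lexicon whose entries are mostly correct but that misses part of the gold termlist.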
