Translation by Machine of Complex Nominals: Getting it Right

We present a method for compositionally translating noun-noun (NN) compounds, using a word-level bilingual dictionary and syntactic templates for candidate generation, and corpus and dictionary statistics for selection. We propose a support vector learning-based method employing target language corpus and bilingual dictionary data, and evaluate it over a English ↔ Japanese machine translation task. We show the proposed method to be superior to previous methods and also robust over low-frequency NN compounds.

[1]  Judith N. Levi,et al.  The syntax and semantics of complex nominals , 1978 .

[2]  Ted Briscoe,et al.  Robust Accurate Statistical Annotation of General Text , 2002, LREC.

[3]  Gregory Grefenstette,et al.  The World Wide Web as a Resource for Example-Based Machine Translation Tasks , 1999, TC.

[4]  Kenji Suzuki,et al.  Using the Web as a Bilingual Dictionary , 2001, DDMMT@ACL.

[5]  John A. Carroll,et al.  Applied morphological processing of English , 2001, Natural Language Engineering.

[6]  Satoshi Shirai,et al.  Toward an MT System without Pre-Editing - Effects of New Methods in ALT-J/E - , 1995, ArXiv.

[7]  Timothy Baldwin,et al.  Noun-Noun Compound Machine Translation A Feasibility Study on Shallow Processing , 2003, Proceedings of the ACL 2003 workshop on Multiword expressions analysis, acquisition and treatment -.

[8]  Philipp Koehn,et al.  Feature-Rich Statistical Translation of Noun Phrases , 2003, ACL.

[9]  Yuji Matsumoto,et al.  Japanese Morphological Analysis System ChaSen version 2.0 Manual , 1999 .

[10]  Yuji Matsumoto,et al.  Japanese Dependency Analysis using Cascaded Chunking , 2002, CoNLL.

[11]  Pascale Fung,et al.  Finding Terminology Translations from Non-parallel Corpora , 1997, VLC.

[12]  Hang Li,et al.  Base Noun Phrase Translation Using Web Data and the EM Algorithm , 2002, COLING.

[13]  Kevin Knight,et al.  Generation that Exploits Corpus-Based Statistical Knowledge , 1998, ACL.

[14]  T. Tanaka Translation selection for Japanese-English noun-noun compounds , 2003 .

[15]  Laurie Bauer,et al.  On the need for pragmatics in the study of nominal compounding , 1979 .

[16]  Mark Stevenson,et al.  The Reuters Corpus Volume 1 -from Yesterday’s News to Tomorrow’s Language Resources , 2002, LREC.

[17]  Grace Ngai,et al.  Transformation Based Learning in the Fast Lane , 2001, NAACL.

[18]  James Breen Building An Electronic Japanese-English Dictionary , 1995 .