JETCAT - Japanese-English Translation Using Corpus-Based Acquisition of Transfer Rules

In this paper we present a rule-based formalism for the acquisition, representation, and application of the transfer knowledge used in a Japanese-English machine translation system. The transfer knowledge is learnt automatically from a parallel corpus by using structural matching between the parse trees of translation pairs. The user can customize the rule base by simply correcting translation results. We have extended the machine translation system with two user-friendly front ends: an MSWord interface and a Web interface. Since our system is mainly intended as a tool for language students to convey a better understanding of Japanese, we also offer the display of detailed information about lexical, syntactic, and transfer knowledge. The system has been implemented in Amzi! Prolog, using the Amzi! Logic Server Visual Basic Module and the Amzi! Logic Server CGI Interface to develop the front ends.

[1]  Kevin Knight,et al.  Building a Large-Scale Knowledge Base for Machine Translation , 1994, AAAI.

[2]  Hitoshi Iida,et al.  Cooperation between Transfer and Analysis in Example-Based Framework , 1992, COLING.

[3]  Takako Aikawa,et al.  English-Japanese Example-Based Machine Translation Using Abstract Linguistic Representations , 2002, COLING-02 on Machine translation in Asia -.

[4]  Taro Watanabe,et al.  Statistical machine translation based on hierarchical phrase alignment. , 2002, TMI.

[5]  Sergei Nirenburg,et al.  A lexicon for knowledge-based MT , 1995, Machine Translation.

[6]  Arul Menezes,et al.  Overcoming the customization bottleneck using example-based MT , 2001, DDMMT@ACL.

[7]  Gerald B. Mathias,et al.  A Dictionary of Intermediate Japanese Grammar , 1996 .

[8]  Matsuo Soga,et al.  A dictionary of basic Japanese grammar , 1986 .

[9]  John HUTCHINS Towards a Definition of Example-based Machine Translation , 2005, MTSUMMIT.

[10]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[11]  Kenji Imamura,et al.  Hierarchical Phrase Alignment Harmonized with Parsing , 2001, NLPRS.

[12]  Werner Winiwarter Incremental Learning of Transfer Rules for Customized Machine Translation , 2004, INAP/WLP.

[13]  Nigel G. Ward Machine Translation: Past, Present, Future , 2001 .

[14]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[15]  Sergei Nirenburg,et al.  Book Reviews: Machine Translation: A Knowledge-Based Approach , 1993, CL.

[16]  Harold L. Somers,et al.  An introduction to machine translation , 1992 .

[17]  Fred Popowich,et al.  What is example-based machine translation? , 2001, MTSUMMIT.

[18]  John Hutchins Has machine translation improved? some historical comparisons , 2003 .

[19]  Yuji Matsumoto,et al.  Japanese Morphological Analysis System ChaSen version 2.0 Manual , 1999 .

[20]  M. Carl Towards a Model of Competence for Corpus-Based Machine Translation , 1999 .

[21]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[22]  牧野 成一,et al.  日本語基本文法辞典 = A dictionary of basic Japanese grammar , 1989 .

[23]  Ulrich Germann Making Semantic Interpretation Parser-Independent , 1998, AMTA.

[24]  Andy Way,et al.  Recent Advances in Example-Based Machine Translation , 2004 .

[25]  John Newton Computers in translation : a practical appraisal , 1992 .

[26]  Akira Kubota,et al.  Handbook of modern Japanese grammar , 1982, The Journal of Asian Studies.

[27]  Kevin Knight,et al.  A Syntax-based Statistical Translation Model , 2001, ACL.

[28]  Deryle Lonsdale,et al.  A Reasoned Interlingua for Knowledge-Based Machine Translation , 1994 .

[29]  Harold L. Somers,et al.  Computers and translation : a translator's guide , 2003 .