TREF - TRanslation Enhancement Framework for Japanese-English

We present a method for improving existing statistical machine translation systems using a knowledge base compiled from a bilingual corpus, together with sequence alignment and pattern matching techniques from machine learning and bioinformatics. An alignment algorithm identifies similar sentences, which are then used to construct a better word order for the translation. Our preliminary test results indicate a significant improvement in translation quality.
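To illustrate how an alignment algorithm can identify similar sentences, the following is a minimal sketch only. It assumes a Needleman-Wunsch style global alignment over word tokens with simple match, mismatch, and gap scores. The abstract does not say which alignment algorithm or scoring scheme TREF uses, so the function names `alignment_score` and `most_similar`, the score values, and the toy corpus are all hypothetical.

```python
from typing import List, Sequence, Tuple


def alignment_score(a: Sequence[str], b: Sequence[str],
                    match: int = 1, mismatch: int = -1, gap: int = -1) -> int:
    """Global alignment score (Needleman-Wunsch) between two token sequences.

    The scoring values are illustrative assumptions, not taken from the paper.
    """
    # prev[j] holds the best score for aligning the processed prefix of a
    # with the first j tokens of b.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def most_similar(query: Sequence[str],
                 corpus: List[Sequence[str]]) -> Tuple[int, Sequence[str]]:
    """Return (score, sentence) for the corpus sentence that aligns best."""
    scored = [(alignment_score(query, sentence), sentence) for sentence in corpus]
    return max(scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    # Hypothetical toy corpus of tokenized sentences.
    corpus = [
        ["the", "dog", "sat"],
        ["a", "bird", "flew", "away"],
    ]
    score, sentence = most_similar(["the", "cat", "sat"], corpus)
    print(score, sentence)
```

In a setting like the one described, the best-aligned sentence and its known translation could then serve as a template for reordering the output words. That reordering step is not shown here, because the abstract gives no detail on how it is done.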
