A corpus-centered approach to spoken language translation

This paper reports the latest performance of components and features of a project named Corpus-Centered Computation (C3), which targets a translation technology suitable for spoken language translation. C3 places corpora at the center of the technology. Translation knowledge is extracted from corpora by both EBMT and SMT methods, translation quality is gauged by referring to corpora, the best translation among multiple-engine outputs is selected based on corpora and the corpora themselves are paraphrased or filtered by automated processes.

[1]  Toshiyuki Takezawa,et al.  Proposal of a very-large-corpus acquisition method by cell-formed registration , 2002, LREC.

[2]  Sergei Nirenburg,et al.  A Statistical Approach to Machine Translation , 2003 .

[3]  Eiichiro Sumita,et al.  Toward a Broad-coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World , 2002, LREC.

[4]  Yuji Matsumoto,et al.  Automatic Construction of Machine Translation Knowledge Using Translation Literalness , 2003, EACL.

[5]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[6]  Taro Watanabe,et al.  Bidirectional Decoding for Statistical Machine Translation , 2002, COLING.

[7]  Yoshinori Sagisaka,et al.  Evaluation of the ATR-matrix speech translation system with a pair comparison method between the system and humans , 2000, INTERSPEECH.

[8]  Eiichiro Sumita,et al.  Reliability measures for translation quality , 2002, INTERSPEECH.

[9]  Yuji Matsumoto,et al.  Automatic construction of machine translation knowledge using translation literalness , 2003 .

[10]  Eiichiro Sumita,et al.  Automatic paraphrasing based on parallel corpus for normalization , 2002, LREC.

[11]  Kenji Imamura,et al.  Application of translation knowledge acquired by hierarchical phrase alignment for pattern-based MT. , 2002, TMI.

[12]  Ralf D. Brown,et al.  Applying Statistical English Language Modelling to Symbolic Machine Translation , 1995, TMI.

[13]  Makoto Nagao,et al.  A framework of a mechanical translation between Japanese and English by analogy principle , 1984 .

[14]  Eiichiro Sumita,et al.  Solutions to Problems Inherent in Spoken-language Translation: The ATR-MATRIX Approach , 1999 .

[15]  Harold L. Somers,et al.  Review Article: Example-based Machine Translation , 1999, Machine Translation.

[16]  Srinivas Bangalore,et al.  Learning Dependency Translation Models as Collections of Finite-State Head Transducers , 2000, Computational Linguistics.

[17]  Hermann Ney Stochastic Modelling: From Pattern Classification to Language Translation , 2001, DDMMT@ACL.

[18]  Taro Watanabe,et al.  Using Language and Translation Models to Select the Best among Outputs from Multiple MT Systems , 2002, COLING.

[19]  Eiichiro Sumita,et al.  Identifying Synonymous Expressions from a Bilingual Corpus for Example-Based Machine Translation , 2002, COLING 2002.

[20]  Eiichiro Sumita Example-based machine translation using DP-matching between work sequences , 2001, DDMMT@ACL.

[21]  Kevin Knight,et al.  Automating Knowledge Acquisition for Machine Translation , 1997, AI Mag..

[22]  Chris Callison-Burch,et al.  A program for automatically selecting the best output from multiple machine translation engines , 2001, MTSUMMIT.

[23]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[24]  Eiichiro Sumita,et al.  Using multiple edit distances to automatically rank machine translation output , 2001, MTSUMMIT.