NICT-ATR Speech-to-Speech Translation System

This paper describes the latest version of the speech-to-speech translation system that the NICT-ATR team has developed over more than twenty years. The system is now ready for deployment in the travel domain. A new noise-suppression technique notably improves speech recognition performance. Corpus-based approaches to recognition, translation, and synthesis enable coverage of a wide variety of topics and portability to other languages.
