论文信息 - NICT-ATR Speech-to-Speech Translation System

NICT-ATR Speech-to-Speech Translation System

This paper describes the latest version of speech-to-speech translation systems developed by the team of NICT-ATR for over twenty years. The system is now ready to be deployed for the travel domain. A new noise-suppression technique notably improves speech recognition performance. Corpus-based approaches of recognition, translation, and synthesis enable coverage of a wide variety of topics and portability to other languages.

Satoshi Nakamura | Eiichiro Sumita | Tohru Shimizu

[1] Shuntaro Isogai,et al. Multi-class composite N-gram language model , 2003, Speech Commun..

[2] Satoshi Nakamura,et al. Development of client-server speech translation system on a multi-lingual speech communication platform , 2006, IWSLT.

[3] Keiichi Tokuda,et al. XIMERA: a new TTS from ATR based on corpus-based technologies , 2004, SSW.

[4] Toshiyuki Takezawa,et al. A Comparative Study on Human Communication Behaviors and Linguistic Characteristics for Speech-to-Speech Translation , 2004, LREC.

[5] Frank K. Soong,et al. Generalized posterior probability for minimum error verification of recognized sentences , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[6] Eiichiro Sumita,et al. Subword-based Tagging by Conditional Random Fields for Chinese Word Segmentation , 2006, NAACL.

[7] Toshiyuki Takezawa,et al. Collecting machine-translation-aided bilingual dialogues for corpus-based speech translation , 2003, INTERSPEECH.

[8] Keiichi Tokuda,et al. Speech parameter generation algorithms for HMM-based speech synthesis , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9] Eiichiro Sumita,et al. The NiCT-ATR statistical machine translation system for IWSLT 2006 , 2006, IWSLT.

[10] Satoshi Nakamura,et al. Automatic generation of non-uniform context-dependent HMM topologies based on the MDL criterion , 2003, INTERSPEECH.

[11] Satoshi Nakamura,et al. The ATR Multilingual Speech-to-Speech Translation System , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[12] Eiichiro Sumita,et al. Creating corpora for speech-to-speech translation , 2003, INTERSPEECH.

[13] Tomoki Toda,et al. Optimizing sub-cost functions for segment selection based on perceptual evaluations in concatenative speech synthesis , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[14] Satoshi Nakamura,et al. Optimal acoustic and language model weights for minimizing word verification errors , 2004, INTERSPEECH.