An end-to-end model for cross-lingual transformation of paralinguistic information

[1]  Satoshi Nakamura,et al.  Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis , 2017, INTERSPEECH.

[2]  Navdeep Jaitly,et al.  Sequence-to-Sequence Models Can Directly Translate Foreign Speech , 2017, INTERSPEECH.

[3]  Tomoki Toda,et al.  Preserving Word-Level Emphasis in Speech-to-Speech Translation , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[4]  David Chiang,et al.  An Attentional Model for Speech Translation Without Transcription , 2016, NAACL.

[5]  Julie Carson-Berndsen,et al.  Facial expression-based affective speech translation , 2014, Journal on Multimodal User Interfaces.

[6]  Shrikanth S. Narayanan,et al.  Enriching machine-mediated speech-to-speech translation using contextual information , 2013, Comput. Speech Lang..

[7]  Alan W. Black,et al.  Intent transfer in speech-to-speech machine translation , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[8]  Tomoki Toda,et al.  Voice Conversion Based on Maximum-Likelihood Estimation of Spectral Parameter Trajectory , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  Philipp Koehn,et al.  Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) , 2007 .

[10]  Philipp Koehn,et al.  Factored Translation Models , 2007, EMNLP.

[11]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[12]  Heiga Zen,et al.  Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[13]  Jordi Adell,et al.  Prosody Generation for Speech-to-Speech Translation , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[14]  Satoshi Nakamura,et al.  Multi-modal translation system and its evaluation , 2002, Proceedings. Fourth IEEE International Conference on Multimodal Interfaces.

[15]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[16]  Wolfgang Wahlster,et al.  Robust Translation of Spontaneous Speech: A Multi-Engine Approach , 2001, IJCAI.

[17]  David Pearce,et al.  The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[18]  N. Campbell,et al.  A Japanese-to-English speech translation system: ATR-MATRIX , 1998, ICSLP.

[19]  Eric Moulines,et al.  Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..

[20]  Satoshi Nakamura,et al.  Voice conversion through vector quantization , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[21]  R. G. Leonard,et al.  A database for speaker-independent digit recognition , 1984, ICASSP.

[22]  Navdeep Jaitly,et al.  Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech , 2017, ArXiv.

[23]  Markus Dreyer,et al.  APRO: All-Pairs Ranking Optimization for MT Tuning , 2015, NAACL.

[24]  Tomoki Toda,et al.  Improving translation of emphasis with pause prediction in speech-to-speech translation systems , 2015, IWSLT.

[25]  Tomoki Toda,et al.  Generalizing continuous-space translation of paralinguistic information , 2013, INTERSPEECH.

[26]  Tomoki Toda,et al.  An empirical comparison of joint optimization techniques for speech translation , 2013, INTERSPEECH.

[27]  Tomoki Toda,et al.  The NAIST machine translation system for IWSLT2012 , 2012, IWSLT.

[28]  Tomoki Toda,et al.  A method for translation of paralinguistic information , 2012, IWSLT.

[29]  Andy Way,et al.  Phonetic Representation-Based Speech Translation , 2011, MTSUMMIT.

[30]  Chris Callison-Burch,et al.  Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .