Bridging the Gap between Pre-Training and Fine-Tuning for End-to-End Speech Translation
暂无分享,去创建一个
M. Zhou | Shujie Liu | Yu Wu | Zhenglu Yang | Chengyi Wang
[1] Quoc V. Le,et al. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition , 2019, INTERSPEECH.
[2] Jiajun Zhang,et al. End-to-End Speech Translation with Knowledge Distillation , 2019, INTERSPEECH.
[3] Matthias Sperber,et al. Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation , 2019, TACL.
[4] Yuan Cao,et al. Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation , 2018, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Haifeng Wang,et al. STACL: Simultaneous Translation with Implicit Anticipation and Controllable Latency using Prefix-to-Prefix Framework , 2018, ACL.
[6] Adam Lopez,et al. Pre-training on high-resource speech recognition improves low-resource speech-to-text translation , 2018, NAACL.
[7] Taku Kudo,et al. Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates , 2018, ACL.
[8] Shuang Xu,et al. Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Shinji Watanabe,et al. ESPnet: End-to-End Speech Processing Toolkit , 2018, INTERSPEECH.
[10] David Chiang,et al. Tied Multitask Learning for Neural Speech Translation , 2018, NAACL.
[11] Olivier Pietquin,et al. End-to-End Automatic Speech Translation of Audiobooks , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Ali Can Kocabiyikoglu,et al. Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation , 2018, LREC.
[13] Kevin Duh,et al. The JHU/KyotoU Speech Translation System for IWSLT 2018 , 2018, IWSLT.
[14] Mauro Cettolo,et al. The IWSLT 2018 Evaluation Campaign , 2018, IWSLT.
[15] I-Hung Hsu,et al. Mitigating the impact of speech recognition errors on chatbot using sequence-to-sequence model , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[16] Satoshi Nakamura,et al. Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation , 2017, INTERSPEECH.
[17] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[18] Navdeep Jaitly,et al. Sequence-to-Sequence Models Can Directly Translate Foreign Speech , 2017, INTERSPEECH.
[19] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[20] Olivier Pietquin,et al. Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation , 2016, NIPS 2016.
[21] David Chiang,et al. An Attentional Model for Speech Translation Without Transcription , 2016, NAACL.
[22] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.
[23] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[24] Sanjeev Khudanpur,et al. A time delay neural network architecture for efficient modeling of long temporal contexts , 2015, INTERSPEECH.
[25] Paul Deléglise,et al. Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks , 2014, LREC.
[26] Florian Metze,et al. Augmenting Translation Models with Simulated Acoustic Confusions for Improved Spoken Language Translation , 2014, EACL.
[27] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[28] Robert Munro. Crowdsourced translation for emergency response in Haiti: the global collaboration of local knowledge , 2010, AMTA.
[29] Sylvain Meignier,et al. LIUM SPKDIARIZATION: AN OPEN SOURCE TOOLKIT FOR DIARIZATION , 2010 .
[30] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[31] Hermann Ney,et al. On the integration of speech recognition and statistical machine translation , 2005, INTERSPEECH.
[32] Hermann Ney,et al. Alignment templates: the RWTH SMT system , 2004, IWSLT.
[33] Hermann Ney,et al. Speech translation: coupling of recognition and translation , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[34] Biing-Hwang Juang,et al. Hidden Markov Models for Speech Recognition , 1991 .