Data Augmentation Improves Recognition of Foreign Accented Speech
Bhuvana Ramabhadran | Andrew Rosenberg | Takashi Fukuda | Samuel Thomas | Alexander Sorin | Gakuto Kurata | Raul Fernandez
[1] Bhuvana Ramabhadran, et al. Domain Adaptation of CNN Based Acoustic Models Under Limited Resource Settings, 2016, INTERSPEECH.
[2] Paavo Alku, et al. Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering, 1991, Speech Commun.
[3] Xiaodong Cui, et al. Data Augmentation for Deep Neural Network Acoustic Modeling, 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[4] Jean Carletta, et al. Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus, 2007, Lang. Resour. Evaluation.
[5] Yevgen Chebotar, et al. Distilling Knowledge from Ensembles of Neural Networks for Speech Recognition, 2016, INTERSPEECH.
[6] D. Crystal. English as a global language: Why a global language?, 2003.
[7] Satoshi Nakamura, et al. Acoustical Sound Database in Real Environments for Sound Scene Understanding and Hands-Free Speech Recognition, 2000, LREC.
[8] Shuichi Itahashi, et al. Recent speech database projects in Japan, 1990, ICSLP.
[9] Bhuvana Ramabhadran, et al. Efficient Knowledge Distillation from an Ensemble of Teachers, 2017, INTERSPEECH.
[10] Richard M. Schwartz, et al. Two-Stage Data Augmentation for Low-Resourced Speech Recognition, 2016, INTERSPEECH.
[11] Li Deng, et al. Large-vocabulary speech recognition under adverse acoustic environments, 2000, INTERSPEECH.
[12] Hank Liao, et al. Speaker adaptation of context dependent deep neural networks, 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[13] J. Liljencrants, et al. A Four-parameter Model of Glottal Flow, 1985, Dept. for Speech, Music and Hearing Quarterly Progress and Status Report.
[14] Slava Shechtman, et al. Semi Parametric Concatenative TTS with Instant Voice Modification Capabilities, 2017, INTERSPEECH.
[15] Bhuvana Ramabhadran, et al. Voice-transformation-based data augmentation for prosodic classification, 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Sanjeev Khudanpur, et al. Audio augmentation for speech recognition, 2015, INTERSPEECH.
[17] Brian Kingsbury, et al. Knowledge distillation across ensembles of multilingual models for low-resource languages, 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Matthew Richardson, et al. Compressing LSTMs into CNNs, 2015, ArXiv.
[19] Mark J. F. Gales, et al. Data augmentation for low resource languages, 2014, INTERSPEECH.