Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition
暂无分享,去创建一个
Bhuvana Ramabhadran | Mark Hasegawa-Johnson | Andrew Rosenberg | Samuel Thomas | Xuesong Yang | Kartik Audhkhasi | M. Hasegawa-Johnson | B. Ramabhadran | Samuel Thomas | Kartik Audhkhasi | A. Rosenberg | Xuesong Yang
[1] Andrew W. Senior,et al. Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.
[2] Daniel Jurafsky,et al. Lexicon-Free Conversational Speech Recognition with Neural Networks , 2015, NAACL.
[3] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Bin Liu,et al. CTC regularized model adaptation for improving LSTM RNN based multi-accent Mandarin speech recognition , 2016, 2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP).
[5] Yifan Gong,et al. Multi-accent deep neural network acoustic model with accent-specific top layer using the KLD-regularized model adaptation , 2014, INTERSPEECH.
[6] Hervé Bourlard,et al. Generalization and Parameter Estimation in Feedforward Netws: Some Experiments , 1989, NIPS.
[7] Yanpeng Li,et al. Improving deep neural networks based multi-accent Mandarin speech recognition using i-vectors and accent-specific top layer , 2015, INTERSPEECH.
[8] Julia Hirschberg,et al. Using prosody and phonotactics in Arabic dialect identification , 2009, INTERSPEECH.
[9] Steve Renals,et al. A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition , 2015, INTERSPEECH.
[10] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[11] Olivier Siohan,et al. Selection and combination of hypotheses for dialectal speech recognition , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Pedro J. Moreno,et al. Towards acoustic model unification across dialects , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[13] Hasim Sak,et al. Multi-accent speech recognition with hierarchical grapheme based models , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] Yi Su,et al. Accent detection and speech recognition for Shanghai-accented Mandarin , 2005, INTERSPEECH.
[15] Pedro J. Moreno,et al. Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt , 2015, ICNLSP.
[16] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[17] Yajie Miao,et al. EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[18] Navdeep Jaitly,et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.
[19] A. D. Shveĭt︠s︡er,et al. Introduction to sociolinguistics , 1986 .
[20] Hal Daumé,et al. Frustratingly Easy Domain Adaptation , 2007, ACL.
[21] Janet B. Pierrehumbert,et al. Phonological Representation: Beyond Abstract Versus Episodic , 2016 .
[22] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.
[23] Geoffrey Zweig,et al. Achieving Human Parity in Conversational Speech Recognition , 2016, ArXiv.
[24] Yoshua Bengio,et al. End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results , 2014, ArXiv.
[25] Bhuvana Ramabhadran,et al. Direct Acoustics-to-Word Models for English Conversational Speech Recognition , 2017, INTERSPEECH.
[26] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[27] Xiaodong Cui,et al. English Conversational Telephone Speech Recognition by Humans and Machines , 2017, INTERSPEECH.
[28] Tao Chen,et al. Accent Issues in Large Vocabulary Continuous Speech Recognition , 2004, Int. J. Speech Technol..
[29] Brian Kingsbury,et al. The IBM Attila speech recognition toolkit , 2010, 2010 IEEE Spoken Language Technology Workshop.