Fully Convolutional ASR for Less-Resourced Endangered Languages
暂无分享,去创建一个
Raymond W. Ptucha | Emily Tucker Prud'hommeaux | Bao Thai | Robert Jimerson | R. Ptucha | Robert Jimerson | Bao Thai
[1] Brian Kingsbury,et al. Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Solange Rossato,et al. Semi-supervised G2p bootstrapping and its application to ASR for a very under-resourced language: Iban , 2014, SLTU.
[3] Jayadev Billa,et al. ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages , 2018, INTERSPEECH.
[4] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[5] Kenneth Heafield,et al. KenLM: Faster and Smaller Language Model Queries , 2011, WMT@EMNLP.
[6] Gabriel Synnaeve,et al. Wav2Letter: an End-to-End ConvNet-based Speech Recognition System , 2016, ArXiv.
[7] Hermann Ney,et al. Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages , 2014, INTERSPEECH.
[8] Sanjeev Khudanpur,et al. Semi-Supervised Training of Acoustic Models Using Lattice-Free MMI , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Steve Renals,et al. Untranscribed Web Audio for Low Resource Speech Recognition , 2019, INTERSPEECH.
[10] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition , 2012 .
[11] Tara N. Sainath,et al. State-of-the-Art Speech Recognition with Sequence-to-Sequence Models , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.
[13] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[14] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[15] Shubham Bansal,et al. Active Learning Methods for Low Resource End-to-End Speech Recognition , 2019, INTERSPEECH.
[16] Zhuowen Tu,et al. Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[17] Jürgen Schmidhuber,et al. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks , 2006, ICML.
[18] Petr Motlícek,et al. Using out-of-language data to improve an under-resourced speech recognizer , 2014, Speech Communication.
[19] Bhuvana Ramabhadran,et al. End-to-end speech recognition and keyword search on low-resource languages , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[20] Thomas Niesler,et al. Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages , 2019, Comput. Speech Lang..
[21] Dumitru Erhan,et al. Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[22] Sanjeev Khudanpur,et al. A time delay neural network architecture for efficient modeling of long temporal contexts , 2015, INTERSPEECH.
[23] Mark J. F. Gales,et al. Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED , 2014, SLTU.
[24] Raymond W. Ptucha,et al. Improving ASR Output for Endangered Language Documentation , 2018, SLTU.
[25] Yu Zhang,et al. Very deep convolutional networks for end-to-end speech recognition , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .