Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search
暂无分享,去创建一个
Tara N. Sainath | Ian Williams | David Rybach | Anjuli Kannan | Petar S. Aleksic | Anjuli Kannan | David Rybach | Ian Williams
[1] Alex Graves,et al. Sequence Transduction with Recurrent Neural Networks , 2012, ArXiv.
[2] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[3] Ian Williams,et al. Rescoring-Aware Beam Search for Reduced Search Errors in Contextual Automatic Speech Recognition , 2017, INTERSPEECH.
[4] Brian Roark,et al. Bringing contextual information to google speech recognition , 2015, INTERSPEECH.
[5] Tara N. Sainath,et al. A Comparison of Sequence-to-Sequence Models for Speech Recognition , 2017, INTERSPEECH.
[6] Quoc V. Le,et al. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Brian Roark,et al. Composition-based on-the-fly rescoring for salient n-gram biasing , 2015, INTERSPEECH.
[8] Ian Williams,et al. Voice search language model adaptation using contextual information , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[9] David Li,et al. Cross-Lingual Phoneme Mapping for Language Robust Contextual Speech Recognition , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Tara N. Sainath,et al. State-of-the-Art Speech Recognition with Sequence-to-Sequence Models , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Tara N. Sainath,et al. An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model , 2017, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Yoshua Bengio,et al. End-to-end attention-based large vocabulary speech recognition , 2015, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Lucy Vasserman,et al. Contextual language model adaptation using dynamic classes , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).
[14] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[15] Cyril Allauzen,et al. Improved recognition of contact names in voice commands , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[16] Adam Coates,et al. Cold Fusion: Training Seq2Seq Models Together with Language Models , 2017, INTERSPEECH.