COMPARING GRU AND LSTM FOR AUTOMATIC SPEECH RECOGNITION
暂无分享,去创建一个
Benjamin Lecouteux | Laurent Besacier | Shubham Khandelwal | L. Besacier | B. Lecouteux | Shubham Khandelwal
[1] Navdeep Jaitly,et al. Hybrid speech recognition with Deep Bidirectional LSTM , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[2] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .
[3] Yajie Miao,et al. EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[4] Björn W. Schuller,et al. Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling , 2014, INTERSPEECH.
[5] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[6] Yoshua Bengio,et al. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.
[7] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[8] Hui Jiang,et al. Higher Order Recurrent Neural Networks , 2016, ArXiv.
[9] Georg Heigold,et al. Sequence discriminative distributed training of long short-term memory recurrent neural networks , 2014, INTERSPEECH.
[10] Navdeep Jaitly,et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.
[11] Andrew W. Senior,et al. Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.
[12] Chong Wang,et al. Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.
[13] Paul Deléglise,et al. TED-LIUM: an Automatic Speech Recognition dedicated corpus , 2012, LREC.
[14] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[15] Tim Rocktäschel,et al. MuFuRU: The Multi-Function Recurrent Unit , 2016, ArXiv.
[16] Quoc V. Le,et al. Listen, Attend and Spell , 2015, ArXiv.
[17] Alex Graves,et al. Connectionist Temporal Classification , 2012 .
[18] George Saon,et al. Feature and model space speaker adaptation with full covariance Gaussians , 2006, INTERSPEECH.
[19] Jürgen Schmidhuber,et al. Highway Networks , 2015, ArXiv.
[20] Geoffrey E. Hinton,et al. Acoustic Modeling Using Deep Belief Networks , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[21] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[22] William Chan,et al. Deep Recurrent Neural Networks for Acoustic Modelling , 2015, ArXiv.
[23] Jörg Tiedemann,et al. Parallel Data, Tools and Interfaces in OPUS , 2012, LREC.
[24] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.