Segmental Recurrent Neural Networks for End-to-End Speech Recognition
暂无分享,去创建一个
Liang Lu | Noah A. Smith | Chris Dyer | Steve Renals | Lingpeng Kong | Chris Dyer | S. Renals | Lingpeng Kong | Liang Lu
[1] Geoffrey Zweig,et al. Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Wojciech Zaremba,et al. Recurrent Neural Network Regularization , 2014, ArXiv.
[3] Steve Renals,et al. Speech Recognition Using Augmented Conditional Random Fields , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[4] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.
[5] Dong Yu,et al. Deep segmental neural networks for speech recognition , 2013, INTERSPEECH.
[6] Kevin Gimpel,et al. Discriminative segmental cascades for feature-rich phone recognition , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[7] Alex Acero,et al. Hidden conditional random fields for phone classification , 2005, INTERSPEECH.
[8] Shinji Watanabe,et al. Joint CTC-attention based end-to-end speech recognition using multi-task learning , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Yajie Miao,et al. EESEN: End-to-end speech recognition using deep RNN models and WFST-based decoding , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[10] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[11] Tara N. Sainath,et al. Acoustic modelling with CD-CTC-SMBR LSTM RNNS , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[12] Steve Renals,et al. A study of the recurrent neural network encoder-decoder for large vocabulary speech recognition , 2015, INTERSPEECH.
[13] Liang Lu,et al. Multitask Learning with CTC and Segmental CRF for Speech Recognition , 2017, INTERSPEECH.
[14] Navdeep Jaitly,et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.
[15] E. Fosler-Lussier,et al. Efficient Segmental Conditional Random Fields for Phone Recognition , 2012 .
[16] Larry Gillick,et al. Don't multiply lightly: Quantifying problems with the acoustic model assumptions in speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[17] Mari Ostendorf,et al. From HMM's to segment models: a unified view of stochastic modeling for speech recognition , 1996, IEEE Trans. Speech Audio Process..
[18] Alex Graves. Hierarchical Subsampling Networks , 2012 .
[19] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.
[20] William W. Cohen,et al. Semi-Markov Conditional Random Fields for Information Extraction , 2004, NIPS.
[21] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[22] Rich Caruana,et al. Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.
[23] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..
[24] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[25] Mark J. F. Gales,et al. Speech Recognition using SVMs , 2001, NIPS.
[26] Noah A. Smith,et al. Segmental Recurrent Neural Networks , 2015, ICLR.
[27] Erich Elsen,et al. Deep Speech: Scaling up end-to-end speech recognition , 2014, ArXiv.
[28] Eric Fosler-Lussier,et al. Segmental conditional random fields with deep neural networks as acoustic models for first-pass word recognition , 2015, INTERSPEECH.
[29] Quoc V. Le,et al. Listen, Attend and Spell , 2015, ArXiv.
[30] Yoshua Bengio,et al. Attention-Based Models for Speech Recognition , 2015, NIPS.
[31] Geoffrey Zweig,et al. Classification and recognition with direct segment models , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[32] Andrew W. Senior,et al. Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.
[33] Eric Fosler-Lussier,et al. Conditional Random Fields in Speech, Audio, and Language Processing , 2013, Proceedings of the IEEE.