Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition
Tara N. Sainath | Bo Li | Carolina Parada | Gabor Simko | Shuo-Yiin Chang
[1] Maxine Eskénazi, et al. Doing research on a deployed spoken dialogue system: one year of let's go! experience, 2006, INTERSPEECH.
[2] Marc'Aurelio Ranzato, et al. Large Scale Distributed Deep Networks, 2012, NIPS.
[3] David G. Novick, et al. Root causes of lost time and user stress in a simple dialog system, 2005, INTERSPEECH.
[4] Yun Lei, et al. All for one: feature combination for highly channel-degraded speech activity detection, 2013, INTERSPEECH.
[5] Andrew W. Senior, et al. Long short-term memory recurrent neural network architectures for large scale acoustic modeling, 2014, INTERSPEECH.
[6] Tara N. Sainath, et al. Improvements to Deep Convolutional Neural Networks for LVCSR, 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[7] Navdeep Jaitly, et al. Hybrid speech recognition with Deep Bidirectional LSTM, 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[8] Juha Häkkinen, et al. Robust end-of-utterance detection for real-time speech recognition applications, 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[9] Brian Kingsbury, et al. Improvements to the IBM speech activity detection system for the DARPA RATS program, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Tara N. Sainath, et al. Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks, 2016, INTERSPEECH.
[11] Tara N. Sainath, et al. Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection, 2016, INTERSPEECH.
[12] Geoffrey E. Hinton, et al. Speech recognition with deep recurrent neural networks, 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[13] Tara N. Sainath, et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks, 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[14] Alex Graves, et al. Grid Long Short-Term Memory, 2015, ICLR.
[15] Björn W. Schuller, et al. Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies, 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[16] Yoshua Bengio, et al. Understanding the difficulty of training deep feedforward neural networks, 2010, AISTATS.
[17] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.