Improved End-of-Query Detection for Streaming Speech Recognition
暂无分享,去创建一个
Matt Shannon | Carolina Parada | Gabor Simko | Shuo-Yiin Chang | Matt Shannon | Shuo-yiin Chang | Carolina Parada | Gabor Simko
[1] David Schlangen,et al. From reaction to prediction: experiments with computational models of turn-taking , 2006, INTERSPEECH.
[2] Maxine Eskénazi,et al. Doing research on a deployed spoken dialogue system: one year of let's go! experience , 2006, INTERSPEECH.
[3] Julia Hirschberg,et al. Turn-taking cues in task-oriented dialogue , 2011, Comput. Speech Lang..
[4] Andreas Stolcke,et al. Is the speaker done yet? faster and more accurate end-of-utterance detection using prosody , 2002, INTERSPEECH.
[5] Andreas Stolcke,et al. A prosody-based approach to end-of-utterance detection that does not require speech recognition , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[6] Juha Häkkinen,et al. Robust end-of-utterance detection for real-time speech recognition applications , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[7] Björn W. Schuller,et al. Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[8] Maxine Eskénazi,et al. Optimizing Endpointing Thresholds using Dialogue Features in a Spoken Dialogue System , 2008, SIGDIAL Workshop.
[9] Tara N. Sainath,et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Tara N. Sainath,et al. Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection , 2016, INTERSPEECH.
[11] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[12] David G. Novick,et al. Root causes of lost time and user stress in a simple dialog system , 2005, INTERSPEECH.
[13] Robert Porzel,et al. The Tao of CHI: Towards Effective Human-Computer Interaction , 2004, NAACL.
[14] Ariya Rastrow,et al. Accurate endpointing with expected pause duration , 2015, INTERSPEECH.