Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise
暂无分享,去创建一个
Björn W. Schuller | Zixing Zhang | Martin Wöllmer | Gerhard Rigoll | Felix Weninger | Björn Schuller | M. Wöllmer | F. Weninger | G. Rigoll | Zixing Zhang
[1] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[2] Jürgen Schmidhuber,et al. Learning to forget: continual prediction with LSTM , 1999 .
[3] Björn W. Schuller,et al. Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Björn W. Schuller,et al. A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[5] Björn W. Schuller,et al. Online Driver Distraction Detection Using Long Short-Term Memory , 2011, IEEE Transactions on Intelligent Transportation Systems.
[6] Paris Smaragdis,et al. A non-negative approach to semi-supervised separation of speech from noise with the use of temporal dynamics , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[7] Björn W. Schuller,et al. Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening , 2010, IEEE Journal of Selected Topics in Signal Processing.
[8] Björn W. Schuller,et al. Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise , 2012, INTERSPEECH.
[9] Björn W. Schuller,et al. Keyword spotting exploiting Long Short-Term Memory , 2013, Speech Commun..
[10] David Barber,et al. Switching Linear Dynamical Systems for Noise Robust Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[11] Tuomas Virtanen,et al. Exemplar-based speech enhancement and its application to noise-robust automatic speech recognition , 2011 .
[12] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[13] Maurice Charbit,et al. Factorial Scaled Hidden Markov Model for polyphonic audio representation and source separation , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
[14] Jan Cernocký,et al. Probabilistic and Bottle-Neck Features for LVCSR of Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[15] Jürgen Schmidhuber,et al. Framewise phoneme classification with bidirectional LSTM and other neural network architectures , 2005, Neural Networks.
[16] Marcus Liwicki,et al. A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , 2007 .
[17] Phil D. Green,et al. Speech enhancement with missing data techniques using recurrent neural networks , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[18] Björn W. Schuller,et al. OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Björn Schuller,et al. On-line Driver Distraction Detection using Long Short-Term Memory , 2011 .
[20] H. Sebastian Seung,et al. Algorithms for Non-negative Matrix Factorization , 2000, NIPS.
[21] Ning Ma,et al. The CHiME corpus: a resource and a challenge for computational hearing in multisource environments , 2010, INTERSPEECH.
[22] Tuomas Virtanen,et al. Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition , 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[23] Björn W. Schuller,et al. Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement , 2009, EURASIP J. Audio Speech Music. Process..