Feature combination and stacking of recurrent and non-recurrent neural networks for LVCSR
暂无分享,去创建一个
[1] Hermann Ney,et al. Acoustic feature combination for robust speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[2] Daniel Povey,et al. Revisiting Recurrent Neural Networks for robust ASR , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[3] Hermann Ney,et al. Improved Acoustic Feature Combination for LVCSR by Neural Networks , 2011, INTERSPEECH.
[4] Hermann Ney,et al. Hierarchical bottle neck features for LVCSR , 2010, INTERSPEECH.
[5] András Zolnay,et al. Acoustic feature combination for speech recognition , 2006 .
[6] Björn W. Schuller,et al. A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[7] Björn W. Schuller,et al. Recognition of spontaneous conversational speech using long short-term memory phoneme predictions , 2010, INTERSPEECH.
[8] Salvador España Boquera,et al. Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[9] Kuldip K. Paliwal,et al. Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..
[10] Hynek Hermansky,et al. TRAPS - classifiers of temporal patterns , 1998, ICSLP.
[11] Hermann Ney,et al. Feature combination using linear discriminant analysis and its pitfalls , 2006, INTERSPEECH.
[12] Hermann Ney,et al. The RWTH 2010 Quaero ASR evaluation system for English, French, and German , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[13] Fabio Valente,et al. Hierarchical neural networks feature extraction for LVCSR system , 2007, INTERSPEECH.
[14] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[15] Hermann Ney,et al. Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both? , 2012, INTERSPEECH.
[16] Steve Renals,et al. IPA: improved phone modelling with recurrent neural networks , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[17] Gunnar Evermann,et al. Posterior probability decoding, confidence estimation and system combination , 2000 .
[18] Hermann Ney,et al. LSTM Neural Networks for Language Modeling , 2012, INTERSPEECH.
[19] Hynek Hermansky,et al. Multi-resolution RASTA filtering for TANDEM-based ASR , 2005, INTERSPEECH.
[20] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[21] Georg Heigold,et al. Development of the GALE 2008 Mandarin LVCSR system , 2009, INTERSPEECH.
[22] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[23] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .
[24] Hermann Ney,et al. Gammatone Features and Feature Combination for Large Vocabulary Speech Recognition , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[25] Tara N. Sainath,et al. Making Deep Belief Networks effective for large vocabulary continuous speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[26] Mathew Magimai-Doss,et al. Hierarchical processing of the modulation spectrum for GALE Mandarin LVCSR system , 2009, INTERSPEECH.
[27] Jan Cernocký,et al. Probabilistic and Bottle-Neck Features for LVCSR of Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.