Noisy training for deep neural networks in speech recognition
暂无分享,去创建一个
Thomas Fang Zheng | Dong Wang | Chao Liu | Yiye Lin | Shi Yin | Javier Tejedor | Yinguo Li | Zhiyong Zhang | Dong Wang | T. Zheng | Shi Yin | Yinguo Li | Javier Tejedor | Zhiyong Zhang | Chao Liu | Yiye Lin
[1] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[2] Hermann Ney,et al. Hierarchical bottle neck features for LVCSR , 2010, INTERSPEECH.
[3] Dong Yu,et al. Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..
[4] Yves Grandvalet,et al. Comments on "Noise injection into inputs in back propagation learning" , 1995, IEEE Trans. Syst. Man Cybern..
[5] Dong Yu,et al. Feature engineering in Context-Dependent Deep Neural Networks for conversational speech transcription , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[6] Tara N. Sainath,et al. Making Deep Belief Networks effective for large vocabulary continuous speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[7] Dong Wang,et al. Noisy training for deep neural networks , 2014, 2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP).
[8] Partha Lal,et al. Cross-Lingual Automatic Speech Recognition Using Tandem Features , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[9] Dong Yu,et al. Roles of Pre-Training and Fine-Tuning in Context-Dependent DBN-HMMs for Real-World Speech Recognition , 2010 .
[10] Patrick Brézillon,et al. Lecture Notes in Artificial Intelligence , 1999 .
[11] Oriol Vinyals,et al. Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Jasha Droppo,et al. Multi-task learning in deep neural networks for improved phoneme recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[13] Daniel P. W. Ellis,et al. Tandem connectionist feature extraction for conventional HMM systems , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[14] Hermann Ney,et al. Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both? , 2012, INTERSPEECH.
[15] Guozhong An,et al. The Effects of Adding Noise During Backpropagation Training on a Generalization Performance , 1996, Neural Computation.
[16] Hervé Bourlard,et al. Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions , 1997, Summer School on Neural Networks.
[17] Frantisek Grézl,et al. Optimizing bottle-neck features for lvcsr , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[18] Yu Tsao,et al. An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition , 2013, INTERSPEECH.
[19] Marco Gori,et al. Adaptive Processing of Sequences and Data Structures , 1998, Lecture Notes in Computer Science.
[20] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[21] Tara N. Sainath,et al. Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks , 2013, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Dong Yu,et al. Improved Bottleneck Features Using Pretrained Deep Neural Networks , 2011, INTERSPEECH.
[23] Navdeep Jaitly,et al. Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition , 2012, INTERSPEECH.
[24] Quoc V. Le,et al. Recurrent Neural Networks for Noise Reduction in Robust ASR , 2012, INTERSPEECH.
[25] Yves Grandvalet,et al. Noise Injection: Theoretical Prospects , 1997, Neural Computation.
[26] Kiyotoshi Matsuoka,et al. Noise injection into inputs in back-propagation learning , 1992, IEEE Trans. Syst. Man Cybern..
[27] Dong Wang,et al. Bottleneck features based on gammatone frequency cepstral coefficients , 2013, INTERSPEECH.
[28] Tara N. Sainath,et al. Auto-encoder bottleneck features using deep belief networks , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[29] Petr Motlícek,et al. Impact of deep MLP architecture on different acoustic modeling techniques for under-resourced speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[30] Khe Chai Sim,et al. Noise adaptive front-end normalization based on Vector Taylor Series for Deep Neural Networks in robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[31] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.
[32] Dong Yu,et al. Large vocabulary continuous speech recognition with context-dependent DBN-HMMS , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[33] Steve Renals,et al. Multi-level adaptive networks in tandem and hybrid ASR systems , 2013, ICASSP.
[34] Jinyu Li,et al. Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks. , 2013, ICLR 2013.
[35] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[36] R.J.F. Dow,et al. Neural net pruning-why and how , 1988, IEEE 1988 International Conference on Neural Networks.
[37] Christopher M. Bishop,et al. Current address: Microsoft Research, , 2022 .
[38] Robert J. Marks,et al. Similarities of error regularization, sigmoid gain scaling, target smoothing, and training with jitter , 1995, IEEE Trans. Neural Networks.
[39] Yongqiang Wang,et al. An investigation of deep neural networks for noise robust speech recognition , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.