Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM
暂无分享,去创建一个
Yifan Gong | Jinyu Li | Dong Yu | Jui-Ting Huang | Dong Yu | Jinyu Li | J. Huang | Y. Gong
[1] Richard M. Stern,et al. Sources of degradation of speech recognition in the telephone network , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[2] Douglas D. O'Shaughnessy,et al. Statistical recovery of wideband speech from narrowband speech , 1992, IEEE Trans. Speech Audio Process..
[3] Peter Jax,et al. Wideband extension of telephone speech using a hidden Markov model , 2000, 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421).
[4] Hyung Soon Kim,et al. Narrowband to wideband conversion of speech using GMM based transformation , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[5] Alex Acero,et al. Spoken Language Processing , 2001 .
[6] Geoffrey Zweig,et al. fMPE: discriminatively trained features for speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[7] Alex Acero,et al. Robust bandwidth extension of noise-corrupted narrowband speech , 2005, INTERSPEECH.
[8] Geoffrey Zweig,et al. Automated directory assistance system - from theory to practice , 2007, INTERSPEECH.
[9] Alex Acero,et al. Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[10] Brian Kingsbury,et al. Boosted MMI for model and feature-space discriminative training , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[11] Brian Kingsbury,et al. Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[12] Dong Yu,et al. Investigation of full-sequence training of deep belief networks for speech recognition , 2010, INTERSPEECH.
[13] Dong Yu,et al. Roles of Pre-Training and Fine-Tuning in Context-Dependent DBN-HMMs for Real-World Speech Recognition , 2010 .
[14] Henrique S. Malvar,et al. Joint encoding of the waveform and speech recognition features using a transform codec , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Dong Yu,et al. Conversational Speech Transcription Using Context-Dependent Deep Neural Networks , 2012, ICML.
[16] Tara N. Sainath,et al. Making Deep Belief Networks effective for large vocabulary continuous speech recognition , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[17] Vincent Vanhoucke,et al. Improving the speed of neural networks on CPUs , 2011 .
[18] Geoffrey E. Hinton,et al. Understanding how Deep Belief Networks perform acoustic modelling , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[19] Dong Yu,et al. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition , 2012, IEEE Transactions on Audio, Speech, and Language Processing.
[20] Navdeep Jaitly,et al. Application of Pretrained Deep Neural Networks to Large Vocabulary Conversational Speech Recognition , 2012 .