A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
暂无分享,去创建一个
Yifan Gong | Jinyu Li | Dong Yu | Li Deng | Alex Acero | L. Deng | Dong Yu | Jinyu Li | Y. Gong | A. Acero
[1] Li Deng,et al. Roles of high-fidelity acoustic modeling in robust speech recognition , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).
[2] Mark J. F. Gales,et al. Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[3] Yifan Gong,et al. High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).
[4] Yu Hu,et al. Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions , 2007, INTERSPEECH.
[5] Hank Liao,et al. Joint uncertainty decoding for robust large vocabulary speech recognition , 2006 .
[6] Antonio M. Peinado. Speech Recognition Over Digital Channels: Robustness and Standards , 2006 .
[7] Abeer Alwan,et al. Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR , 2005, IEEE Transactions on Speech and Audio Processing.
[8] Y. Gong. A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition , 2005, IEEE Transactions on Speech and Audio Processing.
[9] Geoffrey Zweig,et al. fMPE: discriminatively trained features for speech recognition , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[10] Li Deng,et al. Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise , 2004, IEEE Transactions on Speech and Audio Processing.
[11] S. Molau,et al. Feature space normalization in adverse acoustic conditions , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[12] Denis Jouvet,et al. Evaluation of a noise-robust DSR front-end on Aurora databases , 2002, INTERSPEECH.
[13] Hakan Erdogan,et al. Incremental on-line feature space MLLR adaptation for telephony speech recognition , 2002, INTERSPEECH.
[14] Mukund Padmanabhan,et al. Maximum likelihood non-linear transformation for environment adaptation in speech recognition , 2001, INTERSPEECH.
[15] Geoffrey Zweig,et al. Linear feature space projections for speaker adaptation , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[16] Yan Ming Cheng,et al. SNR-dependent waveform processing for improving the robustness of ASR front-end , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).
[17] George Saon,et al. Robust digit recognition in noisy environments: the IBM Aurora 2 system , 2001, INTERSPEECH.
[18] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[19] Li Deng,et al. HMM adaptation using vector taylor series for noisy speech recognition , 2000, INTERSPEECH.
[20] Li Deng,et al. Large-vocabulary speech recognition under adverse acoustic environments , 2000, INTERSPEECH.
[21] Anshu Agarwal,et al. TWO-STAGE MEL-WARPED WIENER FILTER FOR ROBUST SPEECH RECOGNITION , 1999 .
[22] Laurent Mauuary,et al. Blind equalization in the cepstral domain for robust telephone based speech recognition , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).
[23] Chin-Hui Lee,et al. On stochastic feature and model compensation approaches to robust speech recognition , 1998, Speech Commun..
[24] Mark J. F. Gales,et al. Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..
[25] Chong Kwan Un,et al. Speech recognition in noisy environments using first-order vector Taylor series , 1998, Speech Commun..
[26] Biing-Hwang Juang,et al. Signal bias removal by maximum likelihood estimation for robust telephone speech recognition , 1996, IEEE Trans. Speech Audio Process..
[27] Michael Picheny,et al. Robust speech recognition in noise --- performance of the IBM continuous speech recogniser on the ARPA noise spoke task , 1995 .
[28] Mark J. F. Gales,et al. Model-based techniques for noise robust speech recognition , 1995 .
[29] Philip C. Woodland,et al. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..
[30] Yifan Gong,et al. Speech recognition in noisy environments: A survey , 1995, Speech Commun..
[31] Hynek Hermansky,et al. RASTA processing of speech , 1994, IEEE Trans. Speech Audio Process..
[32] Saeed Vaseghi,et al. Speech recognition in noisy environments , 1992, ICSLP.
[33] Mark J. F. Gales,et al. An improved approach to the hidden Markov model decomposition of speech and noise , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[34] Alejandro Acero,et al. Acoustical and environmental robustness in automatic speech recognition , 1991 .
[35] E. A. Martin,et al. Multi-style training for robust isolated-word speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[36] Richard P. Lippmann,et al. Two-stage discriminant analysis for improved isolated-word recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[37] Roger K. Moore. Computer Speech and Language , 1986 .
[38] S. Boll,et al. Suppression of acoustic noise in speech using spectral subtraction , 1979 .
[39] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[40] B. Atal. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.