Towards an Intelligent Acoustic Front End for Automatic Speech Recognition: Built-in Speaker Normalization
暂无分享,去创建一个
[1] Bruce R. Musicus. Fast MLM power spectrum estimation from uniformly spaced correlations , 1985, IEEE Trans. Acoust. Speech Signal Process..
[2] John H. L. Hansen,et al. Robust speech recognition in noise: an evaluation using the SPINE corpus , 2001, INTERSPEECH.
[3] S. Haykin,et al. Adaptive Filter Theory , 1986 .
[4] Thomas Niesler,et al. The 1998 HTK system for transcription of conversational telephone speech , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[6] Srinivasan Umesh,et al. A method for compensation of Jacobian in speaker normalization , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[7] Puming Zhan,et al. Speaker normalization based on frequency warping , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[8] William J. Byrne,et al. Speaker adaptation with all-pass transforms , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[9] Li Lee,et al. Speaker normalization using efficient frequency warping procedures , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[10] John H. L. Hansen,et al. "CU-move" : analysis & corpus development for interactive in-vehicle speech systems , 2001, INTERSPEECH.
[11] J. Makhoul,et al. Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.
[12] John H. L. Hansen,et al. A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition , 2008, Speech Commun..
[13] Melvyn J. Hunt,et al. Spectral Signal Processing for ASR , 2007 .
[14] Kadri Hacioglu,et al. Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[15] Alan V. Oppenheim,et al. Discrete-Time Signal Pro-cessing , 1989 .
[16] John H. L. Hansen,et al. Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition , 1996, Speech Commun..
[17] John H. L. Hansen,et al. Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effect , 1994, IEEE Trans. Speech Audio Process..
[18] Alejandro Acero,et al. Acoustical and environmental robustness in automatic speech recognition , 1991 .
[19] Bhaskar D. Rao,et al. All-pole modeling of speech based on the minimum variance distortionless response spectrum , 2000, Conference Record of the Thirty-First Asilomar Conference on Signals, Systems and Computers (Cat. No.97CB36136).
[20] John H. L. Hansen,et al. CU-Move: Advanced In-Vehicle Speech Systems for Route Navigation , 2005 .
[21] Keiichi Tokuda,et al. Mel-generalized cepstral analysis - a unified approach to speech spectral estimation , 1994, ICSLP.
[22] Herbert Gish,et al. A parametric approach to vocal tract length normalization , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[23] John H. L. Hansen,et al. A comparative study of traditional and newly proposed features for recognition of speech under stress , 2000, IEEE Trans. Speech Audio Process..
[24] Reinhold Häb-Umbach. Investigations on inter-speaker variability in the feature space , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[25] Hermann Ney,et al. Vocal tract normalization equals linear transformation in cepstral space , 2001, IEEE Transactions on Speech and Audio Processing.
[26] Julius O. Smith,et al. Bark and ERB bilinear transforms , 1999, IEEE Trans. Speech Audio Process..
[27] John H. L. Hansen,et al. Perceptual MVDR-based cepstral coefficients (PMCCs) for high accuracy speech recognition , 2003, INTERSPEECH.
[28] Li Lee,et al. A frequency warping approach to speaker normalization , 1998, IEEE Trans. Speech Audio Process..
[29] John H. L. Hansen,et al. A new perspective on feature extraction for robust in-vehicle speech recognition , 2003, INTERSPEECH.
[30] Julius O. Smith,et al. Signal modeling for robust speech recognition with frequency warping and convex optimization , 2000 .