Adaptive compensation for robust speech recognition

Adaptation and compensation are two commonly adopted strategies to improve the robustness of speech recognition systems, especially in those cases when the testing data do not resemble the training data. In many ways, adaptation and compensation share similar goals and should be considered as a unified strategy for robust speech recognition. In this paper, we discuss adaptive compensation in which the compensation is accomplished through adaptive learning from the given testing data. Two major classes of adaptive compensation techniques can be considered, namely: (1) adaptive feature and model compensation, in which recognition features and/or model parameters are modified as needed; and (2) adaptive classifier compensation, in which the classifier structure and the corresponding parameters are modified as needed. We address the capabilities and limitations of these approaches.

[1]  Mark J. F. Gales,et al.  Robust speech recognition in additive and convolutional noise using parallel model combination , 1995, Comput. Speech Lang..

[2]  Chin-Hui Lee,et al.  A maximum-likelihood approach to stochastic matching for robust speech recognition , 1996, IEEE Trans. Speech Audio Process..

[3]  Chin-Hui Lee,et al.  Bayesian Adaptive Learning and Map Estimation of HMM , 1996 .

[4]  Jen-Tzung Chien,et al.  A hybrid algorithm for speaker adaptation using MAP transformation and adaptation , 1997 .

[5]  Alejandro Acero,et al.  Acoustical and environmental robustness in automatic speech recognition , 1991 .

[6]  Chin-Hui Lee,et al.  Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[7]  Kuldip K. Paliwal,et al.  Automatic Speech and Speaker Recognition , 1996 .

[8]  Douglas A. Reynolds,et al.  Integrated models of signal and background with application to speaker identification in noise , 1994, IEEE Trans. Speech Audio Process..

[9]  Chin-Hui Lee,et al.  A minimax classification approach with application to robust speech recognition , 1993, IEEE Trans. Speech Audio Process..

[10]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[11]  Vassilios Digalakis,et al.  Speaker adaptation using combined transformation and Bayesian methods , 1996, IEEE Trans. Speech Audio Process..