Optimization of filter-bank to improve the extraction of MFCC features in speech recognition

Mel-frequency cepstral coefficients (MFCC) have been demonstrated to perform very well under most conditions. However, some limited effort has been made to optimize the shape of the filters in the filter-bank using the conventional MFCC approach. This work develops several new approaches to designing the shapes of filters in the filter-bank. In these new approaches, principal component analysis (PCA) and linear discriminant analysis (LDA) are modified and then used to generate new filters. The experimental results reveal that the proposed approaches can improve the recognition performance of MFCC in noisy environments.

[1]  Lin-Shan Lee,et al.  Improved MFCC feature extraction by PCA-optimized filter-bank for speech recognition , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[2]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[3]  Biing-Hwang Juang,et al.  An application of discriminative feature extraction to filter-bank-based speech recognition , 2001, IEEE Trans. Speech Audio Process..

[4]  Mark J. F. Gales,et al.  Cepstral parameter compensation for HMM recognition in noise , 1993, Speech Commun..