Hybrid wavelet based LPC features for Hindi speech recognition

Hybrid features are presented for speech recognition that uses linear prediction in combination with multi-resolution capabilities of wavelet transform. Wavelet-Based Linear Prediction Coefficients (WBLPC) are obtained by applying 3 and 4-level wavelet decomposition and then having linear prediction of each sub-bands to get total 13 features. These features have been tested using a linear discriminant function and Hidden Markov Model (HMM) based classifier for speaker dependent and independent isolated Hindi digits recognition. 3-level WBLPC features gave higher percentage recognition than LPC features while 4-level WBLPC features using HMM gave the highest percentage recognition for both speaker dependent and independent cases.

[1]  Bhaskar D. Rao,et al.  Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[3]  Daniel P. W. Ellis,et al.  Frequency-domain linear prediction for temporal features , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[4]  Dietrich Klakow,et al.  Robustness of linear discriminant analysis in automatic speech recognition , 2002, Object recognition supported by user interaction for service robots.

[5]  J. N. Gowdy,et al.  Feature extraction using discrete wavelet transform for speech recognition , 2000, Proceedings of the IEEE SoutheastCon 2000. 'Preparing for The New Millennium' (Cat. No.00CH37105).

[6]  Sungwook Chang,et al.  Speech feature extracted from adaptive wavelet for speech recognition , 1998 .

[7]  Stphane Mallat,et al.  A Wavelet Tour of Signal Processing, Third Edition: The Sparse Way , 2008 .

[8]  Shung-Yung Lung Wavelet Feature Selection Using Fuzzy Approach to Text Independent Speaker Recognition , 2005, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[9]  Ashish Verma,et al.  A large-vocabulary continuous speech recognition system for Hindi , 2004, IBM J. Res. Dev..

[10]  Chungyong Lee,et al.  Optimizing feature extraction for speech recognition , 2003, IEEE Trans. Speech Audio Process..

[11]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[12]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[13]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.

[14]  Omar Farooq,et al.  Mel filter-like admissible wavelet packet structure for speech recognition , 2001, IEEE Signal Processing Letters.

[15]  S. Mallat A wavelet tour of signal processing , 1998 .

[16]  David G. Stork,et al.  Pattern Classification , 1973 .

[17]  B. Juang,et al.  Selective feature extraction via signal decomposition , 1997, IEEE Signal Process. Lett..