Wavelet based sub-band parameters for classification of unaspirated Hindi stop consonants in initial position of CV syllables

This paper proposes a new feature extraction technique using wavelet based sub-band parameters (WBSP) for classification of unaspirated Hindi stop consonants. The extracted acoustic parameters show marked deviation from the values reported for English and other languages, Hindi having distinguishing manner based features. Since acoustic parameters are difficult to be extracted automatically for speech recognition.Mel Frequency Cepstral Coefficient (MFCC) based features are usually used. MFCC are based on short time Fourier transform (STFT) which assumes the speech signal to be stationary over a short period. This assumption is specifically violated in case of stop consonants.In WBSP, from acoustic study, the features derived from CV syllables have different weighting factors with the middle segment having the maximum. The wavelet transform has been applied to splitting of signal into 8 sub-bands of different bandwidths and the variation of energy in different sub-bands is also taken into account. WBSP gives improved classification scores. The number of filters used (8) for feature extraction in WBSP is less compared to the number (24) used for MFCC. Its classification performance has been compared with four other techniques using linear classifier. Further, Principal components analysis (PCA) has also been applied to reduce dimensionality.

[1]  Rafael A. Calvo,et al.  Fast Dimensionality Reduction and Simple PCA , 1998, Intell. Data Anal..

[2]  Omar Farooq,et al.  Evaluation of a Wavelet Based ASR Front-End , 2007, Int. J. Wavelets Multiresolution Inf. Process..

[3]  Glenn E. Prescott,et al.  Wavelet Transform Speech Recognition Using Vector Quantization, Dynamic Time Warping And Artificial , 1994 .

[4]  Daniel P. W. Ellis,et al.  Frequency-domain linear prediction for temporal features , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[5]  Dietrich Klakow,et al.  Robustness of linear discriminant analysis in automatic speech recognition , 2002, Object recognition supported by user interaction for service robots.

[6]  Reinhold Huber-Mörk,et al.  Classification of coins using an eigenspace approach , 2005, Pattern Recognit. Lett..

[7]  Omar Farooq,et al.  Phoneme recognition using wavelet based features , 2003, Inf. Sci..

[8]  J. N. Gowdy,et al.  Feature extraction using discrete wavelet transform for speech recognition , 2000, Proceedings of the IEEE SoutheastCon 2000. 'Preparing for The New Millennium' (Cat. No.00CH37105).

[9]  Stéphane Mallat,et al.  A Wavelet Tour of Signal Processing, 2nd Edition , 1999 .

[10]  A. Posadas,et al.  Spatial‐temporal analysis of a seismic series using the principal components method: The Antequera Series, Spain, 1989 , 1993 .

[11]  Hai Jiang,et al.  Feature extraction using wavelet packets strategy , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[12]  Tony J. Dodd,et al.  Active Bayesian perception for angle and position discrimination with a biomimetic fingertip , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  S. Mallat A wavelet tour of signal processing , 1998 .

[14]  Omar Farooq,et al.  Mel filter-like admissible wavelet packet structure for speech recognition , 2001, IEEE Signal Processing Letters.

[15]  Ram Prakash sharma Recognition of Hindi stop consonants , 2008 .

[16]  李幼升,et al.  Ph , 1989 .

[17]  Alex Pentland,et al.  Face recognition using eigenfaces , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18]  Bayya Yegnanarayana,et al.  A constraint satisfaction model for recognition of stop consonant-vowel (SCV) utterances , 2002, IEEE Trans. Speech Audio Process..

[19]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[20]  David G. Stork,et al.  Pattern Classification , 1973 .

[21]  B. Juang,et al.  Selective feature extraction via signal decomposition , 1997, IEEE Signal Process. Lett..

[22]  Sungwook Chang,et al.  Speech feature extracted from adaptive wavelet for speech recognition , 1998 .

[23]  Keinosuke Fukunaga,et al.  Introduction to statistical pattern recognition (2nd ed.) , 1990 .

[24]  Zhang Xueying,et al.  The Speech Recognition Based on the Bark Wavelet and CZCPA Features , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[25]  L. J. P. van der Maaten,et al.  An Introduction to Dimensionality Reduction Using Matlab , 2007 .

[26]  Atiwong Suchato,et al.  Classification of stop consonant place of articulation , 2004 .

[27]  Sungyub Yoo,et al.  Relative energy and intelligibility of transient speech components , 2004, 2004 12th European Signal Processing Conference.