Wavelet packets based features selection for voiceless plosives classification

There are contradictory reports on the usefulness of the wavelet packet transform (WPT) for feature extraction. This is mainly the case of signals of non-stationary character. In this paper we examine this tool for a category of short non-stationary speech signals, namely voiceless plosive consonants /p/, /t/, /k/. Three approaches to feature selection have been implemented: best basis search algorithm over the averaged wavelet packet coefficients of all data, local discriminant basis (LDB) algorithm, i.e. application of the best basis algorithm on the discriminant measure between coefficients in three classes and singular value decomposition (SVD) of the entropy matrices calculated from the wavelet packets for each class. The experiments conducted over the context independent plosives from speech database of Polish gave a classification rate higher for WPT based features than for traditional DFT based cepstral coefficients.

[1]  K. Stevens,et al.  REVISITING PLACE OF ARTICULATION MEASURES FOR STOP CONSONANTS : IMPLICATIONS FOR MODELS OF CONSONANT PRODUCTION , 1999 .

[2]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[3]  Stefan Grocholewski CORPORA - speech database for Polish diphones , 1997, EUROSPEECH.

[4]  Maryhelen Stevenson,et al.  Signal representation for classification of the transient myoelectric signal , 1998 .

[5]  Hidefumi Kobatake,et al.  Spectral transition dynamics of voiceless stop consonants , 1987 .

[6]  Wiktor Jassem Discriminant analysis of continuous consonantal spectra , 1993, EUROSPEECH.

[7]  Friedrich Jondral,et al.  Classification of transient time-varying signals using DFT and wavelet packet based methods , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[8]  R. Coifman,et al.  Local feature extraction and its applications using a library of bases , 1994 .

[9]  Michael Kiefte THE PERCEPTION OF SPECTRALLY REDUCED PREVOCALIC STOP CONSONANTS , 2000 .

[10]  Ewa Lukasik,et al.  Comparison of some time-frequency analysis methods for classification of plosives , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[11]  Sergios Theodoridis,et al.  Pattern Recognition , 1998, IEEE Trans. Neural Networks.

[12]  Keith Johnson,et al.  A CROSS-LINGUISTIC STUDY OF STOP PLACE PERCEPTION , 1999 .

[13]  Stefan Grocholewski Analysis of HMM models in alphabet letters recognition , 1999, EUROSPEECH.

[14]  Alan S. Willsky,et al.  A Wavelet Packet Approach to Transient Signal Classification , 1995 .