Discriminant wavelet basis construction for speech recognition

In this paper, a new feature extraction methodology based on Wavelet Transforms is examined, which unlike some conventional parameterisation techniques, is flexible enough to cope with the broadly differing characteristics of typical speech signals. A training phase is involved during which the final classifier is invoked to associate a cost function (a proxy for misclassification) with a given resolution. The sub spaces are then searched and pruned to provide a Wavelet Basis best suited to the classification problem. Comparative results are given illustrating some improvement over the Short-Time Fourier Transform using two differing subclasses of speech.

[1]  Naoki Saito,et al.  CLASSIFICATION OF GEOPHYSICAL ACOUSTIC WAVEFORMS USING TIME-FREQUENCY ATOMS , 1996 .

[2]  O. Rioul,et al.  Wavelets and signal processing , 1991, IEEE Signal Processing Magazine.

[3]  R. Coifman,et al.  Local feature extraction and its applications using a library of bases , 1994 .

[4]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[6]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[7]  C. d'Alessandro,et al.  Random wavelet representation of unvoiced speech , 1992, [1992] Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.

[8]  J.A. Thripuraneni,et al.  Mixed Malvar-wavelets for non-stationary signal representation , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Ronald R. Coifman,et al.  Entropy-based algorithms for best basis selection , 1992, IEEE Trans. Inf. Theory.

[10]  M. Wickerhauser Acoustic signal compression with wavelet packets , 1993 .