Applying wavelet analysis to speech segmentation and classification

We propose a hearing aid design based on the wavelet transform. The fast wavelet transform is used to decompose speech into separate frequency components. This paper describes the difficulties of using wavelet transforms for speech processing and shows how careful selection of wavelet coefficients allows the four major categories of speech (voiced speech, plosives, fricatives, and silence) to be identified. Given these four categories, we show how speech can be segmented easily and effectively.
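As an illustration of the decomposition step only, the following is a minimal sketch of a fast wavelet transform applied to one frame of samples. It assumes the Haar wavelet, a three-level decomposition, and a per-band energy measure; none of these is stated in the abstract, and the function names (`haar_step`, `haar_fwt`, `band_energies`, `dominant_band`) are hypothetical. It is not the paper's coefficient-selection or classification method.

```python
import math


def haar_step(signal):
    """One level of the Haar fast wavelet transform.

    Returns (approximation, detail), each half the length of the input.
    Assumes an even-length input; a trailing odd sample would be ignored.
    """
    s = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * s for i in pairs]
    return approx, detail


def haar_fwt(signal, levels):
    """Multi-level decomposition.

    Returns [detail_1 (finest), ..., detail_levels, approx_levels], so
    index 0 is the highest-frequency band and the last index the lowest.
    """
    bands = []
    approx = list(signal)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        bands.append(detail)
    bands.append(approx)
    return bands


def band_energies(bands):
    """Sum of squared coefficients in each band."""
    return [sum(c * c for c in band) for band in bands]


def dominant_band(signal, levels=3):
    """Index of the band holding the most energy (illustrative feature only)."""
    energies = band_energies(haar_fwt(signal, levels))
    return max(range(len(energies)), key=energies.__getitem__)
```

With this sketch, a 64-sample frame that alternates between +1 and -1 places all of its energy in the finest detail band (index 0), while a single slow sine cycle across the same frame concentrates its energy in the coarsest approximation band. How such band energies would map onto voiced speech, plosives, fricatives, and silence is left to the paper itself.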
