Nonlinear techniques for parameter extraction from quasi-continuous wavelet transform with application to speech

Speaker identification and word spotting will shortly play a key role in a lot of different fields. This paper presents an approach, based on the wavelet transform, to extract features from a speech signal. These features are based on the `modulation model'. An adequate choice of the extracted features dramatically increases the efficiency of the classification performed on the different speakers or on the different words.

[1]  Biing-Hwang Juang,et al.  A vector quantization approach to speaker recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Richard Kronland-Martinet,et al.  Asymptotic wavelet and Gabor analysis: Extraction of instantaneous frequencies , 1992, IEEE Trans. Inf. Theory.

[3]  D. Friedman,et al.  Instantaneous-frequency distribution vs. time: An interpretation of the phase structure of speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Richard J. Mammone,et al.  Growing and Pruning Neural Tree Networks , 1993, IEEE Trans. Computers.

[6]  古井 貞煕,et al.  Digital speech processing, synthesis, and recognition , 1989 .