Speech pitch detection in noisy environment using multi-rate adaptive lossless FIR filters

In this paper a new approach to pitch detection for spontaneous speech in noisy environment is proposed. It uses multi-rate lossless FIR filter to build a model which connects acoustic tubes and wavelet transform. An "on-line" eigen-structure algorithm is used to control the parameters of the model based on the state description of the physical system and input speech signal. The pitch periods are determined by identifying the "events" from the trajectories of modeling parameters. The simulations illustrate that the proposed method features higher accuracy, more robustness to noise and the capacity of "on-line" pitch detection for synthetic and natural speech signal in comparison with other existing wavelet-based pitch detection algorithms.

[1]  Douglas D. O'Shaughnessy,et al.  Automatic and reliable estimation of glottal closure instant and period , 1989, IEEE Trans. Acoust. Speech Signal Process..

[2]  C. Wendt,et al.  Pitch determination and speech segmentation using the discrete wavelet transform , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[3]  Shubha Kadambe,et al.  Application of the wavelet transform for pitch detection of speech signals , 1992, IEEE Trans. Inf. Theory.

[4]  Wolfgang Hess,et al.  Pitch Determination of Speech Signals , 1983 .

[5]  A. Gray,et al.  Least squares glottal inverse filtering from the acoustic speech waveform , 1979 .

[6]  Gloria Faye Boudreaux-Bartels,et al.  A comparison of a wavelet functions for pitch detection of speech signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[7]  Phillip A. Regalia,et al.  Attainable error bounds in multirate adaptive lossless FIR filters , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8]  John Nicholas Holmes,et al.  Speech synthesis , 1972 .