The pitch of voiced speech sounds provides very important information in speech analysis. Pitch estimation is a difficult task when unprevented noise exists. However experimental results have shown that even robust pitch detection techniques fail in noisy environment with periodic patterns such as noise generated by machines. Wavelet transform, with its special properties in time frequency relation, can be used to detect pitch with remarkable advantage in noise resistance. In wavelet signal analysis, the modulus of the transform have been used extensively, however, we found that the phase information is equally important especially for pitch detection. Since the phase spectrum is always intensive to noise, a more promising pitch period can be obtained from the phase diagram. Properties of the phase pattern in wavelet transform are investigated and the result is applied to construct a robust pitch detector. In our first test, the detector is employed to detect the pitches of a set of speech signals with white noise. We found that our approach clearly outperforms other non-wavelet methods with low signal-to-noise ratio. Sinusoidal noise with different frequency levels is used in the second test. Simulation results have shown that our system works quite stable in such an environment.
[1]
Mitch Weintraub,et al.
Estimation of noise-corrupted speech DFT-spectrum using the pitch period
,
1994,
IEEE Trans. Speech Audio Process..
[2]
Stefan Hadjitodorov,et al.
Robust hybrid pitch detector
,
1993
.
[3]
斎藤 収三,et al.
Fundamentals of speech signal processing
,
1985
.
[4]
Shubha Kadambe,et al.
Application of the wavelet transform for pitch detection of speech signals
,
1992,
IEEE Trans. Inf. Theory.
[5]
Gloria Faye Boudreaux-Bartels,et al.
A comparison of a wavelet functions for pitch detection of speech signals
,
1991,
[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[6]
Peter Howell,et al.
Signals and Systems for Speech and Hearing
,
1991
.
[7]
C. Nadeu,et al.
Pitch determination using the cepstrum of the one-sided autocorrelation sequence
,
1991,
[Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.