论文信息 - Single complex sinusoid and ARHE model based pitch extractors

Single complex sinusoid and ARHE model based pitch extractors

In this paper we propose two techniques for the estimation of the fundamental frequency of speech signals. The rst technique is based on the Autoregressive Harmonic Excitation (ARHE) speech model. ARHE model consists of an autoregressive process driven simultaneously by white noise and a periodic excitation. The second technique is based on the estimation of a complex sinusoid in white Gaussian noise. It uses the Hilbert transform of the speech signal and the derivative of its phase function over the time. The derivative of the phase information is seen as a simple model of a moving average process driven by noise. The fundamental frequency is obtained by the minimum variance estimator of the model. The proposed methods have comparable performance to previous reported pitch detectors while they maintain their performance under noisy conditions.

Yannis Stylianou | Ilija Zeljkovic | Y. Stylianou | I. Zeljkovic

[1] Wolfgang Hess,et al. Pitch Determination of Speech Signals: Algorithms and Devices , 1983 .

[2] B. Noble. Applied Linear Algebra , 1969 .

[3] Yannis Stylianou,et al. Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification , 1996 .

[4] Sabine Van Huffel,et al. Total least squares problem - computational aspects and analysis , 1991, Frontiers in applied mathematics.

[5] Rangasami L. Kashyap,et al. Estimation of close sinusoids in colored noise and model discrimination , 1987, IEEE Trans. Acoust. Speech Signal Process..

[6] S. Seneff,et al. Real-time harmonic pitch detector , 1978 .

[7] Steven Kay,et al. A Fast and Accurate Single Frequency Estimator , 2022 .

[8] Wolfgang Hess,et al. Pitch Determination of Speech Signals , 1983 .

[9] Thomas F. Quatieri,et al. Pitch estimation and voicing detection based on a sinusoidal speech model , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10] David A. Krubsack,et al. A spectral autocorrelation method for measurement of the fundamental frequency of noise-corrupted speech , 1987, IEEE Trans. Acoust. Speech Signal Process..

[11] Andreas Spanias,et al. Cepstrum-based pitch detection using a new statistical V/UV classification algorithm , 1999, IEEE Trans. Speech Audio Process..