A spectral autocorrelation method for measurement of the fundamental frequency of noise-corrupted speech

A method for measurement of the fundamental frequency of a voiced speech signal corrupted by high levels of additive white Gaussian noise is described. The method is based on flattening the spectrum of the signal by a bank of bandpass lifters and extracting the pitch frequency from autocorrelation functions calculated at the output of the lifters. A smoothing modified median filter is applied to the calculated pitch frequency contour to result in an improvement in the accuracy of the method. A byproduct of the pitch tracker is a voiced/ unvoiced classifier. The maximum and the variance of the autocorrelation function maxima, over the bank of lifters, serve as the basis for voiced/unvoiced classification by making use of a two-dimensional, nearest-neighbor pattern recognition approach. Results are presented for fundamental frequency measurement and voiced/unvoiced classification for several signal-to-noise ratios.

[1]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[2]  J. Warren A pattern classification technique for speech recognition , 1971 .

[3]  Leah J. Siegel,et al.  A decision tree procedure for voiced/Unvoiced/Mixed excitation classification of speech , 1980, ICASSP.

[4]  Jack J.Q. Chang Digital Computer Generation of White Gaussian Noise , 1986 .

[5]  A. Noll Cepstrum pitch determination. , 1967, The Journal of the Acoustical Society of America.

[6]  R.W. Schafer,et al.  Digital representations of speech signals , 1975, Proceedings of the IEEE.

[7]  L. Siegel,et al.  Voiced/Unvoiced/Mixed excitation classification of speech , 1982 .

[8]  Kuldip K. Paliwal,et al.  A synthesis-based method for pitch extraction , 1983, Speech Commun..

[9]  M. Sondhi,et al.  New methods of pitch extraction , 1968 .

[10]  B. Cox,et al.  Nonparametric rank-order statistics applied to robust voiced-unvoiced-silence classification , 1980 .

[11]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[12]  V. Ramamoorthy Voice/Unvoice detection based on a composite-Gaussian source model of speech , 1980, ICASSP.

[13]  T. V. Sreenivas,et al.  Pitch extraction from corrupted harmonics of the power spectrum , 1979 .

[14]  Lawrence R. Rabiner,et al.  A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition , 1976 .

[15]  Aaron E. Rosenberg,et al.  A semiautomatic pitch detector (SAPD) , 1975 .

[16]  L. F. Willems,et al.  Measurement of pitch in speech: an implementation of Goldstein's theory of pitch perception. , 1982, The Journal of the Acoustical Society of America.

[17]  T. Parks,et al.  Maximum likelihood pitch estimation , 1976 .

[18]  Meir Lahat FEATURE EXTRACTION FROM NOISY SPEECH SIGNALS , 1983 .

[19]  R L Miller Performance characteristics of an experimental harmonic identification pitch extraction (HIPEX) system. , 1970, The Journal of the Acoustical Society of America.

[20]  Lawrence R. Rabiner,et al.  Applications of a nonlinear smoothing algorithm to speech processing , 1975 .

[21]  A. Noll Short‐Time Spectrum and “Cepstrum” Techniques for Vocal‐Pitch Detection , 1964 .

[22]  L. Siegel A procedure for using pattern classification techniques to obtain a voiced/Unvoiced classifier , 1979 .

[23]  Shih-Chien Yang,et al.  A pitch extraction algorithm based on LPC inverse filtering and AMDF , 1977 .

[24]  Lawrence R. Rabiner,et al.  On the use of autocorrelation analysis for pitch detection , 1977 .

[25]  S. Seneff,et al.  Real-time harmonic pitch detector , 1978 .

[26]  M. Schroeder Period histogram and product spectrum: new methods for fundamental-frequency measurement. , 1968, The Journal of the Acoustical Society of America.

[27]  J. Moorer,et al.  The optimum comb method of pitch period analysis of continuous digitized speech , 1974 .

[28]  V. Sarma,et al.  Studies on pattern recognition approach to voiced-unvoiced-silence classification , 1978, ICASSP.