A comparative performance study of seven pitch detection algorithms was conducted. A speech data base, consisting of eight utterances spoken by 3 males, 3 females, and 1 child was constructed. Telephone, close talking microphone, and wideband recordings were made of each of the utterances. For each of the utterances in the data base a "standard" pitch contour was semiautomaticallly measured using a highly sophisticated interactive pitch detection program. The "standard" pitch contour was then compared with the pitch contour that was obtained from each of the seven programmed pitch detectors. The algorithms used in this study were (1) a center clipping, infinite-peak clipping, modified autocorrelation method, (2) the cepstral method, (3) the SIFT method, (4) the parallel processing time domain method, (5) the data reduction method, (6) a spectral flattening LPC method, and (7) the AMDF method. A set of measurements was made on the pitch contours to quantify the various types of errors which occur in each of the above methods. Included among the error measurements were the average and standard deviation of the error in pitch period during voiced regions, the number of gross errors in the pitch period, and the number of voiced-unvoiced classification errors. For each of the error measurements, the individual pitch detectors could be rank ordered as a measure of their relative performance as a function of recording condition, and pitch range of the various speakers. Results are presented on rankings based on one category of errors.
[1]
N. Miller.
Pitch detection by data reduction
,
1975
.
[2]
A. Noll.
Cepstrum pitch determination.
,
1967,
The Journal of the Acoustical Society of America.
[3]
M. Sondhi,et al.
New methods of pitch extraction
,
1968
.
[4]
M. Ross,et al.
Average magnitude difference function pitch extractor
,
1974
.
[5]
J. Markel,et al.
The SIFT algorithm for fundamental frequency estimation
,
1972
.
[6]
Ronald W. Schafer,et al.
Real-time digital hardware pitch detector
,
1976
.
[7]
Aaron E. Rosenberg,et al.
A semiautomatic pitch detector (SAPD)
,
1975
.
[8]
Lawrence R. Rabiner,et al.
Applications of a nonlinear smoothing algorithm to speech processing
,
1975
.
[9]
Aaron E. Rosenberg,et al.
A comparative performance study of several pitch detection algorithms
,
1976
.
[10]
B Gold,et al.
Parallel processing techniques for estimating pitch periods of speech in the time domain.
,
1969,
The Journal of the Acoustical Society of America.
[11]
L. Rabiner,et al.
System for automatic formant analysis of voiced speech.
,
1970,
The Journal of the Acoustical Society of America.