Simulation of auditory analysis of pitch: an elaboration on the DWS pitch meter.

A model was developed for estimating the pitch of complex sounds that are partially masked by background sound. Our ultimate aim is to obtain a model that can separate two simultaneous sounds on the basis of the harmonic structure of at least one of the sounds. The MDWS model is an extension of the Duifhuis, Willems, and Sluyter pitch meter (DWS) [J. Acoust. Soc. Am. 71, 1568-1580 (1982)] which is a practical implementation of Goldstein's optimum processor theory of pitch perception [J. Acoust. Soc. Am. 54, 1496-1516 (1973)]. The main modifications incorporated in MDWS consist of a more faithful modeling of auditory frequency analysis and of an alteration to the criterion used to decide which fundamental best fits a set of resolved components. Effects of the latter modification were investigated in a comparison between model estimates of the pitch of inharmonic complex signals and results obtained for humans. Furthermore, the accuracy of model estimates of the pitch of periodic signals (among which were synthesized vowel sounds), partially masked by noise, was compared with the just noticeable difference of fundamental frequency of these sounds for human observers. The results of these two tests show that the model estimates come close to human perception.

[1]  E. C. Cherry Some Experiments on the Recognition of Speech, with One and with Two Ears , 1953 .

[2]  R. Ritsma Existence Region of the Tonal Residue. I , 1962 .

[3]  B. L. Cardozo,et al.  Pitch of the Residue , 1962 .

[4]  G. Brink Detection of Tone Pulse of Various Durations in Noise of Various Bandwidths , 1964 .

[5]  A. M. Mimpen,et al.  The ear as a frequency analyzer. II. , 1964, The Journal of the Acoustical Society of America.

[6]  S. S. Stevens Frequency Analysis and Periodicity Detection in Hearing. , 1972 .

[7]  F. Wightman The pattern-transformation model of pitch. , 1973, The Journal of the Acoustical Society of America.

[8]  J. L. Goldstein An optimum processor theory for the central formation of the pitch of complex tones. , 1973, The Journal of the Acoustical Society of America.

[9]  B. Moore Frequency difference limens for short-duration tones. , 1973, The Journal of the Acoustical Society of America.

[10]  E. Terhardt Pitch, consonance, and harmony. , 1974, The Journal of the Acoustical Society of America.

[11]  R. Patterson Auditory filter shapes derived with noise stimuli. , 1976, The Journal of the Acoustical Society of America.

[12]  D. M. Green,et al.  Frequency discrimination as a function of frequency and sensation level. , 1977, The Journal of the Acoustical Society of America.

[13]  J. L. Goldstein,et al.  Evidence for a general template in central optimal processing for pitch of complex tones. , 1978, The Journal of the Acoustical Society of America.

[14]  F. Harris On the use of windows for harmonic analysis with the discrete Fourier transform , 1978, Proceedings of the IEEE.

[15]  Leonardus Franciscus Willems,et al.  Pitch in speech: A hearing theory approach , 1979 .

[16]  Ernst Terhardt,et al.  Calculating virtual pitch , 1979, Hearing Research.

[17]  Age Hoekstra Frequency discrimination and frequency analysis in hearing , 1979 .

[18]  Ernst Terhardt Toward Understanding Pitch Perception: Problems, Concepts, and Solutions , 1980 .

[19]  R. Patterson,et al.  Off-frequency listening and auditory-filter asymmetry. , 1980, The Journal of the Acoustical Society of America.

[20]  E. Terhardt,et al.  Algorithm for extraction of pitch and pitch salience from complex tonal signals , 1982 .

[21]  L. F. Willems,et al.  Measurement of pitch in speech: an implementation of Goldstein's theory of pitch perception. , 1982, The Journal of the Acoustical Society of America.

[22]  S. G. Nooteboom,et al.  Intonation and the perceptual separation of simultaneous voices , 1982 .

[23]  M. Scheffers,et al.  Discrimination of fundamental frequency of synthesized vowel sounds in a noise background. , 1984, The Journal of the Acoustical Society of America.