Vowels and musical notes produce complex repeating structures in the neural activity pattern (NAP) flowing from the cochlea. A number of groups have demonstrated that the pitch of these sounds can be extracted by autocorrelating the activity in the individual channels and constructing a multi-channel autocorrelogram (ACG) (e.g. Meddis and Hewitt, 1991; Slaney and Lyon, 1990). Recently, Cariani and Delgutte (1996) showed that the physiological equivalent of the ACG, a multi-channel, all-order interval histogram, is also an excellent predictor of pitch. Several authors have gone farther and argued that the autocorrelograms (ACGs) of speech and musical sounds could also explain vowel quality and musical timbre (e.g. Meddis and Hewitt, 1992). There is a problem, however; the structures produced by natural sounds in the NAP are highly asymmetric. Autocorrelation is symmetric in time and it converts asymmetric NAP structures into symmetric structures in the ACG. Patterson (1994b) and Akeroyd and Patterson (1995) have shown that we are highly sensitive to temporal asymmetry and they have argued that, for timbre analysis at least, autocorrelation (AC) should be replaced with a form of 'strobed' temporal integration (STI) which produces a similar representation but which preserves temporal asymmetry. Section 2 of this paper compares the asymmetry processing of AC to STI. Section 3, introduces a new form of STI that is more like what we might expect to find in the auditory system. It is based on the delta-gamma operator of Irino and Patterson (1996).
[1]
R. Patterson,et al.
Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform.
,
1995,
The Journal of the Acoustical Society of America.
[2]
R. Patterson,et al.
Complex Sounds and Auditory Images
,
1992
.
[3]
Ray Meddis,et al.
Virtual pitch and phase sensitivity of a computer model of the auditory periphery
,
1991
.
[4]
Roy D. Patterson,et al.
The sound of a sinusoid: Time‐interval models
,
1994
.
[5]
B. Delgutte,et al.
Neural correlates of the pitch of complex tones. I. Pitch and pitch salience.
,
1996,
Journal of neurophysiology.
[6]
R Meddis,et al.
Modeling the identification of concurrent vowels with different fundamental frequencies.
,
1992,
The Journal of the Acoustical Society of America.
[7]
Roy D. Patterson,et al.
The sound of a sinusoid: Spectral models
,
1994
.
[8]
T. Irino,et al.
Temporal asymmetry in the auditory system.
,
1996,
The Journal of the Acoustical Society of America.
[9]
Roy D. Patterson,et al.
Discrimination of wideband noises modulated by a temporally asymmetric function
,
1995
.
[10]
Richard F. Lyon,et al.
A perceptual pitch detector
,
1990,
International Conference on Acoustics, Speech, and Signal Processing.
[11]
W A Yost,et al.
A time domain description for the pitch strength of iterated rippled noise.
,
1996,
The Journal of the Acoustical Society of America.