A Model of Loudness Applicable to Time-Varying Sounds

Previously we described a model for calculating the loudness of steady sounds from their spectrum. Here a new version of the model is presented, which uses a waveform as its input. The stages of the model are as follows. (a) A finite impulse response filter representing transfer through the outer and middle ear. (b) Calculation of the short-term spectrum using the fast Fourier transform (FFT). To give adequate spectral resolution at low frequencies, combined with adequate temporal resolution at high frequencies, six FFTs are calculated in parallel, using longer signal segments for low frequencies and shorter segments for higher frequencies. (c) Calculation of an excitation pattern from the physical spectrum. (d) Transformation of the excitation pattern to a specific loudness pattern. (e) Determination of the area under the specific loudness pattern. This gives a value for the instantaneous loudness. The short-term perceived loudness is calculated from the instantaneous loudness using an averaging mechanism similar to an automatic gain control system, with attack and release times. Finally the overall loudness impression is calculated from the short-term loudness using a similar averaging mechanism, but with longer attack and release times. The new model gives very similar predictions to our earlier model for steady sounds. In addition, it can predict the loudness of brief sounds as a function of duration and the overall loudness of sounds that are amplitude modulated at various rates.

[1]  Harvey Fletcher,et al.  Relation Between Loudness and Masking , 1937 .

[2]  B. Scharf Complex sounds and critical bands. , 1961, Psychological bulletin.

[3]  Eberhard Zwicker,et al.  Temporal Effects in Simultaneous Masking by White‐Noise Bursts , 1965 .

[4]  E. Zwicker,et al.  A MODEL OF LOUDNESS SUMMATION. , 1965, Psychological review.

[5]  H. J. M. Steeneken,et al.  Place Dependence of Timbre in Reverberant Sound Fields , 1973 .

[6]  E. Shaw Transformation of sound pressure level from the free field to the eardrum in the horizontal plane. , 1974, The Journal of the Acoustical Society of America.

[7]  Hugo Fastl Loudness and masking patterns of narrow noise bands , 1975 .

[8]  E. Zwicker Procedure for calculating loudnesss of temporally variable sounds. , 1977, The Journal of the Acoustical Society of America.

[9]  C Elberling,et al.  Loudness summation across frequency under masking and in sensorineural hearing loss. , 1980, Audiology : official organ of the International Society of Audiology.

[10]  W Jesteadt,et al.  An adaptive procedure for subjective judgments , 1980, Perception & psychophysics.

[11]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[12]  Hugo Fastl,et al.  BASIC-Program for calculating the loudness of sounds from their 1/3-oct. band spectra according to ISO 532 B , 1984 .

[13]  R. Hellman,et al.  Perceived magnitude of two-tone-noise complexes: loudness, annoyance, and noisiness. , 1985, The Journal of the Acoustical Society of America.

[14]  Brian C. J. Moore,et al.  Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns , 1987, Hearing Research.

[15]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[16]  C Giguère,et al.  A computational model of the auditory periphery for speech and hearing research. I. Ascending path. , 1994, The Journal of the Acoustical Society of America.

[17]  A Kohlrausch,et al.  Phase effects in masking related to dispersion in the inner ear. II. Masking period patterns of short targets. , 1995, The Journal of the Acoustical Society of America.

[18]  S Buus,et al.  Temporal integration of loudness as a function of level. , 1995, The Journal of the Acoustical Society of America.

[19]  L. Robles,et al.  Basilar-membrane responses to tones at the base of the chinchilla cochlea. , 1997, The Journal of the Acoustical Society of America.

[20]  F. Zeng,et al.  Loudness of dynamic stimuli in acoustic and electric hearing. , 1997, The Journal of the Acoustical Society of America.

[21]  Thomas Baer,et al.  A model for the prediction of thresholds, loudness, and partial loudness , 1997 .

[22]  S Buus,et al.  Temporal integration of loudness, loudness discrimination, and the form of the loudness function. , 1997, The Journal of the Acoustical Society of America.

[23]  R. Carlyon,et al.  Excitation produced by Schroeder-phase complexes: evidence for fast-acting compression in the auditory system. , 1997, The Journal of the Acoustical Society of America.

[24]  Influence of individual listener, measurement room and choice of test-tone levels on the shape of equal-loudness level contours , 1997 .

[25]  Thomas Baer,et al.  Loudness of modulated sounds as a function of modulation rate, modulation depth, modulation waveform and overall level , 1998 .

[26]  S Rosen,et al.  Auditory filter nonlinearity at 2 kHz in normal hearing listeners. , 1998, The Journal of the Acoustical Society of America.

[27]  Richard J. Baker,et al.  An efficient characterisation of human auditory filtering across level and frequency that is physiologically reasonable , 1998 .

[28]  B C Moore,et al.  Factors affecting the loudness of modulated sounds. , 1999, The Journal of the Acoustical Society of America.

[29]  W. S. Rhode,et al.  Basilar membrane responses to broadband stimuli. , 2000, The Journal of the Acoustical Society of America.

[30]  B. Moore,et al.  Frequency selectivity as a function of level and frequency measured with uniformly exciting notched noise. , 2000, The Journal of the Acoustical Society of America.

[31]  Kohlrausch,et al.  The influence of carrier level and frequency on modulation and beat-detection thresholds for sinusoidal carriers , 2000, The Journal of the Acoustical Society of America.

[32]  E. Lopez-Poveda,et al.  A computational algorithm for computing nonlinear auditory frequency selectivity. , 2001, The Journal of the Acoustical Society of America.

[33]  L. Carney,et al.  A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression. , 2001, The Journal of the Acoustical Society of America.

[34]  T. Irino,et al.  A compressive gammachirp auditory filter for both physiological and psychophysical data. , 2001, The Journal of the Acoustical Society of America.

[35]  B C Moore,et al.  Temporal modulation transfer functions obtained using sinusoidal carriers with normally hearing and hearing-impaired listeners. , 2001, The Journal of the Acoustical Society of America.

[36]  A. Oxenham,et al.  Forward masking: adaptation or integration? , 2001, The Journal of the Acoustical Society of America.