A computational model of spatial hearing

The human auditory system performs remarkably at determining the positions of sound sources in an acoustic environment. While the localization ability of humans has been studied and quanti ed, there are no existing models capable of explaining many of the phenomena associated with spatial hearing. This thesis describes a spatial hearing model intended to reproduce human localization ability in both azimuth and elevation for a single sound source in an anechoic environment. The model consists of a front end, which extracts useful localization cues from the signals received at the eardrums, and a probabilistic position estimator, which operates on the extracted cues. The front end is based upon human physiology, performing frequency analysis independently at the two ears and estimating interaural di erence cues from the resulting signals. The position estimator is based on the maximum-likelihood estimation technique. Several experiments designed to test the performance of the model are discussed, and the localization blur exhibited by the model is quanti ed. A \perceptual distance" metric is introduced, which allows direct localization comparisons between di erent stimuli. It is shown that the interaural intensity di erence (IID) contains su cient information, when considered as a function of frequency, to explain human localization performance in both azimuth and elevation, for freeeld broad-band stimuli. Thesis Supervisor: Barry L. Vercoe Title: Professor of Media Arts and Sciences Thesis Supervisor: Patrick M. Zurek Title: Principal Research Scientist Research Laboratory of Electronics

[1]  E. Shaw Transformation of sound pressure level from the free field to the eardrum in the horizontal plane. , 1974, The Journal of the Acoustical Society of America.

[2]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[3]  C Trahiotis,et al.  Across-frequency interaction in lateralization of complex binaural stimuli. , 1994, The Journal of the Acoustical Society of America.

[4]  C Trahiotis,et al.  Lateralization of bands of noise: effects of bandwidth and differences of interaural time and phase. , 1989, The Journal of the Acoustical Society of America.

[5]  N. Durlach,et al.  Interaural time and amplitude jnds for a 500-Hz tone. , 1969, The Journal of the Acoustical Society of America.

[6]  G. F. Kuhn Physical acoustics and measurements pertaining to directional hearing , 1983 .

[7]  F L Wightman,et al.  Headphone simulation of free-field listening. II: Psychophysical validation. , 1989, The Journal of the Acoustical Society of America.

[8]  M. Bodden Modeling human sound-source localization and the cocktail-party-effect , 1993 .

[9]  B. V. Van Veen,et al.  A spatial feature extraction and regularization model for the head-related transfer function. , 1995, The Journal of the Acoustical Society of America.

[10]  A. Dale Magoun,et al.  Decision, estimation and classification , 1989 .

[11]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[12]  D. M. Green,et al.  Sound localization by human listeners. , 1991, Annual review of psychology.

[13]  John Vanderkooy,et al.  Transfer-Function Measurement with Maximum-Length Sequences , 1989 .

[14]  Frederic L. Wightman,et al.  A New Approach to the Study of Human Sound Localization , 1987 .

[15]  H. Gaskell The precedence effect , 1983, Hearing Research.

[16]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[17]  E. C. Cherry,et al.  Mechanism of Binaural Fusion in the Hearing of Speech , 1957 .

[18]  R. Thouless Experimental Psychology , 1939, Nature.

[19]  P M Zurek,et al.  The precedence effect and its possible role in the avoidance of interaural ambiguities. , 1980, The Journal of the Acoustical Society of America.

[20]  W. Hartmann,et al.  Localization of sound in rooms, II: The effects of a single reflecting surface. , 1985, The Journal of the Acoustical Society of America.

[21]  Ewan A. Macpherson,et al.  A Computer Model of Binaural Localization for Stereo Imaging Measurement , 1989 .

[22]  Bill Gardner,et al.  HRTF Measurements of a KEMAR Dummy-Head Microphone , 1994 .

[23]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[24]  M. Cynader,et al.  A computational theory of spectral cue localization , 1993 .

[25]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[26]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[27]  Nathaniel I. Durlach,et al.  On the Externalization of Auditory Images , 1992, Presence: Teleoperators & Virtual Environments.

[28]  John Vanderkooy,et al.  Aspects of MLS Measuring Systems , 1994 .

[29]  A. Zeiberg,et al.  Lateralization of complex binaural stimuli: a weighted-image model. , 1988, The Journal of the Acoustical Society of America.

[30]  C. Lim,et al.  Estimating the azimuth and elevation of a sound source from the output of a cochlear model , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[31]  W. G. Gardner,et al.  HRTF measurements of a KEMAR , 1995 .

[32]  W. Gaik,et al.  Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[33]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. , 1986, The Journal of the Acoustical Society of America.

[34]  Richard M. Stern,et al.  The Role of Consistency of Interaural Timing Over Frequency in Binaural Lateralization , 1992 .

[35]  P M Zurek,et al.  A note on onset effects in binaural hearing. , 1993, The Journal of the Acoustical Society of America.

[36]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[37]  J. Pickles An Introduction to the Physiology of Hearing , 1982 .