Neural system identification model of human sound localization.

This paper examines the role of biological constraints in the human auditory localization process. A psychophysical and neural system modeling approach was undertaken in which performance comparisons between competing models and a human subject explore the relevant biologically plausible "realism constraints." The directional acoustical cues, upon which sound localization is based, were derived from the human subject's head-related transfer functions (HRTFs). Sound stimuli were generated by convolving bandpass noise with the HRTFs and were presented to both the subject and the model. The input stimuli to the model were processed using the Auditory Image Model of cochlear processing. The cochlear data were then analyzed by a time-delay neural network which integrated temporal and spectral information to determine the spatial location of the sound source. The combined cochlear model and neural network provided a system model of the sound localization process. Aspects of humanlike localization performance were qualitatively achieved for broadband and bandpass stimuli when the model architecture incorporated frequency division (i.e., the progressive integration of information across the different frequency channels) and was trained using variable bandwidth and center-frequency sounds. Results indicate that both issues are relevant to human sound localization performance.

[1]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[2]  Henrik Møller Fundamentals of binaural technology , 1991 .

[3]  Simon Carlile,et al.  Generation and Applications of Virtual Auditory Space , 1996 .

[4]  V. Mellert,et al.  Transformation characteristics of the external human ear. , 1977, The Journal of the Acoustical Society of America.

[5]  Simon Carlile,et al.  Virtual Auditory Space: Generation and Applications , 2013, Neuroscience Intelligence Unit.

[6]  Pavel Zahorik,et al.  The fidelity of virtual auditory displays. , 1996 .

[7]  J. C. Middlebrooks,et al.  A neural code for auditory space in the cat's superior colliculus , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[8]  S. Carlile The auditory periphery of the ferret. II: The spectral transformations of the external ear and their implications for sound localization. , 1990, The Journal of the Acoustical Society of America.

[9]  R. Duda,et al.  Combined monaural and binaural localization of sound sources , 1995, Conference Record of The Twenty-Ninth Asilomar Conference on Signals, Systems and Computers.

[10]  T. Sejnowski,et al.  Report to the National Science Foundation: WORKSHOP ON NEUROMORPHIC ENGINEERING , 1998 .

[11]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[12]  R. Patterson,et al.  Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. , 1995, The Journal of the Acoustical Society of America.

[13]  R. Butler,et al.  Factors that influence the localization of sound in the vertical plane. , 1968, The Journal of the Acoustical Society of America.

[14]  Richard F. Lyon A computational model of binaural localization and separation , 1983, ICASSP.

[15]  J. C. Middlebrooks Narrow-band sound localization related to external ear acoustics. , 1992, The Journal of the Acoustical Society of America.

[16]  H S Colburn,et al.  Model for auditory localization. , 1976, The Journal of the Acoustical Society of America.

[17]  P M Hofman,et al.  Spectro-temporal factors in two-dimensional human sound localization. , 1998, The Journal of the Acoustical Society of America.

[18]  Nathaniel I. Durlach,et al.  On the Externalization of Auditory Images , 1992, Presence: Teleoperators & Virtual Environments.

[19]  A. Moiseff,et al.  An artificial neural network for sound localization using binaural cues. , 1996, The Journal of the Acoustical Society of America.

[20]  Robert A. Butler,et al.  Binaural localization: Influence of stimulus frequency and the linkage to covert peak areas , 1993, Hearing Research.

[21]  Simon Carlile,et al.  The nature and distribution of errors in sound localization by human listeners , 1997, Hearing Research.

[22]  L D Braida,et al.  Binaural pinna disparity: another auditory localization cue. , 1975, The Journal of the Acoustical Society of America.

[23]  Michael Friis Sørensen,et al.  Head-Related Transfer Functions of Human Subjects , 1995 .

[24]  D. M. Green,et al.  Directional dependence of interaural envelope delays. , 1990, The Journal of the Acoustical Society of America.

[25]  W. Gaik,et al.  Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[26]  D M Green,et al.  Observations on a principal components analysis of head-related transfer functions. , 1992, The Journal of the Acoustical Society of America.

[27]  D. Irvine Physiology of the Auditory Brainstem , 1992 .

[28]  S. Carlile,et al.  The localisation of spectrally restricted sounds by human listeners , 1999, Hearing Research.

[29]  F L Wightman,et al.  Headphone simulation of free-field listening. II: Psychophysical validation. , 1989, The Journal of the Acoustical Society of America.

[30]  Kevin J. Lang A time delay neural network architecture for speech recognition , 1989 .

[31]  Markus Schenkel Handwriting recognition using neural networks and hidden Markov models , 1995 .

[32]  Robert B. King,et al.  The Impact of Signal Bandwidth on Auditory Localization: Implications for the Design of Three-Dimensional Audio Displays , 1997, Hum. Factors.

[33]  F. Wightman,et al.  Acoustic origins of individual differences in sound localization behavior , 1988 .

[34]  W Chung,et al.  A performance adequate computational model for auditory localization. , 2000, The Journal of the Acoustical Society of America.

[35]  A D Musicant,et al.  The influence of pinnae-based spectral cues on sound localization. , 1984, The Journal of the Acoustical Society of America.

[36]  Ewan A. Macpherson A comparison of spectral correlation and local feature‐matching models of pinna cue processing , 1997 .

[37]  R A Butler,et al.  The linkage between stimulus frequency and covert peak areas as it relates to monaural localization , 1992, Perception & psychophysics.

[38]  D. M. Green,et al.  Characterization of external ear impulse responses using Golay codes. , 1992, The Journal of the Acoustical Society of America.

[39]  W R Thurlow,et al.  Effect of stimulus duration on localization of direction noise stimuli. , 1970, Journal of speech and hearing research.

[40]  Simon Carlile,et al.  Methods for spherical data analysis and visualization , 1998, Journal of Neuroscience Methods.

[41]  B. May,et al.  Spectral cues for sound localization in cats: a model for discharge rate representations in the auditory nerve. , 1997, The Journal of the Acoustical Society of America.

[42]  Simon R. Oldfield,et al.  Acuity of Sound Localisation: A Topography of Auditory Space. II. Pinna Cues Absent , 1984, Perception.

[43]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[44]  Ewan A. Macpherson Effects of source spectrum irregularity and uncertainty on sound localization. , 1996 .

[45]  E D Young,et al.  Neural organization and responses to complex stimuli in the dorsal cochlear nucleus. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[46]  K.R. Rao,et al.  Multiple auditory template matching using expansion , 1993, Proceedings of 36th Midwest Symposium on Circuits and Systems.

[47]  P J Bloom,et al.  Determination of monaural sensitivity changes due to the pinna by use of minimum-audible-field measurements in the lateral vertical plane. , 1977, The Journal of the Acoustical Society of America.

[48]  D. M. Green,et al.  Sound localization by human listeners. , 1991, Annual review of psychology.

[49]  W M Hartmann,et al.  On the externalization of sound images. , 1996, The Journal of the Acoustical Society of America.

[50]  P. Woodland,et al.  A computational model of the auditory periphery for speech and hearing research. II. Descending paths. , 1994, The Journal of the Acoustical Society of America.

[51]  William A. Yost,et al.  Head movements in localization , 1997 .

[52]  H. Steven Colburn,et al.  Role of spectral detail in sound-source localization , 1998, Nature.

[53]  David Marr,et al.  VISION A Computational Investigation into the Human Representation and Processing of Visual Information , 2009 .

[54]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[55]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[56]  C Giguère,et al.  A computational model of the auditory periphery for speech and hearing research. I. Ascending path. , 1994, The Journal of the Acoustical Society of America.

[57]  M. Cynader,et al.  A computational theory of spectral cue localization , 1993 .

[58]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[59]  S. Carlile,et al.  Measuring the human head-related transfer functions: a novel method for the construction and calibration of a miniature "in-ear" recording system. , 1994, The Journal of the Acoustical Society of America.

[60]  David Haussler,et al.  What Size Net Gives Valid Generalization? , 1989, Neural Computation.

[61]  E D Young,et al.  Neural network models of sound localization based on directional filtering by the pinna. , 1992, The Journal of the Acoustical Society of America.

[62]  D. Zipser,et al.  Identification models of the nervous system , 1992, Neuroscience.

[63]  F L Wightman,et al.  Monaural sound localization revisited. , 1997, The Journal of the Acoustical Society of America.

[64]  M. Gardner,et al.  Problem of localization in the median plane: effect of pinnae cavity occlusion. , 1973, The Journal of the Acoustical Society of America.

[65]  Robert A. Butler,et al.  The bandwidth effect on monaural and binaural localization , 1986, Hearing Research.

[66]  L. Rayleigh,et al.  XII. On our perception of sound direction , 1907 .

[67]  T. Takahashi,et al.  The neural coding of auditory space. , 1989, The Journal of experimental biology.