A Probabilistic Model for Binaural Sound Localization

This paper proposes a biologically inspired, technically implemented sound localization system that robustly estimates the position of a sound source in the frontal azimuthal half-plane. Binaural cues are extracted from cochleagrams generated by a cochlear model, and these cochleagrams serve as the input to the system. The basic idea of the model is to measure interaural time differences and interaural level differences separately for a number of frequencies and to process these measurements as a whole. This leads to two-dimensional frequency versus time-delay representations of binaural cues, so-called activity maps. A probabilistic evaluation is presented that estimates the position of a sound source over time from these activity maps. Learned reference maps for different azimuthal positions are integrated into the computation to obtain time-dependent discrete conditional probabilities. At every timestep these probabilities are combined over frequencies and binaural cues to estimate the sound source position. In addition, they are propagated over time to improve the position estimate. The result is a system that can localize audible signals, for example human speech, even in reverberant environments.
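
The probabilistic evaluation described above can be illustrated with a minimal sketch. The abstract does not give the actual formulas, so everything specific here is an assumption made for illustration: the cosine-similarity likelihood, the `sharpness` parameter, the conditional-independence product over channels, the `stay` transition model, and all function names are hypothetical. Each "channel" stands for one frequency band of one binaural cue, and each channel's slice of an activity map is compared with the learned reference slice for every candidate azimuth.

```python
import math


def channel_likelihoods(activity, references, sharpness=5.0):
    """Discrete P(activity | azimuth) for one channel.

    activity:   activity values over time-delay bins for this channel
    references: one learned reference vector per candidate azimuth
    Assumption: likelihood ~ exp(sharpness * cosine similarity).
    """
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0

    scores = [math.exp(sharpness * cosine(activity, ref)) for ref in references]
    total = sum(scores)
    return [s / total for s in scores]


def combine_channels(per_channel):
    """Combine probabilities over frequencies and cues.

    Assumption: channels are conditionally independent given the azimuth,
    so the combination is a normalized product (computed in log space).
    """
    n_azimuths = len(per_channel[0])
    log_p = [sum(math.log(ch[k] + 1e-12) for ch in per_channel)
             for k in range(n_azimuths)]
    peak = max(log_p)
    weights = [math.exp(v - peak) for v in log_p]
    total = sum(weights)
    return [w / total for w in weights]


def propagate(prior, likelihood, stay=0.8):
    """One step of propagation over time (recursive Bayesian update).

    Assumption: the source keeps its azimuth with probability `stay`
    and otherwise jumps to a uniformly chosen azimuth.
    """
    n = len(prior)
    predicted = [stay * p + (1.0 - stay) / n for p in prior]
    posterior = [p * l for p, l in zip(predicted, likelihood)]
    total = sum(posterior)
    return [p / total for p in posterior]


def localize(frames, references):
    """Return the most probable azimuth index at every timestep.

    frames[t][c][d]:     activity map at time t, channel c, delay bin d
    references[a][c][d]: learned reference map for azimuth a
    """
    n_azimuths = len(references)
    belief = [1.0 / n_azimuths] * n_azimuths  # uniform initial belief
    estimates = []
    for frame in frames:
        per_channel = [
            channel_likelihoods(frame[c], [references[a][c] for a in range(n_azimuths)])
            for c in range(len(frame))
        ]
        belief = propagate(belief, combine_channels(per_channel))
        estimates.append(max(range(n_azimuths), key=lambda k: belief[k]))
    return estimates
```

Calling `localize(frames, references)` with toy maps whose activity peaks in the delay bin associated with one azimuth returns that azimuth's index at each timestep. Carrying the belief from one timestep to the next is what lets a single ambiguous frame, for instance one corrupted by reverberation, be outweighed by earlier evidence.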
