Azimuthal sound localization using coincidence of timing across frequency on a robotic platform.

An algorithm for localizing a sound source with two microphones is introduced and used in real-time situations. This algorithm is inspired by biological computation of interaural time difference as occurring in the barn owl and is a modification of the algorithm proposed by Liu et al. [J. Acoust. Soc. Am. 110, 3218-3231 (2001)] in that it creates a three-dimensional map of coincidence location. This eliminates localization artifacts found during tests with the original algorithm. The source direction is found by determining the azimuth at which the minimum of the response in an azimuth-frequency matrix occurs. The system was tested with a pan-tilt unit in real-time in an office environment with signal types ranging from broadband noise to pure tones. Both open loop (pan-tilt unit stationary) and closed loop experiments (pan-tilt unit moving) were conducted. In real world situations, the algorithm performed well for all signal types except pure tones. Subsequent room simulations showed that localization accuracy decreases with decreasing direct-to-reverberant ratio.

[1]  L. Carney,et al.  A model for binaural response properties of inferior colliculus neurons. I. A model with interaural time difference-sensitive excitatory and inhibitory inputs. , 1998, Journal of the Acoustical Society of America.

[2]  Hiroaki Kitano,et al.  Active Audition for Humanoid , 2000, AAAI/IAAI.

[3]  W. Gaik Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[4]  L. Carney,et al.  A Model for Interaural Time Difference Sensitivity in the Medial Superior Olive: Interaction of Excitatory and Inhibitory Synaptic Inputs, Channel Dynamics, and Cellular Morphology , 2005, The Journal of Neuroscience.

[5]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[6]  Maurizio Omologo,et al.  Acoustic event localization using a crosspower-spectrum phase based technique , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Jonas Braasch,et al.  Localization in the presence of a distracter and reverberation in the frontal horizontal plane. II. Model algorithms , 2002 .

[8]  David McAlpine,et al.  Sound localization and delay lines – do mammals fit the model? , 2003, Trends in Neurosciences.

[9]  Philip H Smith,et al.  Coincidence Detection in the Auditory System 50 Years after Jeffress , 1998, Neuron.

[10]  S Carlile,et al.  Neural system identification model of human sound localization. , 2000, The Journal of the Acoustical Society of America.

[11]  Volker Hohmann,et al.  Enhancing sound sources by use of binaural spatial cues , 2001 .

[12]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[13]  G. Blasdel,et al.  Sound localization by the barn owl (Tyto alba) measured with the search coil technique , 1979, Journal of comparative physiology.

[14]  H Steven Colburn,et al.  Coincidence model of MSO responses , 1990, Hearing Research.

[15]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[16]  Darren B. Ward,et al.  Particle filter beamforming for acoustic source localization in a reverberant environment , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Hiroaki Kitano,et al.  Real-time sound source localization and separation for robot audition , 2002, INTERSPEECH.

[18]  Kiyohiro Shikano,et al.  Talker Tracking Display On Autonomous Mobile Robot With A Moving Microphone Array , 2002 .

[19]  M. Konishi,et al.  Selectivity for interaural time difference in the owl's midbrain , 1986, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[20]  Volker Hohmann,et al.  Sound source localization in real sound fields based on empirical statistics of interaural parameters. , 2006, The Journal of the Acoustical Society of America.

[21]  M. Konishi,et al.  Axonal delay lines for time measurement in the owl's brainstem. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Harald Viste,et al.  Binaural Source Localization , 2004 .

[23]  Maurizio Omologo,et al.  Acoustic source location in a three-dimensional space using crosspower spectrum phase , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  B C Wheeler,et al.  Localization of multiple sound sources with two microphones. , 2000, The Journal of the Acoustical Society of America.

[25]  DeLiang Wang,et al.  Binaural tracking of multiple moving sources , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[26]  Jie Huang,et al.  A model-based sound localization system and its application to robot navigation , 1999, Robotics Auton. Syst..

[27]  M. Bodden Modeling human sound-source localization and the cocktail-party-effect , 1993 .

[28]  E. Rubel,et al.  Organization and development of brain stem auditory nuclei of the chicken: Organization of projections from N. magnocellularis to N. laminaris , 1975, The Journal of comparative neurology.

[29]  Richard M. Stern,et al.  Chapter 10 – Models of Binaural Interaction , 1995 .

[30]  M. Konishi,et al.  Segregation of stimulus phase and intensity coding in the cochlear nucleus of the barn owl , 1984, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[31]  Stefan Wermter,et al.  Auditory robotic tracking of sound sources using hybrid cross-correlation and recurrent networks , 2005, 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[32]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. , 1986, The Journal of the Acoustical Society of America.

[33]  M. Konishi,et al.  A circuit for detection of interaural time differences in the brain stem of the barn owl , 1990, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[34]  John C. Pearson,et al.  The Computation of Sound Source Elevation in the Barn Owl , 1989, NIPS 1989.

[35]  B C Wheeler,et al.  A two-microphone dual delay-line approach for extraction of a speech sound in the presence of multiple interferers. , 2001, The Journal of the Acoustical Society of America.

[36]  John C. Pearson,et al.  The Computation of Sound Source Evaluation in the Barn Owl , 1989, NIPS.

[37]  Jean Rouat,et al.  Robust sound source localization using a microphone array on a mobile robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[38]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[39]  Terry T. Takahashi,et al.  Prediction of auditory spatial acuity from neural images on the owl's auditory space map , 2003, Nature.

[40]  W Chung,et al.  A performance adequate computational model for auditory localization. , 2000, The Journal of the Acoustical Society of America.

[41]  Carl F. Eyring Methods of Calculating the Average Coefficient of Sound Absorption , 1933 .

[42]  Stanley T. Birchfield,et al.  Fast Bayesian acoustic localization , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[43]  M. Konishi,et al.  Neural map of interaural phase difference in the owl's brainstem. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[44]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[45]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[46]  A. Kohlrausch,et al.  Binaural processing model based on contralateral inhibition. I. Model structure. , 2001, The Journal of the Acoustical Society of America.