Localization of multiple sound sources with two microphones.

This paper presents a two-microphone technique for localization of multiple sound sources. Its fundamental structure is adopted from a binaural signal-processing scheme employed in biological systems for the localization of sources using interaural time differences (ITD). The two input signals are transformed to the frequency domain and analyzed for coincidences along left/right-channel delay-line pairs. The coincidence information is enhanced by a nonlinear operation followed by a temporal integration. The azimuths of the sound sources are estimated by integrating the coincidence locations across the broadband of frequencies in speech signals (the "direct" method). Further improvement is achieved by using a novel "stencil" filter pattern recognition procedure. This includes coincidences due to phase delays of greater than 2pi, which are generally regarded as ambiguous information. It is demonstrated that the stencil method can greatly enhance localization of lateral sources over the direct method. Also discussed and analyzed are two limitations involved in both methods, namely missed and artifactual sound sources. Anechoic chamber tests as well as computer simulation experiments showed that the signal-processing system generally worked well in detecting the spatial azimuths of four or six simultaneously competing sound sources.

[1]  H. Steven Colburn Erratum: ''Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise'' [J. Acoust. Soc. Am. 61, 525-533 (1977)] , 1977 .

[2]  Richard M. Stern,et al.  The Role of Consistency of Interaural Timing Over Frequency in Binaural Lateralization , 1992 .

[3]  H Steven Colburn,et al.  Coincidence model of MSO responses , 1990, Hearing Research.

[4]  Richard M. Stern,et al.  MODELS OF BINAURAL PERCEPTION , 1999 .

[5]  John Lazzaro,et al.  Analog VLSI model of binaural hearing , 1991, IEEE Trans. Neural Networks.

[6]  H. Steven Colburn,et al.  Computational Models of Binaural Processing , 1996 .

[7]  T. Yin,et al.  Interaural time sensitivity in medial superior olive of cat. , 1990, Journal of neurophysiology.

[8]  G. Gourevitch Binaural Hearing in Land Mammals , 1987 .

[9]  Nathaniel I. Durlach,et al.  Chapter 11 – MODELS OF BINAURAL INTERACTION , 1978 .

[10]  D. Banks Localisation and separation of simultaneous voices with two microphones , 1993 .

[11]  Gerald M. Edelman,et al.  Auditory function : neurobiological bases of hearing , 1988 .

[12]  A. Zeiberg,et al.  Lateralization of complex binaural stimuli: a weighted-image model. , 1988, The Journal of the Acoustical Society of America.

[13]  R Plomp,et al.  Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. , 1992, The Journal of the Acoustical Society of America.

[14]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[15]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. I. General strategy and preliminary results on interaural discrimination. , 1973, The Journal of the Acoustical Society of America.

[16]  Jens Blauert,et al.  Modelling of Interaural Time and Intensity Difference Discrimination , 1980 .

[17]  C H Keller,et al.  Representation of multiple sound sources in the owl's auditory space map , 1994, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[18]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. I. Simulation of lateralization for stationary signals. , 1986, The Journal of the Acoustical Society of America.

[19]  W. Gaik,et al.  Combined evaluation of interaural time and intensity differences: psychoacoustic results and computer modeling. , 1993, The Journal of the Acoustical Society of America.

[20]  S A Shamma,et al.  Stereausis: binaural processing without neural delays. , 1989, The Journal of the Acoustical Society of America.

[21]  M. Bodden Modeling human sound-source localization and the cocktail-party-effect , 1993 .

[22]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise. , 1977, The Journal of the Acoustical Society of America.

[23]  L A JEFFRESS,et al.  A place theory of sound localization. , 1948, Journal of comparative and physiological psychology.

[24]  Albert S. Feng,et al.  Peripheral basis of sound localization in anurans. Acoustic properties of the frog's ear , 1981, Hearing Research.

[25]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .