Acoustic binaural correspondence used for localization of natural acoustic signals

Abstract The left-right signal correspondence problem, that is considered as one of the most prominent problems by visual stereoscopic computational models, is much ignored by computational auditory stereophonic models. The correspondence problem, which is trivial if only one acoustic source is present, is highly complicated for a multiple sources environment. We present a computational model able to perform localization of natural complex acoustic signals (one or two human speakers). The model relies mainly on computing the cross-correlation functions of selected frequency channels arriving at the two ears, and performing a weighted integration on these functions. Thus, first attempts are made to establish a correspondence between acoustic features of the two channels. Preliminary results show that this model, which might be compared to “early vision” models in computational vision research, can serve as a first step in analyzing the acoustic scene.

[1]  A. Mills On the minimum audible angle , 1958 .

[2]  E. Knudsen Auditory and visual maps of space in the optic tectum of the owl , 1982, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[3]  R. Dye,et al.  The combination of interaural information across frequencies: lateralization on the basis of interaural delay. , 1990, The Journal of the Acoustical Society of America.

[4]  S A Shamma,et al.  Stereausis: binaural processing without neural delays. , 1989, The Journal of the Acoustical Society of America.

[5]  C Trahiotis,et al.  Lateralization of sinusoidally amplitude-modulated tones: effects of spectral locus and temporal variation. , 1985, The Journal of the Acoustical Society of America.

[6]  John K. Cullen Auditory Scene Analysis Albert S. Bregman, Cambridge, MA: The MIT Press, (1990). 773 pp., $60.00, ISBN 0-262-02297-4. Orders: (800) 356-0343. , 1993 .

[7]  D. Perrott Concurrent minimum audible angle: a re-examination of the concept of auditory spatial acuity. , 1984, The Journal of the Acoustical Society of America.

[8]  H S Colburn,et al.  Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise. , 1977, The Journal of the Acoustical Society of America.

[9]  C. Trahiotis,et al.  Lateralization of bands of noise and sinusoidally amplitude-modulated tones: effects of spectral locus and bandwidth. , 1986, The Journal of the Acoustical Society of America.

[10]  E I Knudsen,et al.  Computational maps in the brain. , 1987, Annual review of neuroscience.

[11]  D W Grantham,et al.  Detectability of tonal signals with changing interaural phase differences in noise. , 1988, The Journal of the Acoustical Society of America.

[12]  A. Zeiberg,et al.  Lateralization of complex binaural stimuli: a weighted-image model. , 1988, The Journal of the Acoustical Society of America.

[13]  Richard F. Lyon A computational model of binaural localization and separation , 1983, ICASSP.

[14]  Richard F. Lyon,et al.  A computational model of filtering, detection, and compression in the cochlea , 1982, ICASSP.

[15]  Li Deng,et al.  Processing of acoustic signals in a cochlear model incorporating laterally coupled suppressive elements , 1992, Neural Networks.

[16]  J. Rinzel,et al.  A biophysical model of cochlear processing: intensity dependence of pure tone responses. , 1986, The Journal of the Acoustical Society of America.

[17]  Hans Werner Strube,et al.  Separation of several speakers recorded by two microphones (cocktail-party processing) , 1981 .

[18]  H S Colburn,et al.  Theory of binaural interaction based in auditory-nerve data. IV. A model for subjective lateral position. , 1978, The Journal of the Acoustical Society of America.

[19]  S. Coren,et al.  In Sensation and perception , 1979 .