Three-dimensional sound localization from a compact non-coplanar array of microphones using tree-based learning.

One of the various human sensory capabilities is to identify the direction of perceived sounds. The goal of this work is to study sound source localization in three dimensions using some of the most important cues the human uses. In an attempt to satisfy the requirements of portability and miniaturization in robotics, this approach employs a compact sensor structure that can be placed on a mobile platform. The objective is to estimate the relative sound source position in three-dimensional space without imposing excessive restrictions on its spatio-temporal characteristics and the environment structure. Two types of features are considered, interaural time and level differences. Their relative effectiveness for localization is studied, as well as a practical way of using these complementary parameters. A two-stage procedure was used. In the training stage, sound samples are produced from points with known coordinates and then are stored. In the recognition stage, unknown sounds are processed by the trained system to estimate the 3D location of the sound source. Results from the experiments showed under +/-3 degrees in average angular error and less than +/-20% in average radial distance error.

[1]  Y. Chan,et al.  The least squares estimation of time delay and its use in signal detection , 1978 .

[2]  J. Ianniello,et al.  Time delay estimation via cross-correlation in the presence of large estimation errors , 1982 .

[3]  Dermot Furlong,et al.  Virtual Imaging Capabilities of Surround Sound Systems , 1994 .

[4]  Benoît Champagne,et al.  Performance of time-delay estimation in the presence of room reverberation , 1996, IEEE Trans. Speech Audio Process..

[5]  F. Wightman,et al.  The dominant role of low-frequency interaural time differences in sound localization. , 1992, The Journal of the Acoustical Society of America.

[6]  Juyang Weng,et al.  Vision-guided navigation using SHOSLIF , 1998, Neural Networks.

[7]  Michael S. Brandstein,et al.  A practical time-delay estimator for localizing speech sources with a microphone array , 1995, Comput. Speech Lang..

[8]  W M Hartmann,et al.  Localization of sound in rooms. IV: The Franssen effect. , 1989, The Journal of the Acoustical Society of America.

[9]  Michael S. Brandstein,et al.  A closed-form location estimator for use with room environment microphone arrays , 1997, IEEE Trans. Speech Audio Process..

[10]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[11]  Stephen L. Hobbs Asymptotic statistics for location estimates of acoustic signals , 1992 .

[12]  James L. Flanagan,et al.  A DSP implementation of source location using microphone arrays. , 1996 .

[13]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .