A unifying framework for acoustic localization

Recent advances in acoustic localization have combined the advantages of the traditional methods of beamforming and time-delay estimation, leading to techniques that are both accurate and fast. We present a unifying framework that reveals the relationships between beamforming, time-delay estimation, Bayesian formulation, hemisphere sampling, and accumulated correlation. We then experimentally compare the algorithms, on both compact and distributed microphone arrays, showing that the recent technique of accumulated correlation, although much less computationally expensive, exhibits performance comparable to that of beamforming.

[1]  Stanley T. Birchfield,et al.  Acoustic source direction by hemisphere sampling , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[2]  Michael S. Brandstein,et al.  A closed-form method for finding source locations from microphone-array time-decay estimates , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Maurizio Omologo,et al.  Acoustic source location in a three-dimensional space using crosspower spectrum phase , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Andrew Blake,et al.  Nonlinear filtering for speaker tracking in noisy and reverberant environments , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  M S Brandstein Time-delay estimation of reverberated speech exploiting harmonic structure. , 1999, The Journal of the Acoustical Society of America.

[6]  Stanley T. Birchfield,et al.  Fast Bayesian acoustic localization , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Trevor Darrell,et al.  A Probabilistic Framework for Multi-modal Multi-Person Tracking , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[8]  Hong Wang,et al.  Voice source localization for automatic camera pointing system in videoconferencing , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[9]  Michael S. Brandstein,et al.  A practical methodology for speech source localization with microphone arrays , 1997, Comput. Speech Lang..

[10]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[11]  Darren B. Ward,et al.  Particle filter beamforming for acoustic source localization in a reverberant environment , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Maurizio Omologo,et al.  Use of the crosspower-spectrum phase in acoustic event location , 1997, IEEE Trans. Speech Audio Process..

[13]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[14]  Larry S. Davis,et al.  Active speech source localization by a dual coarse-to-fine search , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).