Performance of real-time source-location estimators for a large-aperture microphone array

A large array of microphones is being studied as a possible means of acquiring data in offices, conference rooms, and auditoria without requiring close-talking microphones. An array that surrounds all possible sources has a large aperture and such arrays have attractive properties for accurate spatial resolution and significant signal-to-noise enhancement. For the first time, this paper presents all the details of a real-time, source-location algorithm (LEMSalg) based on time-of-arrival delays derived from a phase transform applied to the generalized cross-power spectrum. It is being used successfully in a representative environment where microphone SNRs are below 0 dB. We have found that many small features are required to make a useful location estimating algorithm work and work well in real-time. We present an experimental evaluation of the current algorithm's performance using data taken with the Huge Microphone Array (HMA) system, which has 448 microphones in a noisy, reverberant environment. Using off-line computation, we also compared the LEMSalg to two alternative methods. The first of these adds local beamforming to the preprocessing of the base algorithm, increasing performance significantly at modest additional computational cost. The second algorithm maximizes the total steered-response power in the same phase transform. While able to derive good position estimates from shorter data runs, this method is two orders of magnitude more computationally expensive and is not yet suitable for real-time use.

[1]  L. Deng Ieee Transactions on Speech and Audio Processing, Speech Trajectory Discrimination Using the Minimum Classiication Error Learning , 1997 .

[2]  Harvey F. Silverman,et al.  An experiment that validates theory with measurements for a large-aperture microphone array , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Harvey F. Silverman,et al.  First measurements of a large-aperture microphone array system for remote audio acquisition , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[4]  Michael Shapiro Brandstein,et al.  A framework for speech source localization using sensor arrays , 1995 .

[5]  J. Flanagan,et al.  Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .

[6]  Parham Aarabi,et al.  EURASIP Journal on Applied Signal Processing 2003:4, 338–347 c ○ 2003 Hindawi Publishing Corporation The Fusion of Distributed Microphone Arrays for Sound Localization , 2002 .

[7]  Harvey F. Silverman,et al.  Microphone array optimization by stochastic region contraction , 1991, IEEE Trans. Signal Process..

[8]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[9]  D. B. Ward,et al.  Grid-based beamformer design for room-environment microphone arrays , 1999, Proceedings of the 1999 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. WASPAA'99 (Cat. No.99TH8452).

[10]  M. Rossi,et al.  Acoustics and Electroacoustics , 1992 .

[11]  Maurizio Omologo,et al.  Acoustic source location in noisy and reverberant environment using CSP analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[12]  Harvey F Silverman,et al.  Factors affecting the performance of large-aperture microphone arrays. , 2002, The Journal of the Acoustical Society of America.

[13]  James L. Flanagan,et al.  The Huge Microphone Array. 2 , 1999, IEEE Concurr..

[14]  William H. Press,et al.  Numerical recipes in C , 2002 .

[15]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.

[16]  Maurizio Omologo,et al.  Acoustic event localization using a crosspower-spectrum phase based technique , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Joseph H. DiBiase A High-Accuracy, Low-Latency Technique for Talker Localization in Reverberant Environments Using Microphone Arrays , 2000 .

[18]  Harvey F. Silverman,et al.  Early experimental results for a large-aperture microphone-array system , 2000, Proceedings of the 2000 IEEE Sensor Array and Multichannel Signal Processing Workshop. SAM 2000 (Cat. No.00EX410).

[19]  James L. Flanagan,et al.  The huge microphone array , 1998, IEEE Concurr..

[20]  Harvey F. Silverman,et al.  Position calibration of large-aperture microphone arrays , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[21]  Harvey F. Silverman,et al.  Visualizing the performance of large-aperture microphone arrays , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[22]  Ea-Ee Jan,et al.  Matched-filter processing of microphone array for spatial volume selectivity , 1995, Proceedings of ISCAS'95 - International Symposium on Circuits and Systems.

[23]  J. Flanagan,et al.  The Huge Microphone Array (HMA) , 1997 .