Constant False Alarm Rate Sound Source Detection with Distributed Microphones

Applications related to distributed microphone systems are typically initiated with sound source detection. This paper introduces a novel method for the automatic detection of sound sources in images created with steered response power (SRP) algorithms. The method exploits the near-symmetric coherent power noise distribution to estimate constant false-alarm rate (CFAR) thresholds. Analyses show that low-frequency source components degrade CFAR threshold performance due to increased nonsymmetry in the coherent power distribution. This degradation, however, can be offset by partial whitening or increasing differential path distances between the microphone pairs and the spatial locations of interest. Experimental recordings are used to assess CFAR performance subject to variations in source frequency content and partial whitening. Results for linear, perimeter, and planar microphone geometries demonstrate that experimental false-alarm probabilities for CFAR thresholds ranging from 10-1 and 10-6 are limited to within one order of magnitude when proper filtering, partial whitening, and noise model parameters are applied.

[1]  Maurizio Omologo,et al.  Talker localization and speech recognition using a microphone array and a cross-powerspectrum phase analysis , 1994, ICSLP.

[2]  James L. Flanagan,et al.  Autodirective Microphone Systems , 1991 .

[3]  Hong-Seok Kim,et al.  Performance of an HMM speech recognizer using a real-time tracking microphone array as input , 1999, IEEE Trans. Speech Audio Process..

[4]  Hermann Rohling,et al.  Radar CFAR Thresholding in Clutter and Multiple Target Situations , 1983, IEEE Transactions on Aerospace and Electronic Systems.

[5]  Rama Chellappa,et al.  Non-Gaussian CFAR techniques for target detection in high resolution SAR images , 1994, Proceedings of 1st International Conference on Image Processing.

[6]  Henry G. Dietz,et al.  Performance of phase transform for detecting sound sources with microphone arrays in reverberant and noisy environments , 2007, Signal Process..

[7]  Mohan M. Trivedi,et al.  Source localization in reverberant environments: modeling and statistical analysis , 2003, IEEE Trans. Speech Audio Process..

[8]  Harvey F. Silverman,et al.  Some analysis of microphone arrays for speech data acquisition , 1987, IEEE Trans. Acoust. Speech Signal Process..

[9]  K.D. Donohue,et al.  Sound source detection threshold estimation using negative coherent power , 2008, IEEE SoutheastCon 2008.

[10]  Nihat M. Bilgutay,et al.  OS characterization for local CFAR detection , 1991, IEEE Trans. Syst. Man Cybern..

[11]  Seok Cheol Kee,et al.  Speaker detection and tracking at mobile robot platform , 2004, Proceedings of 2004 International Symposium on Intelligent Signal Processing and Communication Systems, 2004. ISPACS 2004..

[12]  André Gilloire,et al.  Microphone array for sound pickup in teleconference systems , 1994 .

[13]  J. Flanagan,et al.  Robust speech recognition in a multimedia teleconferencing environment , 1992 .

[14]  Kevin D. Donohue,et al.  Experimental performance analysis of sound source detection with SRP PHAT-β , 2009, IEEE Southeastcon 2009.

[15]  Michael S. Brandstein,et al.  Robust Localization in Reverberant Rooms , 2001, Microphone Arrays.