Enhanced diffuse field model for ad hoc microphone array calibration

In this paper, we investigate the diffuse field coherence model for microphone array pairwise distance estimation. We study the fundamental constraints and assumptions underlying this approach and propose evaluation methodologies to measure the adequacy of diffuseness for microphone array calibration. In addition, an enhanced schemebased on coherence averaging and histogramming, is presented to improve the robustness and performance of the pairwise distance estimation approach. The proposed theories and algorithms are evaluated on simulated and real data recordings for calibration of microphone array geometry in an ad hoc set-up. HighlightsAveraging and histogramming improve the diffuse field coherence model for calibration.A novel approach for assessment of the adequacy of diffuseness is formulated.The relation between distance, enclosure dimension and diffuseness is characterized.A methodology for augmenting the diffuse sound field is proposed.Fundamental limitation of calibration based on the coherence model is analyzed.

[1]  Spatial cross-correlation of reverberant sound fields , 1979 .

[2]  Volkan Cevher,et al.  Multi-Party Speech Recovery Exploiting Structured Sparsity Models , 2011, INTERSPEECH.

[3]  H. Nélisse,et al.  Characterization of a diffuse field in a reverberant room , 1997 .

[4]  Mohammed Ghanbari,et al.  Verified speaker localization utilizing voicing level in split-bands , 2009, Signal Process..

[5]  M. Schroeder,et al.  On Frequency Response Curves in Rooms. Comparison of Experimental, Theoretical, and Monte Carlo Results for the Average Frequency Spacing between Maxima , 1962 .

[6]  Rafaely Spatial-temporal correlation of a diffuse sound field , 2000, The Journal of the Acoustical Society of America.

[7]  Richard V. Waterhouse,et al.  Interference Patterns in Reverberant Sound Fields. II , 1955 .

[8]  Charles T. Morrow Point‐to‐Point Correlation of Sound Pressures in Reverberation Chambers , 1969 .

[9]  Anoop Gupta,et al.  Distributed meetings: a meeting capture and broadcasting system , 2002, MULTIMEDIA '02.

[10]  Bruno Fazenda,et al.  Studies in modal density – its effect at low frequencies , 2009 .

[11]  Manfred R. Schroeder Measurement of Sound Diffusion in Reverberation Chambers , 1959 .

[12]  Patrick J. F. Groenen,et al.  Modern Multidimensional Scaling: Theory and Applications , 2003 .

[13]  Spatial cross-correlation of acoustic pressures in steady and decaying reverberant sound fields , 1976 .

[14]  Kristine L. Bell,et al.  Array self calibration with large sensor position errors , 1999, Conference Record of the Thirty-Third Asilomar Conference on Signals, Systems, and Computers (Cat. No.CH37020).

[15]  Richard M. Stern,et al.  Microphone array processing for robust speech recognition , 2003 .

[16]  Bartosz Gapiński,et al.  THE ROUNDNESS DEVIATION MEASUREMENT WITH COORDINATE MEASURING MACHINES , 2006 .

[17]  Ming Zhang,et al.  A robust speech detection algorithm in a microphone array teleconferencing system , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[18]  Minghua Chen,et al.  Energy-Based Position Estimation of Microphones and Speakers for Ad Hoc Microphone Arrays , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[19]  Karl-Dirk Kammeyer,et al.  Theoretical noise reduction limits of the generalized sidelobe canceller (GSC) for speech enhancement , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[20]  R. K. Cook,et al.  Measurement of Correlation Coefficients in Reverberant Sound Fields , 1955 .

[21]  M. Schroeder The ‘‘Schroeder frequency’’ revisited , 1996 .

[22]  C. Maury,et al.  Enhancing low frequency sound transmission measurements using a synthesis method. , 2007, The Journal of the Acoustical Society of America.

[23]  Harvey F. Silverman,et al.  Microphone position and gain calibration for a large-aperture microphone array , 2005, IEEE Transactions on Speech and Audio Processing.

[24]  Hervé Bourlard,et al.  Sparse component analysis for speech recognition in multi-speaker environment , 2010, INTERSPEECH.

[25]  Ivan Himawan,et al.  Microphone Array Shape Calibration in Diffuse Noise Fields , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[27]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[28]  Hervé Bourlard,et al.  Microphone array beampattern characterization for hands-free speech applications , 2012, 2012 IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM).

[29]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[30]  Rainer Lienhart,et al.  Position calibration of microphones and loudspeakers in distributed computing platforms , 2005, IEEE Transactions on Speech and Audio Processing.

[31]  W. T. Chu Eigenmode analysis of the interference patterns in reverberant sound fields , 1980 .

[32]  Afsaneh Asaei,et al.  An integrated framework for multi-channel multi-source localization and voice activity detection , 2011, 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays.

[33]  T. J. Schultz Diffusion in reverberation rooms , 1971 .