A Free-Source Method (FrSM) for Calibrating a Large-Aperture Microphone Array

Large-aperture microphone arrays can be used to capture and enhance speech from individual talkers in noisy, multi-talker, and reverberant environments. However, they must be calibrated, often more than once, to obtain accurate 3-dimensional coordinates for all microphones. Direct-measurement techniques, such as using a measuring tape or a laser-based tool are cumbersome and time-consuming. Some previous methods that used acoustic signals for array calibration required bulky hardware and/or fixed, known source locations. Others, which allowed more flexible source placement, often have issues with real data, have reported results for 2D only, or work only for small arrays. This paper describes a complete and robust method for automatic calibration using acoustic signals which is simple, repeatable, accurate, and has been shown to work for a real system. The method requires only a single transducer (speaker) with a microphone attached above its center. The unit is freely moved around the focal volume of the microphone array generating a single long recording from all the microphones. After that, the system is completely automatic. We describe the free source method (FrSM), validate its effectiveness and present accuracy results against measured ground truth. The performance of FrSM is compared to that from several other methods for a real 128-microphone array.

[1]  Ming Zhang,et al.  A robust speech detection algorithm in a microphone array teleconferencing system , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[2]  Minghua Chen,et al.  Energy-Based Position Estimation of Microphones and Speakers for Ad Hoc Microphone Arrays , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[3]  B. C. Ng,et al.  Sensor-array calibration using a maximum-likelihood approach , 1996 .

[4]  Ivan Himawan,et al.  Microphone Array Shape Calibration in Diffuse Noise Fields , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Jen-Tzung Chien,et al.  Microphone array signal processing for far-talking speech recognition , 2001, 2001 IEEE Third Workshop on Signal Processing Advances in Wireless Communications (SPAWC'01). Workshop Proceedings (Cat. No.01EX471).

[6]  Anthony J. Weiss,et al.  Array shape calibration using sources in unknown locations-a maximum likelihood approach , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[7]  Ahmad Hashemi-Sakhtsari,et al.  Automatic microphone array position calibration using an acoustic sounding source , 2009, 2009 4th IEEE Conference on Industrial Electronics and Applications.

[8]  Alessio Del Bue,et al.  A Bilinear Approach to the Position Self-Calibration of Multiple Sensors , 2012, IEEE Transactions on Signal Processing.

[9]  Ramani Duraiswami,et al.  Automatic position calibration of multiple microphones , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Gernot A. Fink,et al.  A hierarchical approach to unsupervised shape calibration of microphone array networks , 2009, 2009 IEEE/SP 15th Workshop on Statistical Signal Processing.

[11]  Ying Yu,et al.  A Real-Time SRP-PHAT Source Location Implementation using Stochastic Region Contraction(SRC) on a Large-Aperture Microphone Array , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[12]  Richard M. Stern,et al.  Speech recognizer-based microphone array processing for robust hands-free speech recognition , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  Peter M. Schultheiss,et al.  Array shape calibration using sources in unknown locations-Part II: Near-field sources and estimator implementation , 1987, IEEE Trans. Acoust. Speech Signal Process..

[14]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[15]  Sebastian Thrun,et al.  A passive approach to sensor network localization , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[16]  A Fast Algorithm of Moore-Penrose Inverse for the Symmetric Loewner-Type Matrix , 2009, 2009 International Conference on Information Engineering and Computer Science.

[17]  Harvey F. Silverman,et al.  A method for locating multiple sources from a frame of a large-aperture microphone array data without tracking , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Anthony J. Weiss,et al.  Array shape calibration using sources in unknown locations-a maximum likelihood approach , 1989, IEEE Trans. Acoust. Speech Signal Process..

[19]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[20]  Peter M. Schultheiss,et al.  Array shape calibration using sources in unknown locations-Part I: Far-field sources , 1987, IEEE Trans. Acoust. Speech Signal Process..

[21]  Stanley T. Birchfield,et al.  Microphone array position calibration by basis-point classical multidimensional scaling , 2005, IEEE Transactions on Speech and Audio Processing.

[22]  I. N. Imam,et al.  Using rank factorization in calculating the Moore-Penrose generalized inverse , 1989, Proceedings. IEEE Energy and Information Technologies in the Southeast'.

[23]  Stanley T. Birchfield Geometric microphone array calibration by multidimensional scaling , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[24]  Kung Yao,et al.  Maximum-likelihood source localization and unknown sensor location estimation for wideband signals in the near-field , 2002, IEEE Trans. Signal Process..

[25]  D. E. Manolakis,et al.  Efficient solution and performance analysis of 3-D position estimation by trilateration , 1996 .

[26]  Harvey F. Silverman,et al.  Microphone position and gain calibration for a large-aperture microphone array , 2005, IEEE Transactions on Speech and Audio Processing.

[27]  Augusto Sarti,et al.  Geometric calibration of distributed microphone arrays from acoustic source correspondences , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[28]  Rainer Lienhart,et al.  Position calibration of microphones and loudspeakers in distributed computing platforms , 2005, IEEE Transactions on Speech and Audio Processing.

[29]  Marc Pollefeys,et al.  Direct computation of sound and microphone locations from time-difference-of-arrival data , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.