Correlation Maximization-Based Sampling Rate Offset Estimation for Distributed Microphone Arrays

In this paper, we investigate the sampling rate mismatch problem in distributed microphone arrays and propose a correlation maximization algorithm to blindly estimate the sampling rate offset between two asynchronously sampled microphone signals. We approximate the sampling rate offset with a linear-phase drift model in the short-time Fourier transform (STFT) domain and show that the correlation coefficient between two microphone signals tends to present the highest value when the sampling of the two microphone signals is synchronized. Based on this finding we propose the correlation maximization algorithm, which performs sampling rate compensation on two microphone signals with different possible offset values and calculates their correlation coefficient after compensation. The offset value that leads to the largest correlation coefficient is chosen as the optimal estimate. Since the precision of the STFT linear-phase drift model used in the algorithm degrades as the sampling rate offset or the signal length is increased, we further propose a two-stage exhaustive search scheme to detect the optimal sampling rate offset. This scheme is able to minimize the influence of the linear-phase drift model error in order to improve the sampling rate offset estimation accuracy. Both simulated as well as real-world experiments confirm the effectiveness of the proposed algorithm.

[1]  Marc Moonen,et al.  Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Reinhold Häb-Umbach,et al.  Sampling rate synchronisation in acoustic sensor networks with a pre-trained clock skew error model , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[3]  K. J. Ray Liu,et al.  Handbook on Array Processing and Sensor Networks , 2010 .

[4]  Shoji Makino,et al.  Blind compensation of interchannel sampling frequency mismatch for ad hoc microphone array based on maximum likelihood estimation , 2015, Signal Process..

[5]  Te-Won Lee,et al.  Blind Speech Separation , 2007, Blind Speech Separation.

[6]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[7]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[8]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[9]  Peter Vary,et al.  Adaptive Sampling Rate Correction for Acoustic Echo Control in Voice-Over-IP , 2010, IEEE Transactions on Signal Processing.

[10]  Alexander Bertrand,et al.  Applications and trends in wireless acoustic sensor networks: A signal processing perspective , 2011, 2011 18th IEEE Symposium on Communications and Vehicular Technology in the Benelux (SCVT).

[11]  Rémi Gribonval,et al.  Performance measurement in blind audio source separation , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  K. J. Ray Liu,et al.  Handbook on Array Processing and Sensor Networks: Haykin/Array Processing , 2010 .

[13]  Heping Ding,et al.  Combining Superdirective Beamforming and Frequency-Domain Blind Source Separation for Highly Reverberant Signals , 2010, EURASIP J. Audio Speech Music. Process..

[14]  Zicheng Liu SOUND SOURCE SEPARATION WITH DISTRIBUTED MICROPHONE ARRAYS IN THE PRESENCE OF CLOCK SYNCHRONIZATION ERRORS , 2008 .

[15]  Marc Moonen,et al.  GSVD-based optimal filtering for single and multimicrophone speech enhancement , 2002, IEEE Trans. Signal Process..

[16]  Minerva M. Yeung,et al.  On the importance of exact synchronization for distributed audio signal processing , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[17]  David I. Havelock,et al.  Drift-Compensated Adaptive Filtering for Improving Speech Intelligibility in Cases with Asynchronous Inputs , 2010, EURASIP J. Adv. Signal Process..

[18]  Rainer Lienhart,et al.  Position calibration of microphones and loudspeakers in distributed computing platforms , 2005, IEEE Transactions on Speech and Audio Processing.

[19]  Ted S. Wada,et al.  On Dealing with Sampling Rate Mismatches in Blind Source Separation and Acoustic Echo Cancellation , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[20]  Simon Doclo,et al.  Analysis of rate constraints for MWF-based noise reduction in acoustic sensor networks , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[21]  S. Biyiksiz,et al.  Multirate digital signal processing , 1985, Proceedings of the IEEE.

[22]  Joshua D. Reiss,et al.  Self-Localization of Ad-Hoc Arrays Using Time Difference of Arrivals , 2016, IEEE Transactions on Signal Processing.

[23]  I. Cohen,et al.  Generating nonstationary multisensor signals under a spatial coherence constraint. , 2008, The Journal of the Acoustical Society of America.

[24]  Israel Cohen,et al.  Blind Sampling Rate Offset Estimation and Compensation in Wireless Acoustic Sensor Networks with Application to Beamforming , 2012, IWAENC.

[25]  Heping Ding,et al.  A Region-Growing Permutation Alignment Approach in Frequency-Domain Blind Source Separation of Speech Mixtures , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[26]  Qasim M. Chaudhari A Simple and Robust Clock Synchronization Scheme , 2012, IEEE Transactions on Communications.

[27]  Marc Moonen,et al.  Distributed Adaptive Node-Specific Signal Estimation in Fully Connected Sensor Networks—Part I: Sequential Node Updating , 2010, IEEE Transactions on Signal Processing.

[28]  Lin Wang,et al.  Noise Power Spectral Density Estimation Using MaxNSR Blocking Matrix , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[29]  Israel Cohen,et al.  Multichannel Eigenspace Beamforming in a Reverberant Noisy Environment With Multiple Interfering Speech Signals , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[30]  Walter Kellermann,et al.  Synchronization of acoustic sensors for distributed ad-hoc audio networks and its use for blind source separation , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[31]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[32]  Marc Moonen,et al.  Distributed Node-Specific LCMV Beamforming in Wireless Sensor Networks , 2012, IEEE Transactions on Signal Processing.

[33]  Lars Erup,et al.  Interpolation in digital modems. II. Implementation and performance , 1993, IEEE Trans. Commun..

[34]  Joshua D. Reiss,et al.  Audio Fingerprinting for Multi-Device Self-Localization , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[35]  Richard Heusdens,et al.  Distributed MVDR Beamforming for (Wireless) Microphone Networks Using Message Passing , 2012, IWAENC.