Blind sampling rate offset estimation based on coherence drift in wireless acoustic sensor networks

In this paper, a new approach for sampling rate offset (SRO) estimation between nodes of a wireless acoustic sensor network (WASN) is proposed using the phase drift of the coherence function between the signals. This method, referred to as least squares coherence drift (LCD) estimation, assumes that the SRO induces a linearly increasing phase-shift in the short-time Fourier transform (STFT) domain. This phase-shift, observed as a drift in the phase of the signal coherence, is applied in a least-squares estimation framework to estimate the SRO. Simulation results in different scenarios show that the LCD estimation approach can estimate the SRO with a mean absolute error of around 1%. We finally demonstrate that the use of the LCD estimation within a compensation approach eliminates the performance-loss due to SRO in a multichannel Wiener filter (MWF)-based speech enhancement task.

[1]  Kay Römer,et al.  Wireless sensor networks: a new regime for time synchronization , 2003, CCRV.

[2]  Marc Moonen,et al.  Spatially pre-processed speech distortion weighted multi-channel Wiener filtering for noise reduction , 2003, Signal Process..

[3]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[4]  Sharon Gannot,et al.  Blind synchronization in wireless sensor networks with application to speech enhancement , 2014, 2014 14th International Workshop on Acoustic Signal Enhancement (IWAENC).

[5]  Shoji Makino,et al.  Blind compensation of inter-channel sampling frequency mismatch with maximum likelihood estimation in STFT domain , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  Walter Kellermann,et al.  Synchronization of acoustic sensors for distributed ad-hoc audio networks and its use for blind source separation , 2004, IEEE Sixth International Symposium on Multimedia Software Engineering.

[7]  James R. Glass,et al.  Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Peter Vary,et al.  Adaptive Sampling Rate Correction for Acoustic Echo Control in Voice-Over-IP , 2010, IEEE Transactions on Signal Processing.

[9]  Israel Cohen,et al.  Blind Sampling Rate Offset Estimation and Compensation in Wireless Acoustic Sensor Networks with Application to Beamforming , 2012, IWAENC.

[10]  S. Soli,et al.  Development of the Hearing in Noise Test for the measurement of speech reception thresholds in quiet and in noise. , 1994, The Journal of the Acoustical Society of America.