Time Delay Estimation via Minimum Entropy

Time delay estimation (TDE) is a basic technique for numerous applications where there is a need to localize and track a radiating source. The most important TDE algorithms for two sensors are based on the generalized cross-correlation (GCC) method. These algorithms perform reasonably well when reverberation or noise is not too high. In an earlier study by the authors, a more sophisticated approach was proposed. It employs more sensors and takes advantage of their delay redundancy to improve the precision of the time difference of arrival (TDOA) estimate between the first two sensors. The approach is based on the multichannel cross-correlation coefficient (MCCC) and was found more robust to noise and reverberation. In this letter, we show that this approach can also be developed on a basis of joint entropy. For Gaussian signals, we show that, in the search of the TDOA estimate, maximizing MCCC is equivalent to minimizing joint entropy. However, with the generalization of the idea to non-Gaussian signals (e.g., speech), the joint entropy-based new TDE algorithm manifests a potential to outperform the MCCC-based method

[1]  Te-Won Lee,et al.  On the multivariate Laplace distribution , 2006, IEEE Signal Processing Letters.

[2]  Jacob Benesty,et al.  Time-delay estimation via linear interpolation and cross correlation , 2004, IEEE Transactions on Speech and Audio Processing.

[3]  H. Gish,et al.  Generalized coherence (signal detection) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[4]  G. Carter,et al.  The generalized correlation method for estimation of time delay , 1976 .

[5]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[6]  Samuel Kotz,et al.  Asymmetric Multivariate Laplace Distribution , 2001 .

[7]  Jacob Benesty,et al.  Robust time delay estimation exploiting redundancy among multiple microphones , 2003, IEEE Trans. Speech Audio Process..

[8]  J. Ianniello,et al.  Time delay estimation via cross-correlation in the presence of large estimation errors , 1982 .

[9]  S. Kotz,et al.  AN ASYMMETRIC MULTIVARIATE LAPLACE DISTRIBUTION , 2022 .

[10]  S. Gazor,et al.  Speech probability distribution , 2003, IEEE Signal Processing Letters.

[11]  Benoît Champagne,et al.  Performance of time-delay estimation in the presence of room reverberation , 1996, IEEE Trans. Speech Audio Process..

[12]  I. Kojadinovic On the use of mutual information in data analysis : an overview , 2005 .

[13]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[14]  Arun Ross,et al.  Microphone Arrays , 2009, Encyclopedia of Biometrics.

[15]  Jae S. Lim,et al.  Speech enhancement , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  Panayiotis G. Georgiou,et al.  Alpha-Stable Modeling of Noise and Robust Time-Delay Estimation in the Presence of Impulsive Noise , 1999, IEEE Trans. Multim..

[17]  Jacob Benesty,et al.  Microphone arrays for video camera steering , 2000 .

[18]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[19]  Herbert Gish,et al.  A geometric approach to multiple-channel signal detection , 1995, IEEE Trans. Signal Process..

[20]  Hong Wang,et al.  Voice source localization for automatic camera pointing system in videoconferencing , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[21]  C. L. Nikias,et al.  Signal processing with fractional lower order moments: stable processes and their applications , 1993, Proc. IEEE.