Entropy and Kullback-Leibler divergence estimation based on Szegö's theorem

In this work, a new technique for the estimation of the Shannon's entropy and the Kullback-Leibler (KL) divergence for one dimensional data is presented. The estimator is based on the Szegö's theorem for sequences of Toeplitz matrices, which deals with the asymptotic behavior of the eigenvalues of those matrices, and the analogy between a probability density function (PDF) and a power spectral density (PSD), which allows us to estimate a PDF of bounded support using the well-known spectral estimation techniques. Specifically, an AR model is used for the PDF/PSD estimation, and the entropy is easily estimated as a function of the eigenvalues of the autocorrelation Toeplitz matrix. The performance of the Szegö's estimators is illustrated by means of Monte Carlo simulations and compared with previously proposed alternatives, showing a good performance.

[1]  Jean-François Bercher,et al.  Estimating the entropy of a signal with applications , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[2]  Qing Wang,et al.  Divergence estimation of continuous distributions based on data-dependent partitions , 2005, IEEE Transactions on Information Theory.

[3]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[4]  Robert M. Gray,et al.  Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[5]  Robert M. Gray,et al.  Toeplitz And Circulant Matrices: A Review (Foundations and Trends(R) in Communications and Information Theory) , 2006 .

[6]  Pedro M. Crespo,et al.  Asymptotically Equivalent Sequences of Matrices and Hermitian Block Toeplitz Matrices With Continuous Symbols: Applications to MIMO Systems , 2008, IEEE Transactions on Information Theory.

[7]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[8]  U. Grenander,et al.  Toeplitz Forms And Their Applications , 1958 .

[9]  Petre Stoica,et al.  Spectral Analysis of Signals , 2009 .

[10]  Fernando Pérez-Cruz,et al.  Kullback-Leibler divergence estimation of continuous distributions , 2008, 2008 IEEE International Symposium on Information Theory.

[11]  Marcelino Lázaro,et al.  Blind restoration of binary signals using a line spectrum fitting approach , 2004, 2004 12th European Signal Processing Conference.

[12]  Oldrich A Vasicek,et al.  A Test for Normality Based on Sample Entropy , 1976 .

[13]  S. Kay Model-based probability density function estimation , 1998, IEEE Signal Processing Letters.