Minimax Mutual Information Approach for Independent Component Analysis

Minimum output mutual information is regarded as a natural criterion for independent component analysis (ICA) and is used as the performance measure in many ICA algorithms. Two common approaches in information-theoretic ICA algorithms are the minimum mutual information and maximum output entropy approaches. In the former, some form of probability density function (pdf) estimate is substituted into the mutual information expression; in the latter, the source pdf assumption is incorporated into the algorithm through nonlinearities matched to the corresponding cumulative distribution functions (cdfs). Alternative solutions to ICA use higher-order cumulant-based optimization criteria, which are related to one or the other of these approaches through truncated series approximations of the densities. In this article, we propose a new ICA algorithm motivated by the maximum entropy principle for estimating signal distributions. The optimality criterion is the minimum output mutual information, where the estimated pdfs are from the exponential family and are approximate solutions to a constrained entropy maximization problem. This approach yields an upper bound for the actual mutual information of the output signals; hence the name minimax mutual information ICA algorithm. In addition, we demonstrate that for a specific selection of the constraint functions in the maximum entropy density estimation procedure, the algorithm relates strongly to ICA methods using higher-order cumulants.
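
The upper-bound claim can be sketched with a short derivation. The notation here is an assumption made for illustration and is not taken from the article: y = Wx is the output of a square, invertible demixing matrix W, the f_k are the chosen constraint functions, and alpha_k are their expected values under the true marginal of y_i. The inequality holds exactly only when the true marginal satisfies the constraints; with sample estimates of alpha_k it holds approximately.

```latex
% Output mutual information for y = W x (W square and invertible):
I(\mathbf{y}) = \sum_{i=1}^{n} H(y_i) - H(\mathbf{y})
              = \sum_{i=1}^{n} H(y_i) - H(\mathbf{x}) - \log\lvert\det W\rvert .

% Maximum entropy marginal under the constraints E[f_k(y_i)] = \alpha_k,
% k = 1, \dots, m (an exponential-family density):
\hat{p}_i(y) = \exp\!\Big(\lambda_0 + \sum_{k=1}^{m} \lambda_k f_k(y)\Big),
\qquad
\hat{H}(y_i) = -\lambda_0 - \sum_{k=1}^{m} \lambda_k \alpha_k \;\ge\; H(y_i) .

% Substituting the maximum entropy marginals bounds the true quantity from above:
\hat{I}(\mathbf{y}) = \sum_{i=1}^{n} \hat{H}(y_i) - H(\mathbf{x}) - \log\lvert\det W\rvert
\;\ge\; I(\mathbf{y}) ,

% so minimizing over W is a minimax problem: the inner maximization runs over
% each marginal density satisfying the constraints, the outer minimization over W.
\min_{W} \; \hat{I}(\mathbf{y}) .
```

Under these assumptions, H(x) does not depend on W, so only the marginal entropy bounds and the log-determinant term affect the minimization.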
