Supervised training of adaptive systems with partially labeled data

Supervised adaptive system training is traditionally performed with available pairs of input-output data and the system weights are fixed following this training procedure. Recently, in the context of machine learning, where the desired outputs are discrete-valued, the idea of exploiting unlabeled samples for improving classification performance has been proposed. We introduce an information theoretic framework based on density divergence minimization to obtain extended training algorithms. Our goal is to provide a theoretical framework upon which we can build efficient algorithms to this end.

[1]  Deniz Erdoğmuş INFORMATION THEORETIC LEARNING: RENYI'S ENTROPY AND ITS APPLICATIONS TO ADAPTIVE SYSTEM TRAINING , 2002 .

[2]  Alfréd Rényi,et al.  Probability Theory , 1970 .

[3]  E. Jaynes Information Theory and Statistical Mechanics , 1957 .

[4]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[5]  Deniz Erdogmus,et al.  An error-entropy minimization algorithm for supervised training of nonlinear adaptive systems , 2002, IEEE Trans. Signal Process..

[6]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[7]  Deniz Erdoğmuş,et al.  Online entropy manipulation: stochastic information gradient , 2003, IEEE Signal Processing Letters.

[8]  Bernard Widrow,et al.  Adaptive Signal Processing , 1985 .

[9]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[10]  Luc Devroye,et al.  Combinatorial methods in density estimation , 2001, Springer series in statistics.

[11]  Deniz Erdogmus,et al.  Minimax Mutual Information Approach for Independent Component Analysis , 2004, Neural Computation.

[12]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[13]  Mikhail Belkin,et al.  Manifold Regularization : A Geometric Framework for Learning from Examples , 2004 .

[14]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[15]  L. Györfi,et al.  Nonparametric entropy estimation. An overview , 1997 .

[16]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[17]  Ali H. Sayed,et al.  Fundamentals Of Adaptive Filtering , 2003 .