Paper information - Global Boltzmann perceptron network for online learning of conditional distributions

This paper proposes a backpropagation-based feedforward neural network for learning probability distributions of outputs conditioned on inputs, using only incoming input-output samples. The backpropagation procedure is shown to locally minimize the Kullback-Leibler measure in an expected sense. The procedure is enhanced to keep the weights bounded and to explore the search space so that a global minimum can be reached. Weak convergence theory is employed to show that the long-term behavior of the resulting algorithm can be approximated by that of a stochastic differential equation, whose invariant distributions are concentrated around the global minima of the Kullback-Leibler measure within a region of interest. Simulation studies on problems involving samples arriving from a mixture of labeled densities and on the well-known Iris data problem demonstrate the speed and accuracy of the proposed procedure.
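The general idea summarized in the abstract (online gradient updates on a likelihood-based loss, plus random perturbation for exploration and a bound on the weights) can be illustrated with a small sketch. This is not the authors' network or update rule: it assumes a single-layer softmax model instead of a multilayer feedforward network, and the names `online_step`, `lr`, `noise`, and `bound` are hypothetical choices made only for illustration. For a softmax output, a gradient step on the negative log-probability of the observed label reduces, in expectation, the Kullback-Leibler divergence between the true and modeled conditional distributions.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D vector of scores.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def online_step(W, x, y, rng, lr=0.05, noise=0.01, bound=5.0):
    """One online update from a single (x, y) sample (illustrative only)."""
    p = softmax(W @ x)                # model estimate of P(label | x)
    grad = np.outer(p, x)             # gradient of -log p[y] w.r.t. W ...
    grad[y] -= x                      # ... equals (p - onehot(y)) x^T
    W = W - lr * grad                 # stochastic gradient step
    # Assumed exploration term: small Gaussian perturbation of the weights.
    W = W + np.sqrt(lr) * noise * rng.standard_normal(W.shape)
    # Assumed boundedness mechanism: clip weights to [-bound, bound].
    return np.clip(W, -bound, bound)

# Samples arrive one at a time from a mixture of two labeled Gaussians.
rng = np.random.default_rng(0)
means = np.array([[-1.0, 0.0], [1.0, 0.0]])
W = np.zeros((2, 3))                  # 2 classes, 2 features + bias
for _ in range(2000):
    y = int(rng.integers(2))
    x = np.append(means[y] + rng.standard_normal(2), 1.0)
    W = online_step(W, x, y, rng)

# Estimated conditional distribution over labels at the input (1, 0).
probs = softmax(W @ np.array([1.0, 0.0, 1.0]))
print(probs)
```

Clipping and additive Gaussian noise are stand-ins chosen for simplicity; the paper's own mechanisms for boundedness and global exploration may differ.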

Mandayam A. L. Thathachar | M. T. Arvind

[1] Geoffrey E. Hinton, et al. A Learning Algorithm for Boltzmann Machines, 1985, Cogn. Sci.

[2] P. Billingsley, et al. Convergence of Probability Measures, 1969.

[3] Eric B. Baum, et al. Supervised Learning of Probability Distributions by Neural Networks, 1987, NIPS.

[4] Richard O. Duda, et al. Pattern classification and scene analysis, 1974, A Wiley-Interscience publication.

[5] F. Aluffi-Pentini, et al. Global optimization and stochastic differential equations, 1985.

[6] Simon Haykin, et al. Neural Networks: A Comprehensive Foundation, 1998.

[7] Pierre Priouret, et al. Adaptive Algorithms and Stochastic Approximations, 1990, Applications of Mathematics.

[8] Allen Gersho, et al. The Boltzmann Perceptron Network: A soft classifier, 1990, Neural Networks.

[9] Solomon Kullback, et al. Information Theory and Statistics, 1970, The Mathematical Gazette.

[10] Solomon Kullback, et al. Information Theory and Statistics, 1960.

[11] Mandayam A. L. Thathachar, et al. Learning the global maximum with parameterized learning automata, 1995, IEEE Trans. Neural Networks.

[12] J. J. Hopfield, et al. Learning algorithms and probability distributions in feed-forward and feed-back networks, 1987, Proceedings of the National Academy of Sciences of the United States of America.

[13] C. D. Gelatt, et al. Optimization by Simulated Annealing, 1983, Science.

[14] Mandayam A. L. Thathachar, et al. Learning Optimal Discriminant Functions through a Cooperative Game of Automata, 1987, IEEE Transactions on Systems, Man, and Cybernetics.