Hinton (in press) recently proposed a learning algorithm called contrastive divergence learning for a class of probabilistic models called product of experts (PoE). Whereas in standard mixture models the “beliefs” of individual experts are averaged, in PoEs the “beliefs” are multiplied together and then renormalized. One advantage of this approach is that the combined beliefs can be much sharper than the individual beliefs of each expert. It has been shown that a restricted version of the Boltzmann machine, in which there are no lateral connections between hidden units or between observation units, is a PoE. In this paper we generalize these results to diffusion networks, a continuous-time, continuous-state version of the Boltzmann machine. We show that when the unit activation functions are linear, this PoE architecture is equivalent to a factor analyzer. This result suggests novel non-linear generalizations of factor analysis and independent component analysis that could be implemented using interactive neural circuitry.
[1]
M. Turk,et al.
Eigenfaces for Recognition
,
1991,
Journal of Cognitive Neuroscience.
[2]
Horace Barlow,et al.
What is the computational goal of the neocortex
,
1994
.
[3]
Terrence J. Sejnowski,et al.
An Information-Maximization Approach to Blind Separation and Blind Deconvolution
,
1995,
Neural Computation.
[4]
Hagai Attias,et al.
Independent Factor Analysis
,
1999,
Neural Computation.
[5]
Paul Mineiro,et al.
Partially Observable SDE Models for Image Sequence Recognition Tasks
,
2000,
NIPS.
[6]
Geoffrey E. Hinton.
Training Products of Experts by Minimizing Contrastive Divergence
,
2002,
Neural Computation.