Nonparametric density estimation and regression achieved with topographic maps maximizing the information-theoretic entropy of their outputs

We introduce an unsupervised competitive learning rule, called the extended Maximum Entropy learning Rule (eMER), for topographic map formation. Unlike Kohonen's Self-Organizing Map (SOM) algorithm, the presence of a neighborhood function is not a prerequisite for achieving topology-preserving mappings, but instead it is intended: (1) to speed up the learning process and (2) to perform nonparametric regression. We show that, when the neighborhood function vanishes, the neural weigh t density at convergence approaches a linear function of the input density so that the map can be regarded as a nonparametric model of the input density. We apply eMER to density estimation and compare its performance with that of the SOM algorithm and the variable kernel method. Finally, we apply the ‘batch’ version of eMER to nonparametric projection pursuit regression and compare its performance with that of back-propagation learning, projection pursuit learning, constrained topolog ical mapping, and the Heskes and Kappen approach.

[1]  Allen Gersho,et al.  Asymptotically optimal block quantization , 1979, IEEE Trans. Inf. Theory.

[2]  Helge J. Ritter,et al.  Neural computation and self-organizing maps - an introduction , 1992, Computation and neural systems series.

[3]  Stephen P. Luttrell Code vector density in topographic mappings: Scalar case , 1991, IEEE Trans. Neural Networks.

[4]  Vladimir Cherkassky,et al.  Self-Organization as an Iterative Kernel Smoothing Process , 1995, Neural Computation.

[5]  J. Friedman,et al.  Projection Pursuit Regression , 1981 .

[6]  Hilbert J. Kappen,et al.  Self-organization and nonparametric regression , 1995 .

[7]  Marc M. Van Hulle,et al.  The Formation of Topographic Maps That Maximize the Average Mutual Information of the Output Responses to Noiseless Input Signals , 1997, Neural Computation.

[8]  Vladimir Cherkassky,et al.  Constrained topological mapping for nonparametric regression analysis , 1991, Neural Networks.

[9]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[10]  J. Friedman,et al.  FLEXIBLE PARSIMONIOUS SMOOTHING AND ADDITIVE MODELING , 1989 .

[11]  Globally-ordered topology-preserving maps achieved with a learning rule performing local weight updates only , 1995, Proceedings of 1995 IEEE Workshop on Neural Networks for Signal Processing.

[12]  L. Breiman,et al.  Variable Kernel Estimates of Multivariate Densities , 1977 .

[13]  Jenq-Neng Hwang,et al.  Regression modeling in back-propagation and projection pursuit learning , 1994, IEEE Trans. Neural Networks.

[14]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[15]  Trevor Hastie,et al.  Automatic Smoothing Spline Projection Pursuit , 1994 .

[16]  H. Ritter Combining self-organizing maps , 1989, International 1989 Joint Conference on Neural Networks.

[17]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[18]  David Zipser,et al.  Feature Discovery by Competive Learning , 1986, Cogn. Sci..

[19]  Ralf Der,et al.  Instabilities in self-organized feature maps with short neighbourhood range , 1994, ESANN.

[20]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..