Dually Optimal Neuronal Layers: Lobe Component Analysis

Development imposes great challenges. Internal ldquocorticalrdquorepresentations must be autonomously generated from interactive experiences. The eventual quality of these developed representations is of course important. Additionally, learning must be as fast as possible-to quickly derive better representation from limited experiences. Those who achieve both of these will have competitive advantages. We present a cortex-inspired theory called lobe component analysis (LCA) guided by the aforementioned dual criteria. A lobe component represents a high concentration of probability density of the neuronal input space. We explain how lobe components can achieve a dual-spatiotemporal (ldquobestrdquo and ldquofastestrdquo)-optimality, through mathematical analysis, in which we describe how lobe components plasticity can be temporally scheduled to take into account the history of observations in the best possible way. This contrasts with using only the last observation in gradient-based adaptive learning algorithms. Since they are based on two cell-centered mechanisms-Hebbian learning and lateral inhibition-lobe components develop in-place, meaning every networked neuron is individually responsible for the learning of its signal-processing characteristics within its connected network environment. There is no need for a separate learning network. We argue that in-place learning algorithms will be crucial for real-world large-size developmental applications due to their simplicity, low computational complexity, and generality. Our experimental results show that the learning speed of the LCA algorithm is drastically faster than other Hebbian-based updating methods and independent component analysis algorithms, thanks to its dual optimality, and it does not need to use any second- or higher order statistics. We also introduce the new principle of fast learning from stable representation.

[1]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[2]  Erkki Oja,et al.  An Experimental Comparison of Neural ICA Algorithms , 1998 .

[3]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[4]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[5]  F. Werblin,et al.  Requirement for Cholinergic Synaptic Transmission in the Propagation of Spontaneous Retinal Waves , 1996, Science.

[6]  Juha Karhunen,et al.  Blind source separation using least-squares type adaptive algorithms , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  K. Martin,et al.  The Cell Biology of Synaptic Plasticity , 2011, Science.

[9]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[10]  Risto Miikkulainen,et al.  Computational Maps in the Visual Cortex , 2005 .

[11]  Juha Karhunen,et al.  Least-Squares Methods for Blind Source Separation Based on Nonlinear PCA , 1997, Int. J. Neural Syst..

[12]  S. Grossberg,et al.  Contrast-sensitive perceptual grouping and object-based attention in the laminar circuits of primary visual cortex , 2000, Vision Research.

[13]  H. Saunders,et al.  Probability, Random Variables and Stochastic Processes (2nd Edition) , 1989 .

[14]  Peter Dayan,et al.  Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems , 2001 .

[15]  E. Callaway,et al.  Contributions of individual layer 6 pyramidal neurons to local circuitry in macaque primary visual cortex , 1996, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[16]  Eero P. Simoncelli,et al.  Natural image statistics and neural representation. , 2001, Annual review of neuroscience.

[17]  Rama Chellappa,et al.  Discriminant Analysis for Recognition of Human Face Images (Invited Paper) , 1997, AVBPA.

[18]  E. L. Lehmann,et al.  Theory of point estimation , 1950 .

[19]  D. Chakrabarti,et al.  A fast fixed - point algorithm for independent component analysis , 1997 .

[20]  S. Grossberg,et al.  Adaptive pattern classification and universal recoding: I. Parallel development and coding of neural feature detectors , 1976, Biological Cybernetics.

[21]  L. C. Katz,et al.  Development of cortical circuits: Lessons from ocular dominance columns , 2002, Nature Reviews Neuroscience.

[22]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[23]  W. K. Purves Life: The Science of Biology , 1985 .

[24]  Juyang Weng,et al.  Optimal In-Place Learning and the Lobe Component Analysis , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[25]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[26]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .

[27]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[28]  Juyang Weng,et al.  Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Juyang Weng,et al.  Motor initiated expectation through top-down connections as abstract context in a physical world , 2008, 2008 7th IEEE International Conference on Development and Learning.

[30]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[31]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[32]  E. Callaway Local circuits in primary visual cortex of the macaque monkey. , 1998, Annual review of neuroscience.

[33]  Terrence J. Sejnowski,et al.  Independent Component Analysis Using an Extended Infomax Algorithm for Mixed Subgaussian and Supergaussian Sources , 1999, Neural Computation.

[34]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[35]  T. Ens,et al.  Blind signal separation : statistical principles , 1998 .

[36]  Juyang Weng,et al.  Autonomous Mental Development: Workshop on Development and Learning (WDL) , 2002, AI Mag..

[37]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[38]  Juyang Weng,et al.  A Multilayer in-Place Learning Network for Development of General Invariances , 2007, Int. J. Humanoid Robotics.

[39]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[40]  Peter J. Huber,et al.  Robust Statistics , 2005, Wiley Series in Probability and Statistics.

[41]  Juyang Weng,et al.  2008 Special issue , 2008 .

[42]  Stephen Grossberg,et al.  A massively parallel architecture for a self-organizing neural pattern recognition machine , 1988, Comput. Vis. Graph. Image Process..

[43]  Juyang Weng,et al.  Topographic Class Grouping with applications to 3D object recognition , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[44]  Thomas S. Huang,et al.  Motion and Structure from Image Sequences , 1992 .

[45]  Juyang Weng,et al.  Candid Covariance-Free Incremental Principal Component Analysis , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  H. Gundersen,et al.  Total regional and global number of synapses in the human brain neocortex , 2001, Synapse.

[47]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[48]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[49]  Terrence J. Sejnowski,et al.  Independent Component Analysis Using an Extended Infomax Algorithm for Mixed Sub-Gaussian and Super-Gaussian Sources , 1999, Neural Comput..

[50]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[51]  J. Elman,et al.  Rethinking Innateness: A Connectionist Perspective on Development , 1996 .

[52]  Erkki Oja,et al.  The nonlinear PCA criterion in blind source separation: Relations with other approaches , 1998, Neurocomputing.