Yli-Krekola A bio-inspired computational model of covert attention and learning

The cerebral cortex of the mammalian brain is responsible for such things as processing sensory information, imagination, working memory and consciousness. The key functions of the cortex that help improve the behaviour of the animal can be thought to be learning regularities from the world, attending to behaviourally relevant targets and simulating a world model for the purpose of planning. This thesis builds a system-level computational model of learning and attention in the cortex. The model is inspired by the real cortex and it is intended to be later extended to include all the important functions in the cortex. The cortex learns a hierarchy of increasingly abstract representations of the world. The high levels can recognise complex objects and dynamical events and represent them with simple codes. Many models have been proposed for this kind of cortical learning. However , most of the models lack attention. The real cortex does not try to perceive everything simultaneously. Instead, it attends to those features in the world that are useful for the behaviour of the animal. Attention is also known to guide learning, which results in allocating the representational capacity in important features. There are computational models which have cortex-like attention, but so far, learning and attention have not been combined. In the model developed in this thesis, learning and attention support each other. Attention is not a distinct process which is controlled externally, but it emerges in the same network where learning and perception take place. Experiments with this model show that it is able to learn a hierarchy of increasingly invariant representations for objects and give birth to selective attention simultaneously. In the future, this model could be embodied in a controller of an autonomous robot.

[1]  H. Bastian Sensation and Perception.—I , 1869, Nature.

[2]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[3]  J. Deutsch Perception and Communication , 1958, Nature.

[4]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[5]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[6]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[7]  M. Posner,et al.  Attention and the detection of signals. , 1980, Journal of experimental psychology.

[8]  W. Singer,et al.  The effects of early visual experience on the cat's visual cortex and their possible explanation by Hebb synapses. , 1981, The Journal of physiology.

[9]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[10]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[11]  M. Alexander,et al.  Principles of Neural Science , 1981 .

[12]  Stephen Grossberg,et al.  A massively parallel architecture for a self-organizing neural pattern recognition machine , 1988, Comput. Vis. Graph. Image Process..

[13]  Duane DeSieno,et al.  Adding a conscience to competitive learning , 1988, IEEE 1988 International Conference on Neural Networks.

[14]  J. Duncan,et al.  Visual search and stimulus similarity. , 1989, Psychological review.

[15]  Daniel J. Amit,et al.  Modeling brain function: the world of attractor neural networks, 1st Edition , 1989 .

[16]  Peter Földiák,et al.  Learning Invariance from Transformation Sequences , 1991, Neural Comput..

[17]  R. Sun On Variable Binding in Connectionist Networks , 1992 .

[18]  Geoffrey E. Hinton,et al.  Self-organizing neural network that discovers surfaces in random-dot stereograms , 1992, Nature.

[19]  M. Kawato,et al.  The cerebellum and VOR/OKR learning models , 1992, Trends in Neurosciences.

[20]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[21]  S. Hochstein,et al.  Attentional control of early perceptual learning. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[22]  G. Wallis,et al.  Learning invariant responses to the natural transformations of objects , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[23]  L. Shastri,et al.  From simple associations to systematic reasoning: A connectionist representation of rules, variables and dynamic bindings using temporal synchrony , 1993, Behavioral and Brain Sciences.

[24]  C D Gilbert,et al.  Early perceptual learning. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Hasselmo Neuromodulation and cortical function: modeling the physiological basis of behavior , 1995, Behavioural Brain Research.

[26]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[27]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[28]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[29]  Geoffrey E. Hinton,et al.  The Helmholtz Machine , 1995, Neural Computation.

[30]  Jzau-Sheng Lin,et al.  The application of competitive Hopfield neural network to medical image segmentation , 1996, IEEE Trans. Medical Imaging.

[31]  C. Gilbert,et al.  Spatial integration and cortical dynamics. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Geoffrey E. Hinton,et al.  Varieties of Helmholtz Machine , 1996, Neural Networks.

[33]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[34]  E. Niebur,et al.  Modeling the Temporal Dynamics of IT Neurons in Visual Search: A Mechanism for Top-Down Selective Attention , 1996, Journal of Cognitive Neuroscience.

[35]  V. Mountcastle The columnar organization of the neocortex. , 1997, Brain : a journal of neurology.

[36]  D. Chakrabarti,et al.  A fast fixed - point algorithm for independent component analysis , 1997 .

[37]  Rajesh P. N. Rao,et al.  Dynamic Model of Visual Recognition Predicts Neural Response Properties in the Visual Cortex , 1997, Neural Computation.

[38]  Peter Földiák,et al.  SPARSE CODING IN THE PRIMATE CORTEX , 2002 .

[39]  R. Desimone,et al.  Competitive Mechanisms Subserve Attention in Macaque Areas V2 and V4 , 1999, The Journal of Neuroscience.

[40]  Kenji Doya,et al.  What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? , 1999, Neural Networks.

[41]  Marc-Oliver Gewaltig,et al.  A model of computation in neocortical architecture , 1999, Neural Networks.

[42]  Rajesh P. N. Rao,et al.  Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. , 1999 .

[43]  D. Durstewitz,et al.  A Neurocomputational Theory of the Dopaminergic Modulation of Working Memory Functions , 1999, The Journal of Neuroscience.

[44]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[45]  E. Miller,et al.  THE PREFRONTAL CORTEX AND COGNITIVE CONTROL , 2000 .

[46]  C. Balkenius Attention, habituation and conditioning: toward a computational model , 2000 .

[47]  Konrad P. Körding,et al.  Neurons with Two Sites of Synaptic Integration Learn Invariant Representations , 2001, Neural Computation.

[48]  Stephen Grossberg,et al.  Linking the laminar circuits of visual cortex to visual perception: Development, grouping, and attention , 2001, Neuroscience & Biobehavioral Reviews.

[49]  Gustavo Deco,et al.  Biased Competition Mechanisms for Visual Attention in a Multimodular Neurodynamical System , 2001, Emergent Neural Computational Architectures Based on Neuroscience.

[50]  Leslie G. Ungerleider,et al.  The neural basis of biased competition in human visual cortex , 2001, Neuropsychologia.

[51]  C. Gilbert,et al.  The Neural Basis of Perceptual Learning , 2001, Neuron.

[52]  J. Kruschke Toward a unified model of attention in associative learning , 2001 .

[53]  M. Sur,et al.  Development and plasticity of cortical areas and networks , 2001, Nature Reviews Neuroscience.

[54]  J. B. Levitt,et al.  Circuits for Local and Global Signal Integration in Primary Visual Cortex , 2002, The Journal of Neuroscience.

[55]  Terrence J. Sejnowski,et al.  Slow Feature Analysis: Unsupervised Learning of Invariances , 2002, Neural Computation.

[56]  R. Silver,et al.  Shunting Inhibition Modulates Neuronal Gain during Synaptic Excitation , 2003, Neuron.

[57]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[58]  R. Desimone,et al.  Interacting Roles of Attention and Visual Salience in V4 , 2003, Neuron.

[59]  S. Grossberg,et al.  Towards a theory of the laminar architecture of cerebral cortex: computational clues from the visual system. , 2003, Cerebral cortex.

[60]  A. Thomson,et al.  Interlaminar connections in the neocortex. , 2003, Cerebral cortex.

[61]  Peter Dayan,et al.  Inference, Attention, and Decision in a Bayesian Neural Architecture , 2004, NIPS.

[62]  E. Rolls,et al.  A Neurodynamical cortical model of visual attention and invariant object recognition , 2004, Vision Research.

[63]  R. Douglas,et al.  Neuronal circuits of the neocortex. , 2004, Annual review of neuroscience.

[64]  E. B. Baum,et al.  Internal representations for associative memory , 1988, Biological Cybernetics.

[65]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[66]  Otto D. Creutzfeldt,et al.  Generality of the functional structure of the neocortex , 1977, Naturwissenschaften.

[67]  P. Földiák,et al.  Forming sparse representations by local anti-Hebbian learning , 1990, Biological Cybernetics.

[68]  Harri Valpola,et al.  Behaviourally meaningful representations from normalisation and context-guided denoising , 2004 .

[69]  Scott O. Murray,et al.  Perceptual grouping and the interactions between visual cortical areas , 2004, Neural Networks.

[70]  Pietro Perona,et al.  Selective visual attention enables learning and recognition of multiple objects in cluttered scenes , 2005, Comput. Vis. Image Underst..

[71]  G. Striedter Principles of brain evolution. , 2005 .

[72]  Aapo Hyvärinen,et al.  Statistical models of images and early vision , 2005 .

[73]  Harri Valpola,et al.  Denoising Source Separation , 2005, J. Mach. Learn. Res..

[74]  Karl J. Friston,et al.  A theory of cortical responses , 2005, Philosophical Transactions of the Royal Society B: Biological Sciences.

[75]  Michael S. Falconbridge,et al.  A Simple Hebbian/Anti-Hebbian Network Learns the Sparse, Independent Components of Natural Images , 2006, Neural Computation.

[76]  M. Hasselmo The role of acetylcholine in learning and memory , 2006, Current Opinion in Neurobiology.

[77]  J. Buhle,et al.  Typologies of attentional networks , 2006, Nature Reviews Neuroscience.

[78]  John H. R. Maunsell,et al.  Feature-based attention in visual cortex , 2006, Trends in Neurosciences.

[79]  Michael J. Frank,et al.  Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia , 2006, Neural Computation.

[80]  Harri Valpola,et al.  Separation of Nonlinear Image Mixtures by Denoising Source Separation , 2006, ICA.

[81]  Paul H. E. Tiesinga Stimulus competition by inhibitory interference , 2006, Neurocomputing.

[82]  M. Shanahan A cognitive architecture that combines internal simulation with a global workspace , 2006, Consciousness and Cognition.

[83]  M. V. Velzen,et al.  Self-organizing maps , 2007 .

[84]  J. Pearce Thalamus , 2007, European Neurology.

[85]  Pierre Baldi,et al.  Bayesian surprise attracts human attention , 2005, Vision Research.