Paper Information - Behavioral Learning in a Cognitive Neuromorphic Robot: An Integrative Approach

We present a learning system that uses the iCub humanoid robot and the SpiNNaker neuromorphic chip to solve the real-world task of object-specific attention. Integrating spiking neural networks with robots introduces considerable complexity for questionable benefit if the objective is simply task performance. We suggest, however, that in a cognitive robotics context, where the goal is understanding how to compute, such an approach may yield useful insights into neural architecture as well as learned behavior, especially if dedicated neural hardware is available. Recent advances in cognitive robotics and neuromorphic processing now make such systems possible. Using a scalable, structured, modular approach, we build a spiking neural network in which the effects and impact of learning can be predicted and tested, and which can be scaled or extended to new tasks automatically. We introduce several enhancements to a basic network and show how they can be used to direct performance toward behaviorally relevant goals. Results show that, using a simple classical spike-timing-dependent plasticity (STDP) rule on selected connections, the robot (and network) progresses from poor task-specific performance to good performance. Behaviorally relevant STDP appears to contribute strongly to positive learning (“do this”) but less to negative learning (“don’t do that”). In addition, we observe that the effect of structural enhancements tends to be cumulative. Overall, the system suggests that spiking networks achieve compelling, task-relevant behavior by exploiting combinations of effects, rather than any one effect or property in isolation.
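For readers unfamiliar with the "simple classical STDP rule" mentioned in the abstract, the following is a minimal sketch of a textbook pair-based STDP update with exponential timing windows. It is not taken from the paper: the function names (`stdp_weight_change`, `apply_stdp`), the amplitudes, the time constants, the all-to-all spike pairing, and the weight bounds are all illustrative assumptions, and the rule actually run on SpiNNaker may differ in its details.

```python
import math


def stdp_weight_change(delta_t_ms, a_plus=0.01, a_minus=0.012,
                       tau_plus=20.0, tau_minus=20.0):
    """Weight change for one pre/post spike pair.

    delta_t_ms = t_post - t_pre. All parameter values are hypothetical
    defaults chosen for illustration, not values from the paper.
    """
    if delta_t_ms > 0:
        # Pre fires before post: potentiation, decaying with the gap.
        return a_plus * math.exp(-delta_t_ms / tau_plus)
    if delta_t_ms < 0:
        # Post fires before pre: depression, decaying with the gap.
        return -a_minus * math.exp(delta_t_ms / tau_minus)
    return 0.0


def apply_stdp(weight, pre_spikes, post_spikes,
               w_min=0.0, w_max=1.0, **params):
    """Additive update over all pre/post pairs, clipped once at the end."""
    for t_pre in pre_spikes:
        for t_post in post_spikes:
            weight += stdp_weight_change(t_post - t_pre, **params)
    return min(w_max, max(w_min, weight))


if __name__ == "__main__":
    # A presynaptic spike 5 ms before a postsynaptic spike strengthens
    # the synapse; the reverse order weakens it.
    print(apply_stdp(0.5, pre_spikes=[10.0], post_spikes=[15.0]))
    print(apply_stdp(0.5, pre_spikes=[15.0], post_spikes=[10.0]))
```

Under these assumptions, a connection whose presynaptic spikes consistently precede postsynaptic spikes is strengthened, which is one way to read the "do this" style of positive learning described in the abstract.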
