论文信息 - The distribution of rewards in sensorimotor maps acquired by cognitive robots through exploration - 字舞流文

The distribution of rewards in sensorimotor maps acquired by cognitive robots through exploration

To exhibit intelligent behavior, cognitive robots must have some knowledge about the consequences of their actions and their value in the context of the goal being realized. We present a neural framework using which explorative sensorimotor experiences of cognitive robots can be efficiently 'internalized' using growing sensorimotor maps and planning realized using goal induced quasi-stationary value fields. Further, if there are no predefined reward functions (or the case when they are not good enough in a slightly modified world), the robot will have to try and realize its goal by exploration after which reward/penalty is given at the end. This paper proposes three simple rules for distribution of the received end reward among the contributing neurons in a high dimensional sensorimotor map. Importantly, reward/penalty distribution over hundreds of neurons in the sensorimotor map is computed one shot. This resulting reward distribution can be visualized as an additional value field, representing the new learnt experience and can be combined with other such fields in a context dependent fashion to plan/compose novel emergent behavior. The simplicity and efficiency of the approach is illustrated through the resulting behaviors of the GNOSYS robot in two different scenarios (a) learning 'when' to optimize 'what constraint' while realizing spatial goals and (b) learning to push a ball intelligently to the corners of a table, while avoiding traps randomly placed by the teacher (this scenario replicates the famous trap tube paradigm from animal reasoning carried out on chimpanzees, capuchins and infants).

Giorgio Metta | Pietro G. Morasso | Stathis Kasderidis | Vishwanathan Mohan | P. Morasso | G. Metta | S. Kasderidis | Vishwanathan Mohan

[1] Marc Toussaint,et al. A Sensorimotor Map: Modulating Lateral Interactions for Anticipation and Planning , 2006, Neural Computation.

[2] A. Damasio. The Feeling of What Happens: Body and Emotion in the Making of Consciousness , 1999 .

[3] M. Zak. Terminal attractors for addressable memory in neural networks , 1988 .

[4] Joshua G. Hale,et al. Using Humanoid Robots to Study Human Behavior , 2000, IEEE Intell. Syst..

[5] Tetsuya Ogata,et al. Experience Based Imitation Using RNNPB , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6] G. Hesslow. Conscious thought as simulation of behaviour and perception , 2002, Trends in Cognitive Sciences.

[7] N. Kanwisher,et al. Mental Imagery of Faces and Places Activates Corresponding Stimulus-Specific Brain Regions , 2000, Journal of Cognitive Neuroscience.

[8] S. Kosslyn,et al. Visual Mental Imagery Activates Topographically Organized Visual Cortex: PET Investigations , 1993, Journal of Cognitive Neuroscience.

[9] P. Morasso,et al. Kinematic networks , 1988, Biological Cybernetics.

[10] A. Damasio. The feeling of what happens , 2001 .

[11] G. Schöner,et al. Dynamic Field Theory of Movement Preparation , 2022 .

[12] Richard S. Sutton,et al. Dimensions of Reinforcement Learning , 1998 .

[13] Giulio Sandini,et al. A biomimetic, force-field based computational model for motion planning and bimanual coordination in humanoid robots , 2009, Auton. Robots.

[14] A. Glenberg,et al. What memory is for: Creating meaning in the service of action , 1997, Behavioral and Brain Sciences.

[15] Michael Wooldridge,et al. The Belief-Desire-Intention Model of Agency , 1998, ATAL.

[16] Hiroshi Imamizu,et al. Human cerebellar activity reflecting an acquired internal model of a new tool , 2000, Nature.

[17] G. Rizzolatti,et al. Neurophysiological mechanisms underlying the understanding and imitation of action , 2001, Nature Reviews Neuroscience.

[18] Gerald M. Edelman,et al. A Universe of Consciousness , 2000 .

[19] Parag A. Pathak,et al. Massachusetts Institute of Technology , 1964, Nature.

[20] J. O'Keefe,et al. The hippocampus as a spatial map. Preliminary evidence from unit activity in the freely-moving rat. , 1971, Brain research.

[21] Marc Toussaint,et al. Learning a World Model and Planning with a Self-Organizing, Dynamic Neural System , 2003, NIPS.

[22] Douglas Hofstadter,et al. The Copycat Project: An Experiment in Nondeterminism and Creative Analogies , 1984 .

[23] Pietro G. Morasso,et al. Neural Network of a Cognitive Crow: An interacting map based architecture , 2007, First International Conference on Self-Adaptive and Self-Organizing Systems (SASO 2007).

[24] E. Vergaro,et al. Consciousness as the Emergent Property of the Interaction Between Brain, Body, and Environment , 2010 .

[25] Bernd Fritzke,et al. A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[26] Giulio Sandini,et al. Sensorimotor coordination in a "baby" robot: learning about objects through grasping. , 2007, Progress in brain research.

[27] J. Tenenbaum,et al. Special issue on “Probabilistic models of cognition , 2022 .

[28] Tj Sejnowski,et al. Neural Codes and Distributed Representations , 1999 .

[29] Germund Hesslow,et al. The inner world of a simple robot , 2007 .

[30] Ron Sun,et al. The importance of cognitive architectures: an analysis based on CLARION , 2007, J. Exp. Theor. Artif. Intell..

[31] Friedemann Pulvermüller,et al. The Neuroscience of Language: On Brain Circuits of Words and Serial Order , 2003 .

[32] M S Gazzaniga,et al. Cerebrally Lateralized Mental Representations of Hand Shape and Movement , 1998, The Journal of Neuroscience.

[33] Toshio Tsuji,et al. Motion Planning for Manipulators Using Artificial Potential Field Approach that can Adjust Convergence Time of Generated Arm Trajectory , 1995 .

[34] M. Tomasello,et al. Primate causal understanding in the physical and psychological domains , 1998, Behavioural Processes.

[35] V. ohan,et al. Neural Network of a Cognitive Crow: An interacting map based architecture , 2007, First International Conference on Self-Adaptive and Self-Organizing Systems (SASO 2007).

[36] Stephen Grossberg,et al. SOVEREIGN: An autonomous neural system for incrementally learning planned action sequences to navigate towards a rewarded goal , 2008, Neural Networks.

[37] G. Lakoff,et al. The Brain's concepts: the role of the Sensory-motor system in conceptual knowledge , 2005, Cognitive neuropsychology.

[38] 日本ロボット学会. 日本ロボット学会誌 = Journal of the Robotics Society of Japan , 1983 .

[39] H. Maturana,et al. Autopoiesis: the organization of living systems, its characterization and a model. , 1974, Currents in modern biology.

[40] J. Feldman,et al. Embodied meaning in a neural theory of language , 2004, Brain and Language.

[41] Andrew G. Barto,et al. Reinforcement learning , 1998 .

[42] J. Pearl. Graphs, Causality, and Structural Equation Models , 1998 .

[43] Rodney A. Brooks. The Cog Project , 1997 .

[44] M. Toussaint. A Sensorimotor Map: Modulating Lateral Interactions for Anticipation and Planning , 2006 .

[45] A. Clark. Being There: Putting Brain, Body, and World Together Again , 1996 .

[46] Michael I. Jordan,et al. An internal model for sensorimotor integration. , 1995, Science.

[47] Teuvo Kohonen,et al. Self-Organizing Maps , 2010 .

[48] Allen Newell,et al. Computer science as empirical inquiry: symbols and search (1976) , 1989 .

[49] Nikolaos G. Bourbakis,et al. Neural Network Approach for Image chromatic Adaptation for Skin Color Detection , 2007, Int. J. Neural Syst..

[50] Mitsuo Kawato,et al. Internal models for motor control and trajectory planning , 1999, Current Opinion in Neurobiology.

[51] D Le Bihan,et al. Activation of human primary visual cortex during visual recall: a magnetic resonance imaging study. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[52] A. Kacelnik,et al. Shaping of Hooks in New Caledonian Crows , 2002, Science.

[53] Nicola S. Clayton,et al. The Mentality of Crows: Convergent Evolution of Intelligence in Corvids and Apes , 2004, Science.

[54] J. Fuster. Cortex and mind : unifying cognition , 2003 .

[55] Murray Shanahan,et al. Perception as Abduction: Turning Sensor Data Into Meaningful Representation , 2005, Cogn. Sci..

[56] D M Wolpert,et al. Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[57] R. Zatorre,et al. Effect of unilateral temporal-lobe excision on perception and imagery of songs , 1993, Neuropsychologia.

[58] Hector Geffner,et al. Default reasoning - causal and conditional theories , 1992 .

[59] B Fritzke,et al. A growing neural gas network learns topologies. G. Tesauro, DS Touretzky, and TK Leen, editors , 1995, NIPS 1995.

[60] Alexander A. Petrov,et al. Integration of Memory and Reasoning in Analogy-Making: The AMBR Model , 2000 .

[61] Pietro G. Morasso,et al. Self-Organization, Computational Maps, and Motor Control , 1997 .

[62] Rick Grush,et al. The emulation theory of representation: Motor control, imagery, and perception , 2004, Behavioral and Brain Sciences.

[63] R. Sun. Symbol Grounding: A New Look At An Old Idea , 1999 .

[64] Χαράλαμπος Μπαλτζάκης,et al. A hybrid framework for mobile robot navigation: modelling with switching state space networks , 2004 .

[65] Kim A. Bard,et al. Reaching into Thought: The Minds of the Great Apes , 1999 .

[66] R. Milo,et al. Tools, language and cognition in human evolution , 1995, International Journal of Primatology.

[67] H. T. Blair,et al. Role of the Lateral Mammillary Nucleus in the Rat Head Direction Circuit A Combined Single Unit Recording and Lesion Study , 1998, Neuron.

[68] Rodney A. Brooks,et al. A Robust Layered Control Syste For A Mobile Robot , 2022 .

[69] W. K. Yeap. On Symbol Grounding , 1993 .

[70] Owen Holland,et al. Robots With Internal Models A Route to Machine Consciousness , 2003 .

[71] D. Wolpert,et al. Mental state inference using visual control parameters. , 2005, Brain research. Cognitive brain research.

[72] Thomas Metzinger,et al. Motor ontology: the representational reality of goals, actions and selves , 2003 .

[73] Pietro G. Morasso,et al. A Forward / Inverse Motor Controller for Cognitive Robotics , 2006, ICANN.

[74] E. Visalberghi,et al. Acting and understanding: Tool use revisited through the minds of capuchin monkeys , 1996 .

[75] G. Edelman,et al. A Universe Of Consciousness: How Matter Becomes Imagination , 2000 .

[76] S. Amari. Dynamics of pattern formation in lateral-inhibition type neural fields , 1977, Biological Cybernetics.

[77] Robert W. Kentridge. Review: The Feeling of What Happens: Body, Emotion and the Making of Consciousness , 2000 .

[78] J. Fuster. Cortex and Mind , 2002 .

[79] R. Zatorre,et al. When the brain plays music: auditory–motor interactions in music perception and production , 2007, Nature Reviews Neuroscience.

[80] Allen Newell,et al. Computer science as empirical inquiry: symbols and search , 1976, CACM.

[81] Pietro G. Morasso,et al. Towards Reasoning and Coordinating Action in the Mental Space , 2007, Int. J. Neural Syst..

[82] Douglas R. Hofstadter,et al. Fluid Concepts and Creative Analogies , 1995 .