A Hierarchical Autonomous Robot Controller for Learning and Memory: Adaptation in a Dynamic Environment

This work concerns practical issues surrounding the application of learning and memory in a real mobile robot with the goal of optimal navigation in dynamic environments. A novel hierarchical adaptive controller that contains two-level units was developed and trained in a physical mobile robot “e-Puck.” In the low-level unit, the robot holds a number of biologically inspired Aplysia -like spiking neural networks that have the property of spike time-dependent plasticity. Each of these networks is trained to become an expert in a particular local environment(s). All the trained networks are stored in a tree-type memory structure that is located in the high-level unit. These stored networks are used as experiences for the robot to enhance its navigation ability in both new and previously trained environments. The robot's memory is designed to hold memories of various lengths and has a simple searching mechanism. Forgetting and dynamic clustering techniques are used to control the memory size. Experimental results show that the proposed model can produce a robot with learning and memorizing capabilities that enable it to survive in complex and highly dynamic environments.

[1]  Michael J. Carey,et al.  A Study of Index Structures for a Main Memory Database Management System , 1986, HPTS.

[2]  Stefano Nolfi,et al.  Learning to perceive the world as articulated: an approach for hierarchical learning in sensory-motor systems , 1998, Neural Networks.

[3]  Zsolt Kira,et al.  Forgetting Bad Behavior: Memory Management for Case-Based Navigation , 2004 .

[4]  Jacob Nielsen,et al.  Spiking neural building block robot with Hebbian learning , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[5]  Andrew W. Moore,et al.  Efficient memory-based learning for robot control , 1990 .

[6]  Kazuyuki Murase,et al.  A Simple Aplysia-Like Spiking Neural Network to Generate Adaptive Behavior in Autonomous Robots , 2008, Adapt. Behav..

[7]  Kazuyuki Murase,et al.  An Autonomous Mobile Robot Controlled by a Spike Neuron Network with one Hidden-Layer Neuron having Spike Timing-Dependent Plasticity , 2006 .

[8]  Stephanie Forrest,et al.  A Machine Learning Evaluation of an Artificial Immune System , 2005, Evolutionary Computation.

[9]  Hugh F. Durrant-Whyte,et al.  A solution to the simultaneous localization and map building (SLAM) problem , 2001, IEEE Trans. Robotics Autom..

[10]  Dario Floreano,et al.  From Wheels to Wings with Evolutionary Spiking Circuits , 2003, Artificial Life.

[11]  Andrew McCallum,et al.  Reinforcement learning with selective perception and hidden state , 1996 .

[12]  Jun Tani,et al.  How Hierarchical Control Self-organizes in Artificial Adaptive Systems , 2005, Adapt. Behav..

[13]  Ratnesh Kumar,et al.  A behavior-based intelligent control architecture , 1998, Proceedings of the 1998 IEEE International Symposium on Intelligent Control (ISIC) held jointly with IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA) Intell.

[14]  James L. McClelland Running head : HEBBIAN LEARNING How Far Can You Go with Hebbian Learning , and When Does it Lead you Astray ? , 2005 .

[15]  Kazuyuki Murase,et al.  Self-organization of Spiking Neural Network that Generates Autonomous Behavior in a Real Mobile Robot , 2006, Int. J. Neural Syst..

[16]  F. Craik,et al.  Levels of Pro-cessing: A Framework for Memory Research , 1975 .

[17]  Genhong Cheng,et al.  The Art of War: Innate and adaptive immune responses , 2003, Cellular and Molecular Life Sciences CMLS.

[18]  Long Ji Lin,et al.  Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.

[19]  L. Abbott,et al.  Competitive Hebbian learning through spike-timing-dependent synaptic plasticity , 2000, Nature Neuroscience.

[20]  Michael V. Johnston,et al.  Fundamental neuroscience, 2nd edition: Edited by Duane E. Haines, PhD. 582 pp., illustrated. New York: Churchill Livingstone, 2002. $52.00. ISBN 0-443-066035. , 2002 .

[21]  Ming Tan,et al.  Cost-Sensitive Reinforcement Learning for Adaptive Classification and Control , 1991, AAAI.

[22]  Paul F. M. J. Verschure,et al.  Environmentally mediated synergy between perception and behaviour in mobile robots , 2003, Nature.

[23]  Andreas Zimmermann,et al.  Context-Awareness in User Modelling: Requirements Analysis for a Case-Based Reasoning Application , 2003, ICCBR.

[24]  Jürgen Branke,et al.  Evolutionary Optimization in Dynamic Environments , 2001, Genetic Algorithms and Evolutionary Computation.

[25]  Kazuyuki Murase,et al.  Adaptation of Real Autonomous Mobile Robot in Complex Environment using Pattern Association Network Controller (PAN-C) , 2008 .

[26]  K. Jellinger Processes of Change in Brain and Cognitive Development Attention and Performance XXI , 2007 .

[27]  Anthony Kulis,et al.  Bio-Inspired Artificial Intelligence: Theories, Methods, and Technologies , 2009, Scalable Comput. Pract. Exp..

[28]  David C. Wilson,et al.  When Experience Is Wrong: Examining CBR for Changing Tasks and Environments , 1999, ICCBR.

[29]  Paul F. M. J. Verschure,et al.  A real-world rational agent: unifying old and new AI , 2003, Cogn. Sci..

[30]  James S. Albus,et al.  4-D/RCS reference model architecture for unmanned ground vehicles , 1999, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[31]  Helga Noice,et al.  What Studies of Actors and Acting Can Tell Us About Memory and Cognitive Functioning , 2006 .

[32]  Tom Ziemke,et al.  Imagination and Abstraction of Sensorimotor Flow : Towards a Robot Model , 2005 .

[33]  Kazuyuki Murase,et al.  Sensor-fusion in spiking neural network that generates autonomous behavior in real mobile robot , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[34]  Ronald C. Arkin,et al.  Learning behavioral parameterization using spatio-temporal case-based reasoning , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[35]  Longxin Lin Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching , 2004, Machine Learning.

[36]  Zbigniew Michalewicz,et al.  Adaptive evolutionary planner/navigator for mobile robots , 1997, IEEE Trans. Evol. Comput..

[37]  K. Ganesan,et al.  Case-based path planning for autonomous underwater vehicles , 1994, Auton. Robots.

[38]  Christine M. Anderson-Cook,et al.  A Genetic Algorithm with Memory for Mixed Discrete-Continuous Design Optimization , 2002 .

[39]  Martin K. Purvis,et al.  Self-adaptation and Dynamic Environment Experiments with Evolvable Virtual Machines , 2005, Engineering Self-Organising Systems.