An incremental approach to developing intelligent neural network controllers for robots

By beginning with simple reactive behaviors and gradually building up to more memory-dependent behaviors, it may be possible for connectionist systems to eventually achieve the level of planning. This paper focuses on an intermediate step in this incremental process, where the appropriate means of providing guidance to adapting controllers is explored. A local and a global method of reinforcement learning are contrasted-a special form of back-propagation and an evolutionary algorithm. These methods are applied to a neural network controller for a simple robot. A number of experiments are described where the presence of explicit goals and the immediacy of reinforcement are varied. These experiments reveal how various types of guidance can affect the final control behavior. The results show that the respective advantages and disadvantages of these two adaptation methods are complementary, suggesting that some hybrid of the two may be the most effective method. Concluding remarks discuss the next incremental steps toward more complex control behaviors.

[1]  V. Braitenberg Vehicles, Experiments in Synthetic Psychology , 1984 .

[2]  Rodney A. Brooks,et al.  A Robust Layered Control Syste For A Mobile Robot , 2022 .

[3]  Inman Harvey,et al.  Analysing recurrent dynamical networks evolved for robot control , 1993 .

[4]  John H. Holland,et al.  Escaping brittleness: the possibilities of general-purpose learning algorithms applied to parallel rule-based systems , 1995 .

[5]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[6]  Lisa Meeden Towards planning: incremental investigations into adaptive robot control , 1995 .

[7]  David J. Chalmers,et al.  The Evolution of Learning: An Experiment in Genetic Connectionism , 1991 .

[8]  Inman Harvey,et al.  Evolutionary robotics and SAGA: The case for hill crawling and tournament selection , 1994 .

[9]  L. D. Whitley,et al.  Genetic Reinforcement Learning for Neurocontrol Problems , 2004, Machine Learning.

[10]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[11]  Marco Dorigo,et al.  A comparison of Q-learning and classifier systems , 1994 .

[12]  Uwe Schnepf,et al.  Robot ethology: a proposal for the research into intelligent autonomous systems , 1991 .

[13]  Stewart W. Wilson The animat path to AI , 1991 .

[14]  Jean-Arcady Meyer,et al.  Eight Principles for Building an Intelligent Robot , 1991 .

[15]  Lawrence Davis,et al.  Training Feedforward Neural Networks Using Genetic Algorithms , 1989, IJCAI.

[16]  Tariq Samad,et al.  Towards the Genetic Synthesisof Neural Networks , 1989, ICGA.

[17]  Dean A. Pomerleau,et al.  Neural Network Perception for Mobile Robot Guidance , 1993 .

[18]  D.E. Goldberg,et al.  Classifier Systems and Genetic Algorithms , 1989, Artif. Intell..

[19]  David L. Waltz Eight principles for building an intelligent robot , 1991 .

[20]  Marco Dorigo,et al.  Genetics-based machine learning and behavior-based robotics: a new synthesis , 1993, IEEE Trans. Syst. Man Cybern..

[21]  Andy Clark,et al.  Associative Engines: Connectionism, Concepts, and Representational Change , 1993 .

[22]  John J. Grefenstette,et al.  The Evolution of Strategies for Multiagent Environments , 1992, Adapt. Behav..

[23]  Inman Harvey,et al.  Issues in evolutionary robotics , 1993 .

[24]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[25]  Francesco Mondada,et al.  Automatic creation of an autonomous agent: genetic evolution of a neural-network driven robot , 1994 .

[26]  Randall D. Beer,et al.  Evolving Dynamical Neural Networks for Adaptive Behavior , 1992, Adapt. Behav..

[27]  Shumeet Baluja,et al.  Evolution of an artificial neural network based autonomous land vehicle controller , 1996, IEEE Trans. Syst. Man Cybern. Part B.

[28]  Richard K. Belew,et al.  Evolving networks: using the genetic algorithm with connectionist learning , 1990 .

[29]  Stewart W. Wilson,et al.  Robot Ethology: A Proposal for the Research into Intelligent Autonomous Systems , 1991 .

[30]  David H. Ackley,et al.  Generalization and Scaling in Reinforcement Learning , 1989, NIPS.