论文信息 - Robotics and Multi-agent Systems Robustness in the Long Run: Auto-teaching vs Anticipation in Evolutionary Robotics

Robotics and Multi-agent Systems Robustness in the Long Run: Auto-teaching vs Anticipation in Evolutionary Robotics

In Evolutionary Robotics, auto-teaching networks, neural networks that modify their own weights during the life-time of the robot, have been shown to be powerful architectures to develop adaptive controllers. Unfortunately, when run for a longer period of time than that used during evolution, the long-term behavior of such networks can become unpredictable. This paper gives an example of such dangerous behavior, and proposes an alternative solution based on anticipation: as in auto-teaching networks, a secondary network is evolved, but its outputs try to predict the next state of the robot sensors. The weights of the action network are adjusted using some back-propagation procedure based on the errors made by the anticipatory network. First results – in simulated environments – show a tremendous increase in robustness of the long-term behavior of the controller.

Michèle Sebag | Marc Schoenauer | Nicolas Godzik

[1] Stefano Nolfi. How Learning and Evolution Interact: The Case of a Learning Task which Differs from the Evolutionary Task , 1999, Adapt. Behav..

[2] Stefano Nolfi,et al. Good teaching inputs do not correspond to desired responses in ecological neural networks , 1994, Neural Processing Letters.

[3] Stefano Nolfi,et al. How Co-Evolution can Enhance the Adaptive Power of Artificial Evolution: Implications for Evolutionary Robotics , 1998, EvoRobot.

[4] Inman Harvey,et al. Is There Another New Factor in Evolution? , 1996, Evolutionary Computation.

[5] Naftali Tishby,et al. The information bottleneck method , 2000, ArXiv.

[6] Stefano Nolfi,et al. Learning to Adapt to Changing Environments in Evolving Neural Networks , 1996, Adapt. Behav..

[7] Jeffrey L. Elman,et al. Learning and Evolution in Neural Networks , 1994, Adapt. Behav..

[8] Stefano Nolfi,et al. Desired answers do not correspond to good teaching inputs in ecological neural networks , 2000 .

[9] Peter Stone,et al. Anticipation as a key for collaboration in a team of agents: a case study in robotic soccer , 1999, Optics East.

[10] A. Noë,et al. A sensorimotor account of vision and visual consciousness. , 2001, The Behavioral and brain sciences.

[11] Ronald C. Arkin,et al. Anticipatory robot navigation by simultaneously localizing and building a cognitive map , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[12] Wolfgang Stolzmann,et al. An Introduction to Anticipatory Classifier Systems , 1999, Learning Classifier Systems.

[13] Jean-Gabriel Ganascia,et al. Learning Strategies in Games by Anticipation , 1997, IJCAI.

[14] Michael I. Jordan,et al. Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[15] Yannis Dimopoulos,et al. Use of some sensitivity criteria for choosing networks with good generalization ability , 1995, Neural Processing Letters.