论文信息 - Grouping and Anti-predator Behaviors for Multi-agent Systems Based on Reinforcement Learning Scheme

Grouping and Anti-predator Behaviors for Multi-agent Systems Based on Reinforcement Learning Scheme

Several models have been proposed for describing grouping behavior such as bird flocking, terrestrial animal herding, and fish schooling. In these models, a fixed rule has been imposed on each individual a priori for its interactions in a reductive and rigid manner. We have proposed a new framework for self-organized grouping of agents by reinforcement learning. It is important to introduce a learning scheme for developing collective behavior in artificial autonomous distributed systems. This scheme can be expanded to cases in which predators are present. In this study we integrate grouping and anti-predator behaviors into our proposed scheme. The behavior of agents is demonstrated and evaluated in detail through computer simulations, and their grouping and anti-predator behaviors developed as a result of learning are shown to be diverse and robust by changing some parameters of the scheme.

Nobuyuki Matsui | Haruhiko Nishimura | Teijiro Isokawa | Koichiro Morihiro

[1] D. Noakes,et al. Predators and Prey in Fishes , 1983 .

[2] Nobuyuki Matsui,et al. Reinforcement Learning Scheme for Grouping and Anti-predator Behavior , 2007, KES.

[3] Nobuyuki Matsui,et al. Emergence of Flocking Behavior Based on Reinforcement Learning , 2006, KES.

[4] Hayakawa,et al. Collective motion in a system of motile elements. , 1996, Physical review letters.

[5] A. Huth,et al. The simulation of the movement of fish schools , 1992 .

[6] B L Partridge,et al. The structure and function of fish schools. , 1982, Scientific American.

[7] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[8] Andreas Huth,et al. THE SIMULATION OF FISH SCHOOLS IN COMPARISON WITH EXPERIMENTAL DATA , 1994 .

[9] Yoshinobu Inada,et al. Order and flexibility in the motion of fish schools. , 2002, Journal of theoretical biology.

[10] Donald A. Sofge,et al. Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , 1992 .

[11] Nobuyuki Matsui,et al. Learning Grouping and Anti-predator Behaviors for Multi-agent Systems , 2008, KES.

[12] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[13] T. Pitcher,et al. Predator-avoidance behaviours of sand-eel schools: why schools seldom split , 1983 .

[14] Craig W. Reynolds. Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[15] Naotake Kamiura,et al. Reinforcement Learning Scheme for Flocking Behavior Emergence , 2007, J. Adv. Comput. Intell. Intell. Informatics.

[16] Lakhmi C. Jain,et al. Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[17] I. Aoki. A simulation study on the schooling mechanism in fish. , 1982 .

[18] Sebastian Thrun,et al. The role of exploration in learning control , 1992 .

[19] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[20] Atsuko Mutoh,et al. A Simulation Study on the Form of Fish Schooling for Escape from Predator , 2003 .

[21] Hiro-Sato Niwa. Self-organizing Dynamic Model of Fish Schooling , 1994 .

[22] Rune Vabø,et al. An individual based model of fish school reactions: predicting antipredator behaviour as observed in nature , 1997 .