Reinforcement Learning Scheme for Grouping and Anti-predator Behavior

Collective behavior such as bird flocking, land animal herding, and fish schooling is well known in nature. Many observations have shown that there are no leaders to control the behavior of a group. Several models have been proposed for describing the grouping behavior, which we regard as a distinctive example of aggregate motions. In these models, a fixed rule is provided for each of the individuals a priori for their interactions in a reductive and rigid manner. In contrast, we propose a new framework for the self-organized grouping of agents by reinforcement learning. It is important to introduce a learning scheme for causing collective behavior in artificial autonomous distributed systems. The behavior of agents is demonstrated and evaluated through computer simulations and it is shown that their grouping and anti-predator behavior emerges as a result of learning.

[1]  Hiro-Sato Niwa Self-organizing Dynamic Model of Fish Schooling , 1994 .

[2]  Atsuko Mutoh,et al.  A Simulation Study on the Form of Fish Schooling for Escape from Predator , 2003 .

[3]  T. Pitcher,et al.  Predator-avoidance behaviours of sand-eel schools: why schools seldom split , 1983 .

[4]  I. Aoki A simulation study on the schooling mechanism in fish. , 1982 .

[5]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[6]  A. Huth,et al.  The simulation of the movement of fish schools , 1992 .

[7]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[8]  Naotake Kamiura,et al.  Reinforcement Learning Scheme for Flocking Behavior Emergence , 2007, J. Adv. Comput. Intell. Intell. Informatics.

[9]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[10]  Nobuyuki Matsui,et al.  Emergence of Flocking Behavior Based on Reinforcement Learning , 2006, KES.

[11]  Rune Vabø,et al.  An individual based model of fish school reactions: predicting antipredator behaviour as observed in nature , 1997 .

[12]  Hayakawa,et al.  Collective motion in a system of motile elements. , 1996, Physical review letters.

[13]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[14]  D. Noakes,et al.  Predators and Prey in Fishes , 1983 .

[15]  B L Partridge,et al.  The structure and function of fish schools. , 1982, Scientific American.

[16]  Yoshinobu Inada,et al.  Order and flexibility in the motion of fish schools. , 2002, Journal of theoretical biology.

[17]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[18]  Peter Dayan,et al.  Technical Note: Q-Learning , 2004, Machine Learning.

[19]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.