Using Optimal Foraging Models to Evaluate Learned Robotic Foraging Behavior

A key challenge in designing robot teams is determining how to allocate team members to specific roles according to their abilities and the demands of the environment. In this paper we explore this issue in the context of multi-robot foraging, and we show that optimal foraging theory can be used to evaluate our work in learned multi-robot foraging tasks. We present a means by which members of a multi-robot team may use reinforcement learning to allocate themselves to specific foraging roles appropriate to their environment and their abilities. We test this approach in environments with different distributions of various types of attractors and by varying the relative effectiveness of different foraging strategies. We then examine the effectiveness of the algorithm by comparing the distributions learned by the individual robots to those predicted by several optimal foraging models. We show the resulting learned distributions are substantially similar to those predicted by the optimal foraging theory models.

[1]  G. Theraulaz,et al.  Response threshold reinforcements and division of labour in insect societies , 1998, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[2]  Maja J. Mataric,et al.  Reinforcement Learning in the Multi-Robot Domain , 1997, Auton. Robots.

[3]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[4]  David Sloan Wilson,et al.  On the Coexistence of Specialists and Generalists , 1994, The American Naturalist.

[5]  Tucker R. Balch,et al.  Communication in reactive multiagent robotic systems , 1995, Auton. Robots.

[6]  Maja J. Mataric,et al.  Adaptive division of labor in large-scale minimalist multi-robot systems , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[7]  E. Bonabeau,et al.  Quantitative study of the fixed threshold model for the regulation of division of labour in insect societies , 1996, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[8]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[9]  J. T. Erichsen,et al.  Optimal prey selection in the great tit (Parus major) , 1977, Animal Behaviour.

[10]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11]  Ronald C. Arkin,et al.  Motor schema based navigation for a mobile robot: An approach to programming by behavior , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[12]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[13]  E. Charnov Optimal Foraging: Attack Strategy of a Mantid , 1976, The American Naturalist.

[14]  L A Real,et al.  Animal choice behavior and the evolution of cognitive architecture , 1991, Science.

[15]  Maja J. Mataric,et al.  Interference as a Tool for Designing and Evaluating Multi-Robot Controllers , 1997, AAAI/IAAI.

[16]  R. Macarthur,et al.  On Optimal Use of a Patchy Environment , 1966, The American Naturalist.

[17]  R. Arkin,et al.  Behavioral diversity in learning robot teams , 1998 .

[18]  Ronald C. Arkin,et al.  Learning to role-switch in multi-robot systems , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[19]  E. Wilson,et al.  Caste and ecology in the social insects. , 1979, Monographs in population biology.