Who goes there?: selecting a robot to reach a goal using social regret

A common decision problem in multi-robot applications involves deciding on which robot, out of a group of N robots, should travel to a goal location, to carry out a task there. Trivially, this decision problem can be solved greedily, by selecting the robot with the shortest expected travel time. However, this ignores the inherent uncertainty in path traversal times; we may prefer a robot that is slower (but always takes the same time), over a robot that is expected to reach the goal faster, but on occasion takes a very long time to arrive. We make several contributions that address this challenge. First, we bring to bear economic decision-making theory, to distinguish between different selection policies, based on risk (risk averse, risk seeking, etc.). Second, we introduce social regret (the difference between the actual travel time by the selected robot, and the hypothetical time of other robots) to augment decision-making in practice. Then, we carry out experiments in simulation and with real robots, to demonstrate the usefulness of the selection procedures under real-world settings, and find that travel-time distributions have repeating characteristics.

[1]  Nidhi Kalra,et al.  Market-Based Multirobot Coordination: A Survey and Analysis , 2006, Proceedings of the IEEE.

[2]  Wolfram Burgard,et al.  Probabilistic Algorithms and the Interactive Museum Tour-Guide Robot Minerva , 2000, Int. J. Robotics Res..

[3]  David E. Bell,et al.  Regret in Decision Making under Uncertainty , 1982, Oper. Res..

[4]  Wolfram Burgard,et al.  Coordination for Multi-Robot Exploration and Mapping , 2000, AAAI/IAAI.

[5]  James E. Bobrow,et al.  Optimal robot plant planning using the minimum-time criterion , 1988, IEEE J. Robotics Autom..

[6]  J. Andrew Bagnell,et al.  Improving robot navigation through self‐supervised online learning , 2006, J. Field Robotics.

[7]  R. Sugden,et al.  Regret Theory: An alternative theory of rational choice under uncertainty Review of Economic Studies , 1982 .

[8]  Evangelos Markakis,et al.  Agent Coordination with Regret Clearing , 2008, AAAI.

[9]  A.I. Chaudhry Path Generation Using Matrix Representations of Previous Robot State Data , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[10]  Olivier Michel,et al.  Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation , 2004, ArXiv.

[11]  Wolfram Burgard,et al.  Improving Grid-based SLAM with Rao-Blackwellized Particle Filters by Adaptive Proposals and Selective Resampling , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[12]  Karen Zita Haigh,et al.  Planning, Execution and Learning in a Robotic Agent , 1998, AIPS.

[13]  Dean P. Foster,et al.  Regret in the On-Line Decision Problem , 1999 .

[14]  Olivier Michel,et al.  Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation , 2004 .

[15]  Jan Willemson,et al.  Robots Find a Better Way : A Learning Method for Mobile Robot Navigation in Partially Unknown Environments , 2004 .