Autonomous navigation in dynamic social environments using Multi-Policy Decision Making

In dynamic environments crowded with people, robot motion planning becomes difficult due to the complex and tightly-coupled interactions between agents. Trajectory planning methods, supported by models of typical human behavior and personal space, often produce reasonable behavior. However, they do not account for the future closed-loop interactions of other agents with the trajectory being constructed. As a consequence, the trajectories are unable to anticipate cooperative interactions (such as a human yielding), or adverse interactions (such as the robot blocking the way). In this paper, we propose a new method for navigation amongst pedestrians in which the trajectory of the robot is not explicitly planned, but instead, a planning process selects one of a set of closed-loop behaviors whose utility can be predicted through forward simulation. In particular, we extend Multi-Policy Decision Making (MPDM) [1] to this domain using the closed-loop behaviors Go-Solo, Follow-other, and Stop. By dynamically switching between these policies, we show that we can improve the performance of the robot as measured by utility functions that reward task completion and penalize inconvenience to other agents. Our evaluation includes extensive results in simulation and real-world experiments.

[1]  P. Molnár Social Force Model for Pedestrian Dynamics Typeset Using Revt E X 1 , 1995 .

[2]  Rachid Alami,et al.  A Human Aware Mobile Robot Motion Planner , 2007, IEEE Transactions on Robotics.

[3]  Gonzalo Ferrer,et al.  Multi-objective cost-to-go functions on robot navigation in dynamic environments , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[4]  Brett R. Fajen,et al.  Visual navigation and obstacle avoidance using a steering potential function , 2006, Robotics Auton. Syst..

[5]  Andreas Krause,et al.  Robot navigation in dense human crowds: Statistical models and experimental studies of human–robot cooperation , 2015, Int. J. Robotics Res..

[6]  Gonzalo Ferrer,et al.  Robot companion: A social-force based approach with human awareness-navigation in crowded environments , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Helbing,et al.  Social force model for pedestrian dynamics. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[8]  Thomas Bak,et al.  Trajectory planning for robots in dynamic human environments , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Christian Laugier,et al.  Probabilistic motion planning among moving obstacles following typical motion patterns , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Panos E. Trahanias,et al.  Probabilistic Autonomous Robot Navigation in Dynamic Environments with Human Motion Prediction , 2010, Int. J. Soc. Robotics.

[11]  Oliver Brock,et al.  High-speed navigation using the global dynamic window approach , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[12]  Wolfram Burgard,et al.  Feature-Based Prediction of Trajectories for Socially Compliant Navigation , 2012, Robotics: Science and Systems.

[13]  Siddhartha S. Srinivasa,et al.  Planning-based prediction for pedestrians , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Wolfram Burgard,et al.  Socially compliant mobile robot navigation via inverse reinforcement learning , 2016, Int. J. Robotics Res..

[15]  O. Khatib,et al.  Real-Time Obstacle Avoidance for Manipulators and Mobile Robots , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[16]  Edwin Olson,et al.  Progress toward multi‐robot reconnaissance and the MAGIC 2010 competition , 2012, J. Field Robotics.

[17]  Edwin Olson,et al.  MPDM: Multipolicy decision-making in dynamic, uncertain environments for autonomous driving , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[18]  Dinesh Manocha,et al.  ClearPath: highly parallel collision avoidance for multi-agent simulation , 2009, SCA '09.

[19]  Gonzalo Ferrer,et al.  Social-aware robot navigation in urban environments , 2013, 2013 European Conference on Mobile Robots.

[20]  Yoram Koren,et al.  Potential field methods and their inherent limitations for mobile robot navigation , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[21]  Christian Laugier,et al.  Leader following: A study on classification and selection , 2016, Robotics Auton. Syst..

[22]  Dinesh Manocha,et al.  Reciprocal n-Body Collision Avoidance , 2011, ISRR.

[23]  Kai Oliver Arras,et al.  Socially-aware robot navigation: A learning approach , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  Wolfram Burgard,et al.  An Approach to Socially Compliant Leader Following for Mobile Robots , 2014, ICSR.

[25]  Edwin Olson,et al.  LCM: Lightweight Communications and Marshalling , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.