A Real-Time Game Theoretic Planner for Autonomous Two-Player Drone Racing

To be successful in multi-player drone racing, a player must not only follow the race track in an optimal way, but also compete with other drones through strategic blocking, faking, and opportunistic passing while avoiding collisions. Since unveiling one's own strategy to the adversaries is not desirable, this requires each player to independently predict the other players' future actions. Nash equilibria are a powerful tool to model this and similar multi-agent coordination problems in which the absence of communication impedes full coordination between the agents. In this paper, we propose a novel receding horizon planning algorithm that, exploiting sensitivity analysis within an iterated best response computational scheme, can approximate Nash equilibria in real time. We also describe a vision-based pipeline that allows each player to estimate its opponent's relative position. We demonstrate that our solution effectively competes against alternative strategies in a large number of drone racing simulations. Hardware experiments with onboard vision sensing prove the practicality of our strategy.
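As a rough illustration of the iterated best response idea mentioned above, the sketch below runs it on a toy two-player scalar game. It is not the planner proposed in the paper: it has no race track, no collision constraints, no receding horizon, and no sensitivity analysis term. The cost function `(x - target)**2 + coupling * x * other_action`, the function names, and all parameter values are assumptions chosen only so that each best response has a closed form and the result can be checked against the exact Nash equilibrium.

```python
def best_response(target, coupling, other_action):
    """Minimizer over x of the assumed toy cost
    (x - target)**2 + coupling * x * other_action."""
    # Setting the derivative 2*(x - target) + coupling*other_action to zero.
    return target - 0.5 * coupling * other_action


def iterated_best_response(targets, coupling, iterations=50, tol=1e-10):
    """Alternate best responses until both actions stop changing."""
    x1, x2 = targets  # start from each player's uncoupled optimum
    for _ in range(iterations):
        new_x1 = best_response(targets[0], coupling, x2)
        # Player 2 responds to player 1's freshly updated action.
        new_x2 = best_response(targets[1], coupling, new_x1)
        converged = abs(new_x1 - x1) < tol and abs(new_x2 - x2) < tol
        x1, x2 = new_x1, new_x2
        if converged:
            break
    return x1, x2


def closed_form_nash(targets, coupling):
    """Exact fixed point of the two best-response maps, for comparison."""
    a1, a2 = targets
    k = 0.5 * coupling
    denom = 1.0 - k * k  # requires |k| != 1
    return (a1 - k * a2) / denom, (a2 - k * a1) / denom


if __name__ == "__main__":
    targets, coupling = (1.0, 2.0), 0.5
    print("iterated best response:", iterated_best_response(targets, coupling))
    print("closed-form Nash point:", closed_form_nash(targets, coupling))
```

In this toy game the alternation contracts whenever `abs(coupling) < 2`, so the iterates approach the point where neither player can lower its own cost by deviating unilaterally. Convergence in a constrained racing game is a separate question that this sketch does not address.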
