Best Response Model Predictive Control for Agile Interactions Between Autonomous Ground Vehicles

We introduce an algorithm for autonomous control of multiple fast ground vehicles operating in close proximity to each other. The algorithm is based on a combination of the game theoretic notion of iterated best response, and an information theoretic model predictive control algorithm designed for non-linear stochastic systems. We test the algorithm on two one-fifth scale AutoRally platforms traveling at speeds upwards of 8 meters per second, while maintaining a following distance of under two meters from bumper-to-bumper.

[1]  C. Atkeson,et al.  Minimax differential dynamic programming: application to a biped walking robot , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[2]  Claire J. Tomlin,et al.  A probabilistic approach to planning and control in autonomous urban driving , 2013, 52nd IEEE Conference on Decision and Control.

[3]  Yang Zheng,et al.  Distributed Model Predictive Control for Heterogeneous Vehicle Platoons Under Unidirectional Topologies , 2016, IEEE Transactions on Control Systems Technology.

[4]  James M. Rehg,et al.  Aggressive driving with model predictive path integral control , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[5]  Christoph Hermes,et al.  Long-term vehicle motion prediction , 2009, 2009 IEEE Intelligent Vehicles Symposium.

[6]  Anca D. Dragan,et al.  Planning for Autonomous Cars that Leverage Effects on Human Actions , 2016, Robotics: Science and Systems.

[7]  Nolan Wagener,et al.  Information theoretic MPC for model-based reinforcement learning , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[8]  Nick McKeown,et al.  Automated vehicle control developments in the PATH program , 1991 .

[9]  Qingfeng Huang,et al.  An adaptive peer-to-peer collision warning system , 2002, Vehicular Technology Conference. IEEE 55th Vehicular Technology Conference. VTC Spring 2002 (Cat. No.02CH37367).

[10]  Fawzi Nashashibi,et al.  Real time trajectory prediction for collision risk estimation between vehicles , 2009, 2009 IEEE 5th International Conference on Intelligent Computer Communication and Processing.

[11]  Jean C. Walrand,et al.  Harbor attack: A pursuit-evasion game , 2011, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[12]  Marin Kobilarov,et al.  Cross-entropy motion planning , 2012, Int. J. Robotics Res..

[13]  Panganamala Ramana Kumar,et al.  An MPC-Based Approach to Provable System-Wide Safety and Liveness of Autonomous Ground Traffic , 2014, IEEE Transactions on Automatic Control.

[14]  H. Kappen Path integrals and symmetry breaking for optimal control theory , 2005, physics/0505066.

[15]  Frank Dellaert,et al.  iSAM2: Incremental smoothing and mapping using the Bayes tree , 2012, Int. J. Robotics Res..

[16]  F. Borrelli,et al.  Stochastic Predictive Control of Autonomous Vehicles in Uncertain Environments , 2014 .

[17]  Dizan Vasquez,et al.  A survey on motion prediction and risk assessment for intelligent vehicles , 2014, ROBOMECH Journal.

[18]  Stefan Schaal,et al.  A Generalized Path Integral Control Approach to Reinforcement Learning , 2010, J. Mach. Learn. Res..

[19]  Nicholas F. Maxemchuk,et al.  Highway Capacity Benefits from Using Vehicle-to-Vehicle Communication and Sensors for Collision Avoidance , 2011, 2011 IEEE Vehicular Technology Conference (VTC Fall).

[20]  Christoph Stiller,et al.  Driver intent inference at urban intersections using the intelligent driver model , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[21]  Dusan M. Stipanovic,et al.  Pursuit, evasion and defense in the plane , 2012, 2012 American Control Conference (ACC).

[22]  Keng Peng Tee,et al.  Continuous Role Adaptation for Human–Robot Shared Control , 2015, IEEE Transactions on Robotics.

[23]  Sebastian Thrun,et al.  Toward robotic cars , 2010, CACM.