Receding Horizon Markov Game Autonomous Driving Strategy

This paper presents a novel human-like autonomous driving algorithms for lane-changing problem. To this end, we present a multi-agent decision-making scheme by blending game theory with the Markov decision process, forming a Markov game (MG). In this decision-making process, interaction of a subject vehicle (SV) and traffic vehicles (TVs) are captured in a mathematically tractable manner via both a cooperative game (max - max) where vehicles perform their decisions for collective objectives and a noncooperative game (max - min) where vehicles perform their decisions for individual objectives. Strategies of the players are computed via a Receding Horizon (RH) approach where optimal solutions are found through an optimization strategy by taking into account current and future constraints. The proposed approach is evaluated in a human-in-the-Ioop (HIL) environment built around a MATLAB/SimulinkldSPACE realtime simulator where the Markov game-guided SV controller interacts with programmed TV s and one human-driven vehicle (HV). Experimental results show that the Markov game driving strategy is capable of finding a safe gap in multi-move traffic that is consistent with human drivers' behaviours in mandatory lane-changing.

[1]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[2]  Markus Maurer,et al.  Probabilistic online POMDP decision making for lane changes in fully automated driving , 2013, 16th International IEEE Conference on Intelligent Transportation Systems (ITSC 2013).

[3]  Jonas Fredriksson,et al.  If, When, and How to Perform Lane Change Maneuvers on Highways , 2016, IEEE Intelligent Transportation Systems Magazine.

[4]  Nakayama,et al.  Dynamical model of traffic congestion and numerical simulation. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[5]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[6]  Meng Wang,et al.  Game theoretic approach for predictive lane-changing and car-following control , 2015 .

[7]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[8]  Reza Langari,et al.  A human-like game theory-based controller for automatic lane changing , 2018 .

[9]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[10]  M. E. R. “If” , 1921, Definitions.

[11]  Hideyuki Kita,et al.  A merging–giveway interaction model of cars in a merging section: a game theoretic analysis , 1999 .

[12]  Reza Langari,et al.  Predictive Fuzzy Markov Decision Strategy for Autonomous Driving in Highways , 2018, 2018 IEEE Conference on Control Technology and Applications (CCTA).

[13]  Reza Langari,et al.  A Predictive Perception Model and Control Strategy for Collision-Free Autonomous Driving , 2019, IEEE Transactions on Intelligent Transportation Systems.

[14]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[15]  Alireza Talebpour,et al.  Influence of connected and autonomous vehicles on traffic flow stability and throughput , 2016 .

[16]  Hani S. Mahmassani,et al.  Modeling Lane-Changing Behavior in a Connected Environment: A Game Theory Approach , 2015 .

[17]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[18]  Peter Hidas,et al.  MODELLING LANE CHANGING AND MERGING IN MICROSCOPIC TRAFFIC SIMULATION , 2002 .