Dynamic driving and routing games for autonomous vehicles on networks: A mean field game approach

This paper aims to answer the research question as to optimal design of decision-making processes for autonomous vehicles (AVs), including dynamical selection of driving velocity and route choices on a transportation network. Dynamic traffic assignment (DTA) has been widely used to model travelers’ route choice or/and departure-time choice and predict dynamic traffic flow evolution in the short term. However, the existing DTA models do not explicitly describe one’s selection of driving velocity on a road link. Driving velocity choice may not be crucial for modeling the movement of human drivers but it is a must-have control to maneuver AVs. In this paper, we aim to develop a game-theoretic model to solve for AVs’ optimal driving strategies of velocity control in the interior of a road link and route choice at a junction node. To this end, we will first reinterpret the DTA problem as an N -car differential game and show that this game can be tackled with a general mean field game-theoretic framework. The developed mean field game is challenging to solve because of the forward and backward structure for velocity control and the complementarity conditions for route choice. An efficient algorithm is developed to address these challenges. The model and the algorithm are illustrated on the Braess network and the OW network with a single destination. On the Braess network, we first compare the LWR based DTA model with the proposed game and find that the driving and routing control navigates AVs with overall lower costs. We then compare the total travel cost without and with the middle link and find that the Braess paradox may still arise under certain conditions. We also test our proposed model and solution algorithm on the OW network.

[1]  Xuan Zhang,et al.  Density Flow in Dynamical Networks via Mean-Field Games , 2017, IEEE Transactions on Automatic Control.

[2]  Bin Ran,et al.  A link-based variational inequality model for dynamic departure time/route choice , 1996 .

[3]  Xuan Di,et al.  Multi-Agent Reinforcement Learning for Dynamic Routing Games: A Unified Paradigm , 2020, ArXiv.

[4]  B Wie,et al.  DYNAMIC SYSTEM OPTIMAL TRAFFIC ASSIGNMENT ON CONGESTED MULTIDESTINATION NETWORKS , 1990 .

[5]  Pedro A. Neto,et al.  Dynamic user equilibrium based on a hydrodynamic model , 2013 .

[6]  Hani S. Mahmassani,et al.  Dynamic Network Traffic Assignment and Simulation Methodology for Advanced System Management Applications , 2001 .

[7]  Benedetto Piccoli,et al.  Multiscale Modeling and Control Architecture for V2X Enabled Traffic Streams , 2017, IEEE Transactions on Vehicular Technology.

[8]  Terry L. Friesz,et al.  A Variational Inequality Formulation of the Dynamic Network User Equilibrium Problem , 1993, Oper. Res..

[9]  Jia Yuan Yu,et al.  Mean field equilibria of multi armed bandit games , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[10]  Guido Gentile,et al.  Using the General Link Transmission Model in a Dynamic Traffic Assignment to Simulate Congestion on Urban Networks , 2015 .

[11]  P. I. Richards Shock Waves on the Highway , 1956 .

[12]  Henry X. Liu,et al.  Continuous-time point-queue models in dynamic network loading , 2012 .

[13]  Christian A. Ringhofer,et al.  Large-Scale Dynamics of Mean-Field Games Driven by Local Nash Equilibria , 2014, J. Nonlinear Sci..

[14]  W. Y. Szeto,et al.  A cell-based variational inequality formulation of the dynamic user optimal assignment problem , 2002 .

[15]  Xiaolei Guo,et al.  Braess paradox under the boundedly rational user equilibria , 2014 .

[16]  Masao Kuwahara,et al.  Dynamic user optimal assignment with physical queues for a many-to-many OD pattern , 2001 .

[17]  H. M. Zhang,et al.  A Comparative Study of Some Macroscopic Link Models Used in Dynamic Traffic Assignment , 2005 .

[18]  B. Ran,et al.  A LINK-BASED VARIATIONAL INEQUALITY FORMULATION OF IDEAL DYNAMIC USER-OPTIMAL ROUTE CHOICE PROBLEM , 1996 .

[19]  Yang Yu,et al.  Day-to-day dynamic traffic assignment with imperfect information, bounded rationality and information sharing , 2018, Transportation Research Part C: Emerging Technologies.

[20]  E. Dockner,et al.  Differential Games in Economics and Management Science , 2001 .

[21]  Xuan Di,et al.  A unified equilibrium framework of new shared mobility systems , 2019, Transportation Research Part B: Methodological.

[22]  Y. W. Xu,et al.  Advances in the Continuous Dynamic Network Loading Problem , 1996, Transp. Sci..

[23]  Terry L. Friesz,et al.  The mathematical foundations of dynamic user equilibrium , 2019, Transportation Research Part B: Methodological.

[24]  H. M. Zhang,et al.  Modelling network flow with and without link interactions: the cases of point queue, spatial queue and cell transmission model , 2013 .

[25]  Bin Ran,et al.  Solving an Instantaneous Dynamic User-Optimal Route Choice Model , 1995, Transp. Sci..

[26]  Tao Yao,et al.  A partial differential equation formulation of Vickrey’s bottleneck model, part I: Methodology and theoretical analysis , 2013 .

[27]  Deepak K. Merchant,et al.  A Model and an Algorithm for the Dynamic Traffic Assignment Problems , 1978 .

[28]  H. Lo A DYNAMIC TRAFFIC ASSIGNMENT FORMULATION THAT ENCAPSULATES THE CELL-TRANSMISSION MODEL , 1999 .

[29]  Mukund Sundararajan,et al.  Mean Field Equilibria of Dynamic Auctions with Learning , 2014, Manag. Sci..

[30]  Stephen Graham Ritchie,et al.  TRANSPORTATION RESEARCH. PART C, EMERGING TECHNOLOGIES , 1993 .

[31]  M. Lighthill On sound generated aerodynamically I. General theory , 1952, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[32]  Hamidou Tembine,et al.  Electrical Vehicles in the Smart Grid: A Mean Field Game Analysis , 2011, IEEE Journal on Selected Areas in Communications.

[33]  Adriano Festa,et al.  A Mean Field Games approach for multi-lane traffic management , 2017, 1711.04116.

[34]  Olivier Guéant,et al.  Mean Field Games and Applications , 2011 .

[35]  F. Facchinei,et al.  Finite-Dimensional Variational Inequalities and Complementarity Problems , 2003 .

[36]  Stephen D. Boyles,et al.  A multiclass cell transmission model for shared human and autonomous vehicle roads , 2016 .

[37]  Michel Bierlaire,et al.  Dynamic network loading: a stochastic differentiable model that derives link state distributions , 2011 .

[38]  Omar Drissi-Kaïtouni,et al.  A variational inequality formulation of the Dynamic Traffic Assignment Problem , 1993 .

[39]  Kuang Huang,et al.  Stabilizing Traffic via Autonomous Vehicles: A Continuum Mean Field Game Approach , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[40]  M. Burger,et al.  Mean field games with nonlinear mobilities in pedestrian dynamics , 2013, 1304.5201.

[41]  Lorenzo Meschini,et al.  Spillback congestion in dynamic traffic assignment: A macroscopic flow model with time-varying bottlenecks , 2007 .

[42]  Bin Ran,et al.  for dynamic user equilibria with exact flow propagations , 2008 .

[43]  Athanasios K. Ziliaskopoulos,et al.  Foundations of Dynamic Traffic Assignment: The Past, the Present and the Future , 2001 .

[44]  George L. Nemhauser,et al.  Optimality Conditions for a Dynamic Traffic Assignment Model , 1978 .

[45]  Pierre Cardaliaguet,et al.  Weak Solutions for First Order Mean Field Games with Local Coupling , 2013, 1305.7015.

[46]  Ludovic Leclercq,et al.  Dynamic Traffic Assignment for regional networks with traffic-dependent trip lengths and regional paths , 2021, Transportation Research Part C: Emerging Technologies.

[47]  Xuan Di,et al.  A Survey on Autonomous Vehicle Control in the Era of Mixed-Autonomy: From Physics-Based to AI-Guided Driving Policy Learning , 2020, Transportation Research Part C: Emerging Technologies.

[48]  Randall J. LeVeque,et al.  Finite difference methods for ordinary and partial differential equations - steady-state and time-dependent problems , 2007 .

[49]  Satish V. Ukkusuri,et al.  Dynamic system optimal model for multi-OD traffic networks with an advanced spatial queuing model , 2015 .

[50]  Henry X. Liu,et al.  Boundedly rational route choice behavior: A review of models and methodologies , 2016 .

[51]  Michiel C. J. Bliemer,et al.  Genetics of traffic assignment models for strategic transport planning , 2015 .

[52]  P. Cardaliaguet,et al.  Mean Field Games , 2020, Lecture Notes in Mathematics.

[53]  S. Shankar Sastry,et al.  Inverse Problem for Non-Viscous Mean Field Control: Example From Traffic , 2016, IEEE Transactions on Automatic Control.

[54]  Hai-Jun Huang,et al.  Dynamic user optimal traffic assignment model for many to one travel demand , 1995 .

[55]  Kuang Huang,et al.  A Game-Theoretic Framework for Autonomous Vehicles Velocity Control: Bridging Microscopic Differential Games and Macroscopic Mean Field Games , 2019, Discrete & Continuous Dynamical Systems - B.

[56]  Henry X. Liu,et al.  Boundedly Rational User Equilibria (BRUE): Mathematical Formulation and Solution Sets , 2013 .

[57]  W. Y. Szeto,et al.  A CELL-BASED SIMULTANEOUS ROUTE AND DEPARTURE TIME CHOICE MODEL WITH ELASTIC DEMAND , 2004 .

[58]  R. LeVeque Finite Volume Methods for Hyperbolic Problems: Characteristics and Riemann Problems for Linear Hyperbolic Equations , 2002 .

[59]  Roland P. Malhamé,et al.  A micro-macro traffic model based on Mean-Field Games , 2015, 2015 American Control Conference (ACC).

[60]  Malachy Carey,et al.  Comparing whole-link travel time models , 2003 .

[61]  Xuan Di,et al.  Scalable traffic stability analysis in mixed-autonomy using continuum models , 2020 .

[62]  Satish V. Ukkusuri,et al.  A linear programming formulation for autonomous intersection control within a dynamic traffic assignment and connected vehicle environment , 2015 .

[63]  Terry L. Friesz,et al.  Continuity of the path delay operator for dynamic network loading with spillback , 2015, 1501.04241.

[64]  W. Vickrey Congestion Theory and Transport Investment , 1969 .

[65]  M J Lighthill,et al.  On kinematic waves II. A theory of traffic flow on long crowded roads , 1955, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[66]  Terry L. Friesz,et al.  Computing Dynamic User Equilibria on Large-Scale Networks with Software Implementation , 2019, Networks and Spatial Economics.

[67]  Hyunsoo Noh,et al.  Technical Report on SHRP 2 C10B Version of DynusT and FAST-TrIPs , 2013 .

[68]  Yafeng Yin,et al.  Optimal deployment of charging lanes for electric vehicles in transportation networks , 2016 .

[69]  Malachy Carey,et al.  Behaviour of a whole-link travel time model used in dynamic traffic assignment , 2002 .

[70]  A. Lachapelle,et al.  COMPUTATION OF MEAN FIELD EQUILIBRIA IN ECONOMICS , 2010 .

[71]  H. M. Zhang,et al.  Delay-Function-Based Link Models: Their Properties and Computational Issues , 2005 .

[72]  Lanshan Han,et al.  Dynamic user equilibrium with a path based cell transmission model for general traffic networks , 2012 .

[73]  Marie-Therese Wolfram,et al.  On a mean field game approach modeling congestion and aversion in pedestrian crowds , 2011 .

[74]  Hamidou Tembine,et al.  Mean-Field-Type Games in Engineering , 2016, ArXiv.