论文信息 - Infinite-horizon average-cost Markov decision process routing games

Infinite-horizon average-cost Markov decision process routing games

We explore an extension of nonatomic routing games that we call Markov decision process routing games where each agent chooses a transition policy between nodes in a network rather than a path from an origin node to a destination node, i.e. each agent in the population solves a Markov decision process rather than a shortest path problem. This type of game was first introduced in [1] in the finite-horizon total-cost case. Here we present the infinite-horizon average-cost case. We present the appropriate definition of a Wardrop equilibrium as well as a potential function program for finding the equilibrium. This work can be thought of as a routing-game-based formulation of continuous population stochastic games (mean-field games or anonymous sequential games). We apply our model to ridesharing drivers competing for fares in an urban area.

S. Shankar Sastry | Daniel J. Calderone | S. Sastry

[1] William H. Sandholm,et al. Potential Games with Continuous Player Sets , 2001, J. Econ. Theory.

[2] P. Lions,et al. Jeux à champ moyen. II – Horizon fini et contrôle optimal , 2006 .

[3] P. Lions,et al. Mean field games , 2007 .

[4] C. B. Mcguire,et al. Studies in the Economics of Transportation , 1958 .

[5] Olivier Guéant. Existence and Uniqueness Result for Mean Field Games with Congestion Effect on Graphs , 2011, 1110.3442.

[6] Dan Bernhardt,et al. Anonymous Sequential Games with General State Space , 1991 .

[7] Eitan Altman,et al. Stochastic State Dependent Population Games in Wireless Communication , 2011, IEEE Transactions on Automatic Control.

[8] Eitan Altman,et al. An Anonymous Sequential Game Approach for Battery State Dependent Power Control , 2009, NET-COOP.

[9] Alan S. Manne. Linear Programming and Sequential Decision Models , 1959 .

[10] R. Rosenthal,et al. Anonymous sequential games , 1988 .

[11] D. Bernhardt,et al. Anonymous sequential games: Existence and characterization of equilibria , 1995 .

[12] Stella C. Dafermos,et al. Traffic assignment problem for a general network , 1969 .

[13] Michael Patriksson,et al. The Traffic Assignment Problem: Models and Methods , 2015 .

[14] P. Lions,et al. Jeux à champ moyen. I – Le cas stationnaire , 2006 .

[15] Eitan Altman,et al. Stationary Anonymous Sequential Games with Undiscounted Rewards , 2011, Journal of Optimization Theory and Applications.

[16] R. B. Kulkarni,et al. Linear programming formulations of Markov decision processes , 1986 .

[17] D. Gomes,et al. Discrete Time, Finite State Space Mean Field Games , 2010 .

[18] J. Wardrop. ROAD PAPER. SOME THEORETICAL ASPECTS OF ROAD TRAFFIC RESEARCH. , 1952 .

[19] S. Shankar Sastry,et al. Markov Decision Process Routing Games , 2017, 2017 ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS).

[20] Olivier Guéant. From infinity to one: The reduction of some mean field games to a global control problem , 2011, 1110.3441.