Bounded-Rational Pursuit-Evasion Games

We present a framework that incorporates the idea of bounded rationality into dynamic stochastic pursuit-evasion games. The solution of a stochastic game is characterized, in general, by its (Nash) equilibria in feedback form. However, computing these Nash equilibrium strategies may require extensive computational resources. In this paper, the agents are modeled as bounded rational entities having limited computational resources. We illustrate the framework by applying it to a pursuit-evasion game between two vehicles in a stochastic wind field, where both the pursuer and the evader are bounded rational. We show how such a game may be analyzed by properly casting it as an iterative sequence of finite-state Markov Decision Processes (MDPs). Leveraging tools and algorithms from cognitive hierarchy theory ("level-$k$ thinking") we compute the solution of the ensuing discrete game, while taking into consideration the rationality level of each agent. We also present an online algorithm for each agent to infer its opponent rationality level.

[1]  D. Kahneman,et al.  Heuristics and Biases: The Psychology of Intuitive Judgment , 2002 .

[2]  Y. Ho,et al.  Nonzero-sum differential games , 1969 .

[3]  Raymond J. Dolan,et al.  Game Theory of Mind , 2008, PLoS Comput. Biol..

[4]  J. Filar,et al.  Competitive Markov Decision Processes , 1996 .

[5]  Xiongzhi Chen Brownian Motion and Stochastic Calculus , 2008 .

[6]  P. Varaiya,et al.  Differential games , 1971 .

[7]  Wei Lin,et al.  Nash strategies for pursuit-evasion differential games involving limited observations , 2015, IEEE Transactions on Aerospace and Electronic Systems.

[8]  G. Dantzig,et al.  Linear Programming in a Markov Chain , 1962 .

[9]  W. Willman Formal solutions for a class of stochastic pursuit-evasion games , 1968 .

[10]  Robert Ivor John,et al.  Computing Nash Equilibria and Evolutionarily Stable States of Evolutionary Games , 2016, IEEE Transactions on Evolutionary Computation.

[11]  S. Sastry,et al.  Probabilistic pursuit-evasion games: a one-step Nash approach , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[12]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[13]  Colin Camerer,et al.  A Cognitive Hierarchy Model of Games , 2004 .

[14]  H. Kushner Numerical Methods for Stochastic Control Problems in Continuous Time , 2000 .

[15]  Y. Ho,et al.  Differential games and optimal pursuit-evasion strategies , 1965 .

[16]  A. V. Balakrishnan,et al.  Stochastic differential systems , 1973 .

[17]  S. Shankar Sastry,et al.  Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation , 2002, IEEE Trans. Robotics Autom..

[18]  J. Shinar,et al.  Three-dimensional optimal pursuit and evasion with bounded controls , 1980 .

[19]  Xuanming Su,et al.  A Dynamic Level-k Model in Sequential Games , 2012, Manag. Sci..

[20]  Shlomo Zilberstein,et al.  Models of Bounded Rationality , 1995 .

[21]  Roger B. Myerson,et al.  Game theory - Analysis of Conflict , 1991 .

[22]  B. Øksendal Stochastic differential equations : an introduction with applications , 1987 .

[23]  Y. Ho,et al.  On a class of linear stochastic differential games , 1968 .

[24]  João Pedro Hespanha,et al.  On Discrete-Time Pursuit-Evasion Games With Sensing Limitations , 2008, IEEE Transactions on Robotics.

[25]  Paul W. Goldberg,et al.  The complexity of computing a Nash equilibrium , 2006, STOC '06.

[26]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[27]  J.P. Hespanha,et al.  Multiple-agent probabilistic pursuit-evasion games , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[28]  D. Kahneman Maps of Bounded Rationality: Psychology for Behavioral Economics , 2003 .