Paper information - Reconciling Rationality and Stochasticity: Rich Behavioral Models in Two-Player Games

Two traditional paradigms are often used to describe the behavior of agents in complex multi-agent systems. In the first, agents are considered fully rational and systems are seen as multi-player games. In the second, agents are considered fully stochastic processes and the system itself is seen as a large stochastic process. From the standpoint of a particular agent that has to choose a strategy, the choice of paradigm is crucial: the most adequate strategy depends on the assumptions made about the other agents. In this paper, we focus on two-player games and their application to the automated synthesis of reliable controllers for reactive systems, a field at the crossroads between computer science and mathematics. In this setting, the reactive system to control is a player, and its environment is its opponent, usually assumed to be either fully antagonistic or fully stochastic. We illustrate several recent developments that aim to break out of this narrow taxonomy by providing formal concepts and mathematical frameworks to reason about richer behavioral models. The interest of such models is not limited to reactive system synthesis but extends to other application fields of game theory. The goal of our contribution is to give a high-level presentation of key concepts and applications, aimed at a broad audience. To achieve this goal, we illustrate those rich behavioral models on a classical challenge of everyday life: planning a journey in an uncertain environment.

Mickael Randour
