论文信息 - Fairness in Multi-Agent Sequential Decision-Making

Fairness in Multi-Agent Sequential Decision-Making

We define a fairness solution criterion for multi-agent decision-making problems, where agents have local interests. This new criterion aims to maximize the worst performance of agents with a consideration on the overall performance. We develop a simple linear programming approach and a more scalable game-theoretic approach for computing an optimal fairness policy. This game-theoretic approach formulates this fairness optimization as a two-player zero-sum game and employs an iterative algorithm for finding a Nash equilibrium, corresponding to an optimal fairness policy. We scale up this approach by exploiting problem structure and value function approximation. Our experiments on resource allocation problems show that this fairness criterion provides a more favorable solution than the utilitarian criterion, and that our game-theoretic approach is significantly faster than linear programming.

Julie A. Shah | Chongjie Zhang | J. Shah | Chongjie Zhang

[1] Shimon Whiteson,et al. A Survey of Multi-Objective Sequential Decision-Making , 2013, J. Artif. Intell. Res..

[2] Ariel D. Procaccia,et al. Truth, justice, and cake cutting , 2010, Games Econ. Behav..

[3] Dritan Nace,et al. Max-min fairness and its applications to routing and load-balancing in communication networks: a tutorial , 2008, IEEE Communications Surveys & Tutorials.

[4] Ariel D. Procaccia. Thou Shalt Covet Thy Neighbor's Cake , 2009, IJCAI.

[5] Serge-Christophe Kolm,et al. The theory of justice , 1996 .

[6] Laurent Massoulié,et al. Impact of fairness on Internet performance , 2001, SIGMETRICS '01.

[7] Patrice Perny,et al. A Compromise Programming Approach to multiobjective Markov Decision Processes , 2011, Int. J. Inf. Technol. Decis. Mak..

[8] R. S. Laundy,et al. Multiple Criteria Optimisation: Theory, Computation and Application , 1989 .

[9] Yann Chevaleyre,et al. Issues in Multiagent Resource Allocation , 2006, Informatica.

[10] Avrim Blum,et al. Planning in the Presence of Cost Functions Controlled by an Adversary , 2003, ICML.

[11] Andrew McLennan,et al. Asymptotic expected number of Nash equilibria of two-player normal form games , 2005, Games Econ. Behav..

[12] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..

[13] Ariel D. Procaccia,et al. No agent left behind: dynamic fair division of multiple resources , 2013, AAMAS.

[14] Yoav Shoham,et al. Simple search methods for finding a Nash equilibrium , 2004, Games Econ. Behav..

[15] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[16] Toby Walsh,et al. Online Cake Cutting , 2010, ADT.