Paper Information - An MDP-Based Approach to Online Mechanism Design

Online mechanism design (MD) considers the problem of providing incentives to implement desired system-wide outcomes in systems with self-interested agents that arrive and depart dynamically. Agents can choose to misrepresent their arrival and departure times, in addition to information about their value for different outcomes. We consider the problem of maximizing the total long-term value of the system despite the self-interest of agents. The online MD problem induces a Markov Decision Process (MDP), which when solved can be used to implement optimal policies in a truth-revealing Bayesian-Nash equilibrium.

David C. Parkes | Satinder P. Singh

[1] Roger B. Myerson, et al. Optimal Auction Design, 1981, Math. Oper. Res.

[2] Dimitri P. Bertsekas, et al. Dynamic Programming: Deterministic and Stochastic Models, 1987.

[3] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.

[4] Noam Nisan, et al. Competitive analysis of incentive compatible on-line auctions, 2000, EC '00.

[5] Noam Nisan, et al. Algorithmic Mechanism Design, 2001, Games Econ. Behav.

[6] Y. Shoham, et al. Truth revelation in approximately efficient combinatorial auctions, 2002, JACM.

[7] Vijay Kumar, et al. Online learning in online auctions, 2003, SODA '03.

[8] David C. Parkes, et al. Strategyproof Mechanisms for Ad Hoc Network Formation, 2003.

[9] Eric J. Friedman, et al. Pricing WiFi at Starbucks: issues in online mechanism design, 2003, EC '03.

[10] Yossi Azar, et al. Reducing truth-telling online mechanisms to online optimization, 2003, STOC '03.

[11] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.