Learning Improved Entertainment Trading Strategies for the TAC Travel Game

For almost five years, we continually operated a simulation testbed exploring strategies for the TAC Travel game. Building on techniques developed in our recent study of continuous double auctions, we performed an equilibrium analysis of our testbed data and employed reinforcement learning in the resulting equilibrium environment to derive a new entertainment-trading strategy for this domain. A second iteration of this process led to further improvements. We thus demonstrate that interleaving empirical game-theoretic analysis with reinforcement learning is an effective method for generating stronger trading strategies in this domain.
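The loop the abstract describes alternates between estimating an empirical game from testbed data, solving that game for an equilibrium, and training a new strategy against the equilibrium mixture. The following is a minimal illustrative sketch of such an interleaving, not the authors' code: it assumes a symmetric game summarized by a pairwise payoff matrix, and the helpers `simulate_payoffs` and `train_rl_best_response` are hypothetical stand-ins for the testbed simulator and the reinforcement-learning step.

```python
"""Illustrative EGTA + RL interleaving (sketch only, under the assumptions above)."""
import numpy as np


def replicator_equilibrium(payoff_matrix, iters=10_000, step=0.1):
    """Approximate a symmetric mixed-strategy equilibrium of the empirical game
    by running replicator dynamics on the estimated payoff matrix."""
    n = payoff_matrix.shape[0]
    mix = np.full(n, 1.0 / n)                       # start from the uniform mixture
    for _ in range(iters):
        expected = payoff_matrix @ mix              # payoff of each pure strategy vs. the mixture
        avg = mix @ expected                        # average payoff under the mixture
        mix = mix * (1 + step * (expected - avg))   # grow strategies that beat the average
        mix = np.clip(mix, 1e-12, None)
        mix /= mix.sum()                            # renormalize to a probability vector
    return mix


def egta_rl_loop(strategies, simulate_payoffs, train_rl_best_response, rounds=2):
    """One possible interleaving: estimate the empirical game over the current
    strategy set, solve it, then learn a new strategy against the equilibrium."""
    for _ in range(rounds):
        payoffs = simulate_payoffs(strategies)                      # empirical payoff estimates from testbed games
        eq_mix = replicator_equilibrium(payoffs)                    # equilibrium over the current strategies
        new_strategy = train_rl_best_response(strategies, eq_mix)   # RL in the equilibrium environment
        strategies = strategies + [new_strategy]                    # augment the strategy set and repeat
    return strategies, eq_mix
```

Each pass through the loop corresponds to one "iteration" in the abstract: the second iteration re-estimates the game with the newly learned strategy included before training again.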
