Rational Bidding Using Reinforcement Learning: An Application in Automated Resource Allocation

The application of autonomous agents to the provisioning and usage of computational resources is an attractive research field. Methods and technologies from artificial intelligence, statistics, and economics work together to i) achieve autonomic provisioning and usage of computational resources, ii) devise competitive bidding strategies for widely used market mechanisms, and iii) incentivize consumers and providers to use such market-based systems. The contributions of this paper are threefold. First, we present a framework that supports consumers and providers in eliciting technical and economic preferences and in generating bids. Second, we introduce a consumer-side reinforcement learning bidding strategy that enables rational behavior through the generation and selection of bids. Third, we evaluate this bidding strategy against a truth-telling bidding strategy under two kinds of market mechanisms, one centralized and one decentralized.
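To make the contrast between a learned and a truth-telling bidding strategy concrete, the following is a minimal sketch, not the strategy proposed in the paper. It assumes a stateless epsilon-greedy value learner over a fixed set of candidate bids, a pay-as-bid market, and a constant clearing price; the names `EpsilonGreedyBidder`, `pay_as_bid_utility`, `truthful_bid`, and `simulate`, as well as all numeric values, are hypothetical and chosen only for illustration.

```python
import random


def truthful_bid(valuation):
    """Truth-telling baseline: bid exactly the private valuation."""
    return valuation


def pay_as_bid_utility(bid, valuation, clearing_price):
    """Assumed market rule: a bid at or above the clearing price wins and pays itself."""
    return valuation - bid if bid >= clearing_price else 0.0


class EpsilonGreedyBidder:
    """Hypothetical consumer-side learner: one value estimate per candidate bid."""

    def __init__(self, candidate_bids, epsilon=0.1, learning_rate=0.1, seed=0):
        self.candidate_bids = list(candidate_bids)
        self.epsilon = epsilon
        self.learning_rate = learning_rate
        self.q = {bid: 0.0 for bid in self.candidate_bids}
        self.rng = random.Random(seed)  # seeded for reproducibility

    def select_bid(self):
        # Explore with probability epsilon, otherwise pick the best-valued bid.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.candidate_bids)
        return max(self.candidate_bids, key=lambda bid: self.q[bid])

    def update(self, bid, reward):
        # Move the estimate for this bid a step toward the observed reward.
        self.q[bid] += self.learning_rate * (reward - self.q[bid])


def simulate(rounds=2000, valuation=10.0, clearing_price=6.0):
    """Run repeated auctions against an assumed constant clearing price."""
    bidder = EpsilonGreedyBidder(candidate_bids=[2.0, 4.0, 6.0, 8.0, 10.0])
    for _ in range(rounds):
        bid = bidder.select_bid()
        bidder.update(bid, pay_as_bid_utility(bid, valuation, clearing_price))
    return bidder


if __name__ == "__main__":
    learned = simulate()
    print("learned bid:", max(learned.q, key=learned.q.get))
    print("truthful utility:", pay_as_bid_utility(truthful_bid(10.0), 10.0, 6.0))
```

Under these assumptions the truthful bidder always wins but earns zero surplus, while the learner can settle on the lowest bid that still clears. In a real market the clearing price varies and depends on other participants, so a practical strategy would also need a state representation and a stochastic reward model.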
