Shopbots are software agents that automatically query multiple sellers on the Internet to gather information about prices and other attributes of consumer goods and services. Rapidly increasing in number and sophistication, shopbots are helping more and more buyers minimize expenditure and maximize satisfaction. In response at least partly to this trend, it is anticipated that sellers will come to rely on pricebots, automated agents that employ price-setting algorithms in an attempt to maximize pro ts. This paper reaches toward an understanding of strategic pricebot dynamics. More speci cally, this paper is a comparative study of four candidate price-setting strategies that di er in informational and computational requirements: gametheoretic pricing (GT), myoptimalpricing (MY), derivative following (DF), and Q-learning (Q). In an e ort to gain insights into the tradeo s between practicality and pro tability of pricebot algorithms, the dynamic behavior that arises among homogeneous and heterogeneous collections of pricebots and shopbot-assisted buyers is analyzed and simulated. In homogeneous settings | when all pricebots use the same pricing algorithm | DFs outperform MYs and GTs. Investigation of heterogeneous collections of pricebots, however, reveals an incentive for individual DFs to deviate to MY or GT. The Q strategy exhibits superior performance to all the others since it learns to predict and account for the long-term consequences of its actions. Although the current implementation of Q is impractically expensive, techniques for achieving similar performance at greatly reduced computational cost are under investigation.
[1]
J. Nash.
NON-COOPERATIVE GAMES
,
1951,
Classics in Game Theory.
[2]
H. Varian.
A Model of Sales
,
1980
.
[3]
P. Hawken.
The Next Economy
,
1983
.
[4]
K. Judd,et al.
EQUILIBRIUM PRICE DISPERSION
,
1983
.
[5]
C. Watkins.
Learning from delayed rewards
,
1989
.
[6]
J. Bertrand.
Cournot oligopoly: Review of Walras's Théorie mathématique de la richesse sociale and Cournot's Recherches sur les principes mathématiques de la théorie des richesses
,
1989
.
[7]
Drew Fudenberg,et al.
Game theory (3. pr.)
,
1991
.
[8]
J. Eriksson,et al.
Information and interaction in marketspace: towards an open agent-based market infrastructure
,
1996
.
[9]
Pattie Maes,et al.
Kasbah: An Agent Marketplace for Buying and Selling Goods
,
1996,
PAAM.
[10]
Maria L. Gini,et al.
Magma: An Agent Based Virtual Market for Electronic Commerce
,
1997,
Appl. Artif. Intell..
[11]
A. Greenwald.
Learning to Play Network Games
,
1998
.
[12]
Jeffrey O. Kephart,et al.
Dynamics of an Information-Filtering Economy
,
1998,
CIA.
[13]
Jeffrey O. Kephart,et al.
Shopbots and Pricebots
,
1999,
IJCAI.
[14]
Jeffrey O. Kephart,et al.
Shopbot Economics
,
1999,
AGENTS '99.
[15]
Dean P. Foster,et al.
Regret in the On-Line Decision Problem
,
1999
.
[16]
Gerald Tesauro,et al.
Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning
,
2001,
Sequence Learning.