Motivated by the emergence of auction-based marketplaces for display ads such as the Right Media Exchange, we study the design of a bidding agent that implements a display advertising campaign by bidding in such a marketplace. The bidding agent must acquire a given number of impressions with a given target spend, when the highest external bid in the marketplace is drawn from an unknown distribution P. The quantity and spend constraints arise from the fact that display ads are usually sold on a CPM basis. We consider both the full information setting, where the winning price in each auction is announced publicly, and the partially observable setting where only the winner obtains information about the distribution; these differ in the penalty incurred by the agent while attempting to learn the distribution. We provide algorithms for both settings, and prove performance guarantees using bounds on uniform closeness from statistics, and techniques from online learning. We experimentally evaluate these algorithms: both algorithms perform very well with respect to both target quantity and spend; further, our algorithm for the partially observable case performs nearly as well as that for the fully observable setting despite the higher penalty incurred during learning.
[1]
J. Kiefer,et al.
Asymptotic Minimax Character of the Sample Distribution Function and of the Classical Multinomial Estimator
,
1956
.
[2]
Manfred K. Warmuth,et al.
The weighted majority algorithm
,
1989,
30th Annual Symposium on Foundations of Computer Science.
[3]
Manfred K. Warmuth,et al.
The Weighted Majority Algorithm
,
1994,
Inf. Comput..
[4]
Martin Zinkevich,et al.
Online Convex Programming and Generalized Infinitesimal Gradient Ascent
,
2003,
ICML.
[5]
Peter Auer,et al.
Finite-time Analysis of the Multiarmed Bandit Problem
,
2002,
Machine Learning.
[6]
Gábor Lugosi,et al.
Prediction, learning, and games
,
2006
.
[7]
Claire Mathieu,et al.
Greedy bidding strategies for keyword auctions
,
2007,
EC '07.