Pricing a Low-regret Seller

As the number of ad exchanges has grown, publishers have turned to low regret learning algorithms to decide which exchange offers the best price for their inventory. This in turn opens the following question for the exchange: how to set prices to attract as many sellers as possible and maximize revenue. In this work we formulate this precisely as a learning problem, and present algorithms showing that by simply knowing that the counterparty is using a low regret algorithm is enough for the exchange to have its own low regret learning algorithm to find the optimal price.

[1]  Claudio Gentile,et al.  Ieee Transactions on Information Theory 1 Regret Minimization for Reserve Prices in Second-price Auctions , 2022 .

[2]  Umar Syed,et al.  Learning Prices for Repeated Auctions with Strategic Buyers , 2013, NIPS.

[3]  Lihong Li,et al.  An Empirical Evaluation of Thompson Sampling , 2011, NIPS.

[4]  Mehryar Mohri,et al.  Learning Theory and Algorithms for revenue optimization in second price auctions with reserve , 2013, ICML.

[5]  Brendan Kitts,et al.  Optimal Bidding on Keyword Auctions , 2004, Electron. Mark..

[6]  W. R. Thompson ON THE LIKELIHOOD THAT ONE UNKNOWN PROBABILITY EXCEEDS ANOTHER IN VIEW OF THE EVIDENCE OF TWO SAMPLES , 1933 .

[7]  A. Proutière,et al.  Repeated Auctions under Budget Constraints : Optimal bidding strategies and Equilibria , 2012 .

[8]  Brendan Kitts,et al.  A Formal Analysis of Search Auctions Including Predictions on Click Fraud and Bidding Tactics , 2014 .

[9]  H. Varian,et al.  Conditioning Prices on Purchase History , 2005 .

[10]  Tim Roughgarden,et al.  The price of anarchy in games of incomplete information , 2012, SECO.

[11]  Vijay Kumar,et al.  Online learning in online auctions , 2003, SODA '03.

[12]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[13]  S. Bikhchandani Reputation in repeated second-price auctions , 1988 .

[14]  Éva Tardos,et al.  Econometrics for Learning Agents , 2015, EC.

[15]  Charles J. Thomas Market Structure and the Flow of Information In Repeated Auctions , 1999 .

[16]  Brendan Lucier,et al.  Beyond Equilibria: Mechanisms for Repeated Combinatorial Auctions , 2009, ICS.

[17]  Joaquin Quiñonero Candela,et al.  Web-Scale Bayesian Click-Through rate Prediction for Sponsored Search Advertising in Microsoft's Bing Search Engine , 2010, ICML.

[18]  Umar Syed,et al.  Repeated Contextual Auctions with Strategic Buyers , 2014, NIPS.

[19]  Peter Key,et al.  Optimal Bidding Strategies and Equilibria in Dynamic Auctions with Budget Constraints , 2013 .

[20]  Hayley H Chouinard,et al.  Repeated Auctions with the Right of First Refusal , 2006 .

[21]  Claire Mathieu,et al.  Greedy bidding strategies for keyword auctions , 2007, EC '07.

[22]  Benjamin Edelman,et al.  Strategic bidder behavior in sponsored search auctions , 2007, Decis. Support Syst..

[23]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[24]  Felix Wu,et al.  Incentive-compatible online auctions for digital goods , 2002, SODA '02.

[25]  Frank Thomson Leighton,et al.  The value of knowing a demand curve: bounds on regret for online posted-price auctions , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[26]  Éva Tardos,et al.  Composable and efficient mechanisms , 2012, STOC '13.

[27]  Peter Auer,et al.  The Nonstochastic Multiarmed Bandit Problem , 2002, SIAM J. Comput..

[28]  Martin Pesendorfer,et al.  Bidding behavior in a repeated procurement auction: A summary , 2000 .

[29]  H. Robbins,et al.  Asymptotically efficient adaptive allocation rules , 1985 .