An optimal bidimensional multi-armed bandit auction for multi-unit procurement

We study the problem of a buyer who gains stochastic rewards by procuring through an auction, multiple units of a service or item from a pool of heterogeneous agents who are strategic on two dimensions, namely cost and capacity. The reward obtained for a single unit from an allocated agent depends on the inherent quality of the agent; the agent’s quality is fixed but unknown. Each agent can only supply a limited number of units (capacity of the agent). The cost incurred per unit and capacity (maximum number of units that can be supplied) are private information of each agent. The auctioneer is required to elicit from the agents their costs as well as capacities (making the mechanism design bidimensional) and further, learn the qualities of the agents as well, with a view to maximize her utility. Motivated by this, we design a bidimensional multi-armed bandit procurement auction that seeks to maximize the expected utility of the auctioneer subject to incentive compatibility and individual rationality, while simultaneously learning the unknown qualities of the agents. We first work with the assumption that the qualities are known, and propose an optimal, truthful mechanism 2D-OPT for the auctioneer to elicit costs and capacities. Next, in order to learn the qualities of the agents as well, we provide sufficient conditions for a learning algorithm to be Bayesian incentive compatible and individually rational. We finally design a novel learning mechanism, 2D-UCB that is stochastic Bayesian incentive compatible and individually rational.

[1]  Robert D. Kleinberg,et al.  Learning on a budget: posted price mechanisms for online procurement , 2012, EC '12.

[2]  Aleksandrs Slivkins,et al.  Adaptive contract design for crowdsourcing markets: bandit algorithms for repeated principal-agent problems , 2014, J. Artif. Intell. Res..

[3]  Jason D. Hartline Bayesian Mechanism Design , 2013, Found. Trends Theor. Comput. Sci..

[4]  Sujit Gujar,et al.  A quality assuring multi-armed bandit crowdsourcing mechanism with incentive compatible learning , 2014, AAMAS.

[5]  Debasis Mishra,et al.  Multidimensional mechanism design: key results and research issues , 2012 .

[6]  Moshe Babaioff,et al.  Truthful mechanisms with implicit payment computation , 2010, EC '10.

[7]  Nikhil R. Devanur,et al.  Bandits with concave rewards and convex knapsacks , 2014, EC.

[8]  Sujit Gujar,et al.  Multi-Armed Bandit Mechanisms for Multi-Slot Sponsored Search Auctions , 2010, ArXiv.

[9]  Yaron Singer,et al.  Pricing mechanisms for crowdsourcing markets , 2013, WWW.

[10]  Moshe Babaioff,et al.  Multi-parameter mechanisms with implicit payment computation , 2013, EC '13.

[11]  Sujit Gujar,et al.  Optimal multi-unit combinatorial auctions , 2013, Oper. Res..

[12]  Chien-Ju Ho,et al.  Adaptive Task Assignment for Crowdsourced Classification , 2013, ICML.

[13]  Nicholas R. Jennings,et al.  Efficient crowdsourcing of unknown experts using bounded multi-armed bandits , 2014, Artif. Intell..

[14]  Alessandro Lazaric,et al.  A truthful learning mechanism for contextual multi-slot sponsored search auctions with externalities , 2012, EC '12.

[15]  E. Maasland,et al.  Auction Theory , 2021, Springer Texts in Business and Economics.

[16]  Michael D. Buhrmester,et al.  Amazon's Mechanical Turk , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[17]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[18]  G. Iyengar,et al.  Optimal procurement mechanisms for divisible goods with capacitated suppliers , 2008 .

[19]  Nicholas R. Jennings,et al.  Efficient budget allocation with accuracy guarantees for crowdsourcing classification tasks , 2013, AAMAS.

[20]  Sébastien Bubeck,et al.  Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems , 2012, Found. Trends Mach. Learn..

[21]  Moshe Babaioff,et al.  Characterizing truthful multi-armed bandit mechanisms: extended abstract , 2009, EC '09.

[22]  Andreas Krause,et al.  Truthful incentives in crowdsourcing tasks using regret minimization mechanisms , 2013, WWW.

[23]  Roger B. Myerson,et al.  Optimal Auction Design , 1981, Math. Oper. Res..

[24]  Michael H. Rothkopf,et al.  Thirteen Reasons Why the Vickrey-Clarke-Groves Process Is Not Practical , 2007, Oper. Res..

[25]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[26]  Archie C. Chapman,et al.  Knapsack Based Optimal Policies for Budget-Limited Multi-Armed Bandits , 2012, AAAI.

[27]  Ittai Abraham,et al.  Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem , 2013, COLT.

[28]  Debmalya Mandal,et al.  A novel ex-post truthful mechanism for multi-slot sponsored search auctions , 2014, AAMAS.

[29]  Kellen Petersen August Real Analysis , 2009 .

[30]  Chien-Ju Ho,et al.  Online Task Assignment in Crowdsourcing Markets , 2012, AAAI.

[31]  Nikhil R. Devanur,et al.  The price of truthfulness for pay-per-click auctions , 2009, EC '09.