Learning the Valuations of a k-demand Agent

We study problems where a learner aims to learn the valuations of an agent by observing which goods he buys under varying price vectors. More specifically, we consider the case of a k-demand agent, whose valuation over the goods is additive when receiving up to k goods, but who has no interest in receiving more than k goods. We settle the query complexity for the active-learning (preference elicitation) version, where the learner chooses the prices to post, by giving a biased binary search algorithm, generalizing the classical binary search procedure. We complement our query complexity upper bounds by lower bounds that match up to lower-order terms. We also study the passive-learning version in which the learner does not control the prices, and instead they are sampled from some distribution. We show that in the PAC model for passive learning, any empirical risk minimizer has a sample complexity that is optimal up to a factor of Õ(k).

[1]  P. Samuelson A Note on the Pure Theory of Consumer's Behaviour: An Addendum , 1938 .

[2]  Aaron Roth,et al.  Multidimensional Dynamic Pricing for Welfare Maximization , 2016, EC.

[3]  David C. Parkes,et al.  Applying learning algorithms to preference elicitation , 2004, EC '04.

[4]  Moshe Babaioff,et al.  Dynamic Pricing with Limited Supply , 2011, ACM Trans. Economics and Comput..

[5]  Ariel D. Procaccia,et al.  A Lower Bound for Equitable Cake Cutting , 2017, EC.

[6]  David C. Parkes,et al.  Iterative Combinatorial Auctions , 2006 .

[7]  Yishay Mansour,et al.  Learning What's Going on: Reconstructing Preferences and Priorities from Opaque Transactions , 2015, EC.

[8]  Maria-Florina Balcan,et al.  Learning Economic Parameters from Revealed Preferences , 2014, WINE.

[9]  Aaron Roth,et al.  Watch and learn: optimizing from revealed preferences feedback , 2015, SECO.

[10]  Omar Besbes,et al.  Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms , 2009, Oper. Res..

[11]  Zachary Chase Lipton,et al.  Learning Time Dependent Choice , 2018, ITCS.

[12]  Ariel D. Procaccia,et al.  The learnability of voting rules , 2009, Artif. Intell..

[13]  Maria-Florina Balcan,et al.  Learning submodular functions , 2010, STOC '11.

[14]  Maria-Florina Balcan,et al.  A General Theory of Sample Complexity for Multi-Item Profit Maximization , 2017, EC.

[15]  Sven Seuken,et al.  Combinatorial Auctions via Machine Learning-based Preference Elicitation , 2018, IJCAI.

[16]  Sven Seuken,et al.  Probably Approximately Efficient Combinatorial Auctions via Machine Learning , 2017, AAAI.

[17]  Rakesh V. Vohra,et al.  Learning from revealed preference , 2006, EC '06.

[18]  Avrim Blum,et al.  Preference Elicitation and Query Learning , 2004, J. Mach. Learn. Res..

[19]  Philip M. Long,et al.  Characterizations of Learnability for Classes of {0, ..., n}-Valued Functions , 1995, J. Comput. Syst. Sci..

[20]  Morteza Zadimoghaddam,et al.  Efficiently Learning from Revealed Preference , 2012, WINE.

[21]  Steven J. Brams,et al.  Fair division - from cake-cutting to dispute resolution , 1998 .

[22]  Balas K. Natarajan,et al.  On learning sets and functions , 2004, Machine Learning.

[23]  Ruta Mehta,et al.  Social Welfare and Profit Maximization from Revealed Preferences , 2017, WINE.

[24]  S. Afriat THE CONSTRUCTION OF UTILITY FUNCTIONS FROM EXPENDITURE DATA , 1967 .

[25]  Shai Ben-David,et al.  Multiclass Learnability and the ERM principle , 2011, COLT.

[26]  Tim Roughgarden,et al.  Algorithmic Game Theory , 2007 .

[27]  Tuomas Sandholm,et al.  Preference elicitation in combinatorial auctions , 2001, AAMAS '02.

[28]  Martin Hoefer,et al.  Learning Market Parameters Using Aggregate Demand Queries , 2016, AAAI.

[29]  Aaron Roth,et al.  Online Learning and Profit Maximization from Revealed Preferences , 2014, AAAI.

[30]  Vincent Conitzer Eliciting single-peaked preferences using comparison queries , 2007, AAMAS '07.

[31]  Federico Echenique,et al.  Learnability and Models of Decision Making under Uncertainty , 2018, EC.

[32]  P. Samuelson A Note on the Pure Theory of Consumer's Behaviour , 1938 .

[33]  Maria-Florina Balcan,et al.  Learning Valuation Functions , 2011, COLT.