Incentivising Exploration and Recommendations for Contextual Bandits with Payments

We propose a contextual-bandit-based model that captures the learning and social-welfare goals of a web platform serving myopic users. By using payments to incentivize these agents to explore different items/recommendations, we show how the platform can learn the items' inherent attributes and achieve sublinear regret while maximizing cumulative social welfare. We also derive theoretical bounds on the platform's cumulative cost of incentivization. Unlike previous work in this domain, we allow contexts to be chosen completely adversarially, with the adversary's behavior unknown to the platform. Our approach can improve user-engagement metrics on e-commerce stores, recommendation engines, and matching platforms.
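The mechanism described above can be illustrated with a minimal simulation sketch. This is not the paper's algorithm: it assumes a linear reward model, a LinUCB-style learner, and a payment rule that compensates a myopic user for the estimated utility gap between their greedy choice and the platform's exploratory recommendation; the parameters (`d`, `K`, `alpha`, noise scale) are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
d, K, T = 5, 4, 2000                 # context dim, number of items, rounds (hypothetical)
theta = rng.normal(size=(K, d))      # hidden item attributes the platform must learn

# Per-item ridge-regression state for a LinUCB-style learner
A = [np.eye(d) for _ in range(K)]
b = [np.zeros(d) for _ in range(K)]

alpha = 1.0                          # width of the exploration bonus (hypothetical)
total_payment = 0.0                  # cumulative cost of incentivization

for t in range(T):
    x = rng.normal(size=d)           # context (adversarial in the paper; random here)
    est = np.array([np.linalg.solve(A[k], b[k]) @ x for k in range(K)])
    ucb = np.array([est[k] + alpha * np.sqrt(x @ np.linalg.solve(A[k], x))
                    for k in range(K)])
    greedy = int(np.argmax(est))     # item a myopic user would pick on their own
    chosen = int(np.argmax(ucb))     # item the platform wants explored
    # Pay the estimated utility gap so the myopic user accepts the recommendation
    total_payment += max(0.0, est[greedy] - est[chosen])
    r = theta[chosen] @ x + rng.normal(scale=0.1)
    A[chosen] += np.outer(x, x)
    b[chosen] += r * x
```

As the estimates converge, the greedy and UCB choices coincide more often, so the per-round payment shrinks, which is the intuition behind bounding the platform's cumulative incentivization cost.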
