Learning User Preferences to Incentivize Exploration in the Sharing Economy

We study platforms in the sharing economy and discuss the need for incentivizing users to explore options that otherwise would not be chosen. For instance, rental platforms such as Airbnb typically rely on customer reviews to provide users with relevant information about different options. Yet, often a large fraction of options does not have any reviews available. Such options are frequently neglected as viable choices, and in turn are unlikely to be evaluated, creating a vicious cycle. Platforms can engage users to deviate from their preferred choice by offering monetary incentives for choosing a different option instead. To efficiently learn the optimal incentives to offer, we consider structural information in user preferences and introduce a novel algorithm - Coordinated Online Learning (CoOL) - for learning with structural information modeled as convex constraints. We provide formal guarantees on the performance of our algorithm and test the viability of our approach in a user study with data of apartments on Airbnb. Our findings suggest that our approach is well-suited to learn appropriate incentives and increase exploration on the investigated platform.

[1]  S. Sénécal,et al.  The influence of online product recommendations on consumers' online choices , 2004 .

[2]  Paul Resnick,et al.  The value of reputation on eBay: A controlled experiment , 2002 .

[3]  Mladen Kolar,et al.  Distributed Multitask Learning , 2015, ArXiv.

[4]  Andreas Krause,et al.  Actively Learning Hemimetrics with Applications to Eliciting User Preferences , 2016, ICML.

[6]  Jon M. Kleinberg,et al.  Incentivizing exploration , 2014, EC.

[7]  Inderjit S. Dhillon,et al.  Triangle Fixing Algorithms for the Metric Nearness Problem , 2004, NIPS.

[8]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[9]  Yishay Mansour,et al.  Bayesian Exploration: Incentivizing Exploration in Bayesian Games , 2016, EC.

[10]  Gábor Lugosi,et al.  Prediction, learning, and games , 2006 .

[11]  S.H.G. ten Hagen,et al.  Exploration/exploitation in adaptive recommender systems , 2003 .

[12]  Michael Luca Reviews, Reputation, and Revenue: The Case of Yelp.Com , 2016 .

[13]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[14]  Fuzhen Zhuang,et al.  Collaborating between Local and Global Learning for Distributed Online Multiple Tasks , 2015, CIKM.

[15]  Andrey Fradkin,et al.  Search Frictions and the Design of Online Marketplaces , 2015, AMMA 2015.

[16]  P. Resnick,et al.  The Market for Evaluations , 1999 .

[17]  Sanmay Das,et al.  Coordinated Versus Decentralized Exploration In Multi-Agent Multi-Armed Bandits , 2017, IJCAI.

[18]  Yishay Mansour,et al.  Implementing the “Wisdom of the Crowd” , 2013, Journal of Political Economy.

[19]  Andreas Krause,et al.  Incentivizing Users for Balancing Bike Sharing Systems , 2015, AAAI.

[20]  Martin Zinkevich,et al.  Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.

[21]  John B. Shoven,et al.  I , Edinburgh Medical and Surgical Journal.

[22]  B. Gu,et al.  The impact of online user reviews on hotel room sales , 2009 .

[23]  Yishay Mansour,et al.  Bayesian Incentive-Compatible Bandit Exploration , 2018 .

[24]  Mladen Kolar,et al.  Distributed Multi-Task Learning , 2016, AISTATS.