Sequential Peer Prediction: Learning to Elicit Effort using Posted Prices

Peer prediction mechanisms are often adopted to elicit truthful contributions from crowd workers when no ground-truth verification is available. Recently, mechanisms of this type have been developed to incentivize effort exertion, in addition to truthful elicitation. In this paper, we study a sequential peer prediction problem where a data requester wants to dynamically determine the reward level to optimize the trade-off between the quality of information elicited from workers and the total expected payment. In this problem, workers have homogeneous expertise and heterogeneous cost for exerting effort, both unknown to the requester. We propose a sequential posted-price mechanism to dynamically learn the optimal reward level from workers' contributions and to incentivize effort exertion and truthful reporting. We show that (1) in our mechanism, workers exerting effort according to a non-degenerate threshold policy and then reporting truthfully is an equilibrium that returns highest utility for each worker, and (2) The regret of our learning mechanism w.r.t. offering the optimal reward (price) is upper bounded by $\tilde{O}(T^{3/4})$ where $T$ is the learning horizon. We further show the power of our learning approach when the reports of workers do not necessarily follow the game-theoretic equilibrium.

[1]  David C. Parkes,et al.  Dwelling on the Negative: Incentivizing Effort in Peer Prediction , 2013, HCOMP.

[2]  Boi Faltings,et al.  Mechanisms for Making Crowds Truthful , 2014, J. Artif. Intell. Res..

[3]  Peter Auer,et al.  Finite-time Analysis of the Multiarmed Bandit Problem , 2002, Machine Learning.

[4]  David C. Parkes,et al.  A Robust Bayesian Truth Serum for Small Populations , 2012, AAAI.

[5]  David C. Parkes,et al.  Learning the Prior in Minimal Peer Prediction , 2013 .

[6]  Boi Faltings,et al.  A Robust Bayesian Truth Serum for Non-Binary Signals , 2013, AAAI.

[7]  Paul Resnick,et al.  Eliciting Informative Feedback: The Peer-Prediction Method , 2005, Manag. Sci..

[8]  David C. Parkes,et al.  Peer prediction without a common prior , 2012, EC '12.

[9]  Aaron Roth,et al.  Conducting truthful surveys, cheaply , 2012, EC '12.

[10]  Jacob Abernethy,et al.  Actively Purchasing Data for Learning , 2015, ArXiv.

[11]  T. L. Lai Andherbertrobbins Asymptotically Efficient Adaptive Allocation Rules , 2022 .

[12]  Jens Witkowski,et al.  A Geometric Method to Construct Minimal Peer Prediction Mechanisms , 2016, AAAI.

[13]  V. Shnayder,et al.  Informed Truthfulness in MultiTask Peer Prediction ( Working paper ) , 2016 .

[14]  Arpit Agarwal,et al.  Informed Truthfulness in Multi-Task Peer Prediction , 2016, EC.

[15]  Yiling Chen,et al.  Output Agreement Mechanisms and Common Knowledge , 2014, HCOMP.

[16]  Shuchi Chawla,et al.  Mechanism design for data science , 2014, EC.

[17]  Boi Faltings,et al.  Collusion-resistant, incentive-compatible feedback payments , 2007, EC '07.

[18]  D. Prelec A Bayesian Truth Serum for Subjective Data , 2004, Science.

[19]  Ilya Segal,et al.  The communication requirements of social choice rules and supporting budget sets , 2007, J. Econ. Theory.

[20]  Yang Liu,et al.  Learning to Incentivize: Eliciting Effort via Output Agreement , 2016, IJCAI.

[21]  Anirban Dasgupta,et al.  Crowdsourced judgement elicitation with endogenous proficiency , 2013, WWW.