Online Learning Using Only Peer Prediction

This paper considers a variant of the classical online learning problem with expert predictions. What distinguishes our model, and makes it challenging, is the absence of any direct feedback on the loss each expert incurs at each time step $t$. We propose an approach based on peer prediction and identify conditions under which it succeeds. Our techniques revolve around a carefully designed peer score function $s()$ that scores each expert's predictions against the peer consensus. We establish a sufficient condition, which we call \emph{peer calibration}, under which standard online learning algorithms fed loss feedback computed by the carefully crafted $s()$ achieve bounded regret with respect to the unrevealed ground-truth values. We then demonstrate how suitable $s()$ functions can be derived under different assumptions and models.
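To make the setup concrete, here is a minimal sketch of the kind of pipeline the abstract describes: a standard multiplicative-weights (Hedge) learner whose per-round losses come not from the ground truth but from a peer score. The particular score used here (disagreement with a randomly chosen reference peer) is a hypothetical placeholder, not the paper's construction of $s()$; the function and parameter names are illustrative only.

```python
# Minimal sketch, assuming a placeholder peer score in place of the paper's s().
# Hedge / multiplicative weights driven purely by peer-prediction feedback,
# with no access to the ground-truth losses.
import numpy as np

def peer_score(predictions, i, rng):
    """Surrogate loss for expert i: disagreement with a random reference peer."""
    peers = [j for j in range(len(predictions)) if j != i]
    ref = rng.choice(peers)
    return float(predictions[i] != predictions[ref])

def hedge_with_peer_feedback(expert_preds, eta=0.1, seed=0):
    """expert_preds: (T, N) array of binary predictions from N experts over T rounds."""
    rng = np.random.default_rng(seed)
    T, N = expert_preds.shape
    log_w = np.zeros(N)  # log-weights for numerical stability
    for t in range(T):
        # Losses come only from the peer score, never from the hidden ground truth.
        losses = np.array([peer_score(expert_preds[t], i, rng) for i in range(N)])
        log_w -= eta * losses  # multiplicative-weights update on peer losses
    w = np.exp(log_w - log_w.max())
    return w / w.sum()  # final distribution over experts

# Usage example with synthetic predictions from 5 experts over 100 rounds.
preds = np.random.default_rng(1).integers(0, 2, size=(100, 5))
print(hedge_with_peer_feedback(preds))
```

The point of the sketch is only the interface: the learner is unchanged, and all of the modeling effort sits in how the peer score is defined, which is where peer calibration enters.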
