Budget-Optimal Task Allocation for Reliable Crowdsourcing Systems

Crowdsourcing systems, in which numerous tasks are electronically distributed to numerous “information pieceworkers,” have emerged as an effective paradigm for human-powered solving of large-scale problems in domains such as image classification, data entry, optical character recognition, recommendation, and proofreading. Because these low-paid workers can be unreliable, nearly all such systems must devise schemes to increase confidence in their answers, typically by assigning each task multiple times and combining the answers in an appropriate manner, e.g., by majority voting. In this paper, we consider a general model of such crowdsourcing tasks and pose the problem of minimizing the total price, i.e., the number of task assignments, that must be paid to achieve a target overall reliability. We give a new algorithm for deciding which tasks to assign to which workers and for inferring correct answers from the workers' answers. We show that our algorithm, inspired by belief propagation and low-rank matrix approximation, significantly outperforms majority voting and, in fact, is optimal, as established by comparison with an oracle that knows the reliability of every worker. Further, we compare our approach with a more general class of algorithms that can assign tasks dynamically. By adaptively deciding which questions to ask the next set of arriving workers, one might hope to reduce uncertainty more efficiently. We show that, perhaps surprisingly, the minimum price necessary to achieve a target reliability scales in the same manner under both adaptive and nonadaptive scenarios. Hence, our nonadaptive approach is order optimal under both scenarios. This result relies strongly on the fact that workers are fleeting and cannot be exploited. Therefore, architecturally, our results suggest that building a reliable worker-reputation system is essential to fully harnessing the potential of adaptive designs.
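To make the baseline concrete, the following is a minimal sketch of the majority-voting scheme mentioned above; it is not the algorithm proposed in the paper. It assumes binary answers encoded as +1 and -1, and the names `majority_vote`, `total_price`, and the sample task and worker identifiers are hypothetical, chosen only for illustration.

```python
from collections import defaultdict


def majority_vote(assignments):
    """Estimate each task's answer by summing its +1/-1 votes.

    `assignments` is a list of (task_id, worker_id, answer) triples,
    with answer in {+1, -1} (an assumed encoding, not from the paper).
    Ties are broken toward +1, which is an arbitrary choice.
    """
    totals = defaultdict(int)
    for task_id, _worker_id, answer in assignments:
        totals[task_id] += answer
    return {task: (1 if score >= 0 else -1) for task, score in totals.items()}


def total_price(assignments):
    """The price is the number of task assignments that must be paid."""
    return len(assignments)


# Hypothetical data: two tasks, each assigned to three workers.
answers = [
    ("t1", "w1", +1), ("t1", "w2", +1), ("t1", "w3", -1),
    ("t2", "w1", -1), ("t2", "w4", -1), ("t2", "w5", +1),
]
estimates = majority_vote(answers)
price = total_price(answers)
```

Majority voting weights every worker equally. An oracle that knows each worker's reliability could instead weight reliable workers more heavily, which is the kind of comparison point the abstract refers to.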
