We consider the problem of optimal budget allocation for crowdsourcing problems, allocating users to tasks to maximize our final confidence in the crowdsourced answers. Such an optimized worker assignment method allows us to boost the efficacy of any popular crowdsourcing estimation algorithm. We consider a mutual information interpretation of the crowdsourcing problem, which leads to a stochastic subset selection problem with a submodular objective function. We present experimental simulation results which demonstrate the effectiveness of our dynamic task allocation method for achieving higher accuracy, possibly requiring fewer labels, as well as improving upon a previous method which is sensitive to the proportion of users to questions.
[1]
Thomas M. Cover,et al.
Elements of Information Theory
,
2005
.
[2]
Xi Chen,et al.
Spectral Methods Meet EM: A Provably Optimal Algorithm for Crowdsourcing
,
2014,
J. Mach. Learn. Res..
[3]
Andreas Krause,et al.
Submodular Function Maximization
,
2014,
Tractability.
[4]
Qiang Liu,et al.
Aggregating Ordinal Labels from Crowds by Minimax Conditional Entropy
,
2014,
ICML.
[5]
Chao Gao,et al.
Minimax Optimal Convergence Rates for Estimating Ground Truth from Crowdsourced Labels
,
2013,
1310.5764.
[6]
A. P. Dawid,et al.
Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm
,
1979
.