论文信息 - TrueLabel + Confusions: A Spectrum of Probabilistic Models in Analyzing Multiple Ratings

TrueLabel + Confusions: A Spectrum of Probabilistic Models in Analyzing Multiple Ratings

This paper revisits the problem of analyzing multiple ratings given by different judges. Different from previous work that focuses on distilling the true labels from noisy crowdsourcing ratings, we emphasize gaining diagnostic insights into our in-house well-trained judges. We generalize the well-known DAWIDSKENE model (Dawid & Skene, 1979) to a spectrum of probabilistic models under the same "TrueLabel + Confusion" paradigm, and show that our proposed hierarchical Bayesian model, called HYBRIDCONFUSION, consistently outperforms DAWIDSKENE on both synthetic and real-world data sets.

Chao Liu | Yi-Min Wang | Yi-Min Wang | Chao Liu

[1] Matthew Stephens. Dealing with multimodal posteriors and non-identifiabilit y in mixture models , 1999 .

[2] Hyun-Chul Kim,et al. Bayesian Classifier Combination , 2012, AISTATS.

[3] Pietro Perona,et al. Inferring Ground Truth from Subjective Labelling of Venus Images , 1994, NIPS.

[4] Panagiotis G. Ipeirotis,et al. Managing crowdsourced human computation: a tutorial , 2011, WWW.

[5] Panagiotis G. Ipeirotis,et al. Get another label? improving data quality and data mining using multiple, noisy labelers , 2008, KDD.

[6] Gerardo Hermosillo,et al. Supervised learning from multiple experts: whom to trust when everyone lies a bit , 2009, ICML '09.

[7] Javier R. Movellan,et al. Whose Vote Should Count More: Optimal Integration of Labels from Labelers of Unknown Expertise , 2009, NIPS.

[8] A. P. Dawid,et al. Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm , 1979 .

[9] Andrew Thomas,et al. WinBUGS - A Bayesian modelling framework: Concepts, structure, and extensibility , 2000, Stat. Comput..

[10] Pietro Perona,et al. The Multidimensional Wisdom of Crowds , 2010, NIPS.

[11] Pietro Perona,et al. Online crowdsourcing: Rating annotators and obtaining cost-effective labels , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.