Aggregating Unstructured Submissions for Reliable Answers in Crowdsourcing Systems

In crowdsourcing systems, requesters assign tasks to multiple workers for obtaining reliable solutions. Task aggregation is performed on submissions received from crowd workers. Existing aggregation methods focus on structured submissions without accounting reliability factor of workers. Hence they are not suitable for aggregating unstructured answers. This paper proposes a novel task aggregation approach for a generic crowdsourcing tasks using an iterative probabilistic model. We make use of the reliability parameter and expertness of workers along with similarity information and requesters feedback. Experiments on the empirical data demonstrate that our scheme yields better results compared to existing state-of-the-art approaches.

[1]  Tom Minka,et al.  How To Grade a Test Without Knowing the Answers - A Bayesian Graphical Model for Adaptive Crowdsourcing and Aptitude Testing , 2012, ICML.

[2]  Sandhya Harikumar,et al.  Logistic regression within DBMS , 2016, 2016 2nd International Conference on Contemporary Computing and Informatics (IC3I).

[3]  Ashalatha Nayak,et al.  A Web 2.0-based internal crowdsourcing solution for tacit knowledge externalisation in enterprises , 2017, Int. J. Web Eng. Technol..

[4]  Bin Bi,et al.  Iterative Learning for Reliable Crowdsourcing Systems , 2012 .

[5]  Charles Elkan,et al.  Expectation Maximization Algorithm , 2010, Encyclopedia of Machine Learning.

[6]  Guoliang Li,et al.  Truth Inference in Crowdsourcing: Is the Problem Solved? , 2017, Proc. VLDB Endow..

[7]  G P Sajeev,et al.  SpreadMax: A Scalable Cascading Model for Influence Maximization in Social Networks , 2018, 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[8]  Xueqi Cheng,et al.  Finding High-Quality Unstructured Submissions in General Crowdsourcing Tasks , 2018, CCIR.

[9]  Hisashi Kashima,et al.  Statistical quality estimation for general crowdsourcing tasks , 2013, HCOMP.

[10]  Jian Wu,et al.  Crowdsourced Label Aggregation Using Bilayer Collaborative Clustering , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[11]  Jian Peng,et al.  Variational Inference for Crowdsourcing , 2012, NIPS.

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  Derek Doran,et al.  Finding and validating medical information shared on Twitter: experiences using a crowdsourcing approach , 2019, Int. J. Web Eng. Technol..

[14]  G P Sajeev,et al.  A Trace Driven Analysis of Incentive Based Crowdsourcing Workload , 2018, 2018 International Conference on Data Science and Engineering (ICDSE).

[15]  Shie-Jue Lee,et al.  A Similarity Measure for Text Classification and Clustering , 2014, IEEE Transactions on Knowledge and Data Engineering.

[16]  A. P. Dawid,et al.  Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm , 1979 .

[17]  Hisashi Kashima,et al.  Incorporating Worker Similarity for Label Aggregation in Crowdsourcing , 2018, ICANN.

[18]  Bo An,et al.  Understanding Crowdsourcing Systems from a Multiagent Perspective and Approach , 2018, ACM Trans. Auton. Adapt. Syst..

[19]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[20]  Guoliang Li,et al.  Crowdsourced Data Management: A Survey , 2016, IEEE Transactions on Knowledge and Data Engineering.

[22]  Kok-Leong Ong,et al.  Improving accuracy and lowering cost in crowdsourcing through an unsupervised expertise estimation approach , 2019, Decis. Support Syst..

[23]  Milad Shokouhi,et al.  Community-based bayesian aggregation models for crowdsourcing , 2014, WWW.

[24]  Kok-Leong Ong,et al.  Framework and Literature Analysis for Crowdsourcing’s Answer Aggregation , 2020, J. Comput. Inf. Syst..

[25]  Lei Chen,et al.  DLTA: A Framework for Dynamic Crowdsourcing Classification Tasks , 2019, IEEE Transactions on Knowledge and Data Engineering.

[26]  Karl Aberer,et al.  An Evaluation of Aggregation Techniques in Crowdsourcing , 2013, WISE.