Crowdsourcing: A taxonomy and systematic mapping study

Abstract Context: Crowdsourcing, or tapping into the power of the crowd for problem solving, has gained ever-increasing attraction since it was first introduced. Crowdsourcing has been used in different disciplines, and it is becoming well-accepted in the marketplace as a new business model which utilizes Human Intelligence Tasks (HITs). Objective: While both academia and industry have extensively delved into different aspects of crowdsourcing, there seems to be no common understanding of what crowdsourcing really means and what core and optional features it has. Also, we still lack information on the kinds and disciplines of studies conducted on crowdsourcing and how they defined it in the context of their application area. This paper will clarify this ambiguity by analysing the distribution and demographics of research in crowdsourcing and extracting taxonomy of the variability and commonality in the constructs defining the concept in the literature. Method: We conduct a systematic mapping study and analyse 113 papers, selected via a formal process, and report and discuss the results. The study is combined by a content analysis process to extract a taxonomy of features describing crowdsourcing. Results: We extract and describe the taxonomy of features which characterize crowdsourcing in its four constituents; the crowd, the crowdsourcer, the crowdsourced task and the crowdsourcing platform. In addition, we report on different mappings between these features and the characteristics of the studied papers. We also analyse the distribution of the research using multiple criteria and draw conclusions. For example, our results show a constantly increasing interest in the area, especially in North America and a significant interest from industry. Also, we illustrate that although crowdsourcing is shown to be useful in a variety of disciplines, the research in the field of computer science still seems to be dominant in investigating it. Conclusions: This study allows forming a clear picture of the research in crowdsourcing and understanding the different features of crowdsourcing and their popularity, what type of research was conducted, where and how and by whom. The study enables researchers and practitioners to estimate the current status of the research in this new field. Our taxonomy of extracted features provides a reference model which could be used to configure crowdsourcing and also define it precisely and make design decisions on which of its variation to adopt.

[1]  Eric Schenk,et al.  Crowdsourcing: What can be Outsourced to the Crowd, and Why ? , 2009 .

[2]  Jaime G. Carbonell,et al.  Towards Task Recommendation in Micro-Task Markets , 2011, Human Computation.

[3]  Souad Djelassi,et al.  Customers' participation in product development through crowdsourcing: Issues and implications , 2013 .

[4]  Huiji Gao,et al.  Harnessing the Crowdsourcing Power of Social Media for Disaster Relief , 2011, IEEE Intelligent Systems.

[5]  Gilles Adda,et al.  Economic and Ethical background of Crowdsourcing for Speech , 2013 .

[6]  Bernd Brügge,et al.  User involvement in software evolution practice: A case study , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[7]  Bei Yu,et al.  Crowdsourcing Participatory Evaluation of Medical Pictograms Using Amazon Mechanical Turk , 2013, Journal of medical Internet research.

[8]  Roel Wieringa,et al.  Requirements engineering paper classification and evaluation criteria: a proposal and a discussion , 2005, Requirements Engineering.

[9]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[10]  Hector Garcia-Molina,et al.  Turkalytics: analytics for human computation , 2011, WWW.

[11]  Martin Schreier,et al.  The Value of Crowdsourcing: Can Users Really Compete with Professionals in Generating New Product Ideas? , 2009 .

[12]  Hisashi Kashima,et al.  Statistical quality estimation for general crowdsourcing tasks , 2013, HCOMP.

[13]  Ruggiero Cavallo,et al.  Winner-Take-All Crowdsourcing Contests with Stochastic Production , 2013, HCOMP.

[14]  Andrea Castelletti,et al.  Putting humans in the loop: Social computing for Water Resources Management , 2012, Environ. Model. Softw..

[15]  Lav R. Varshney,et al.  Privacy and Reliability in Crowdsourcing Service Delivery , 2012, 2012 Annual SRII Global Conference.

[16]  Ricardo Matsumura de Araújo,et al.  99designs: An Analysis of Creative Competition in Crowdsourced Design , 2013, HCOMP.

[17]  Mahmood Hosseini,et al.  The four pillars of crowdsourcing: A reference model , 2014, 2014 IEEE Eighth International Conference on Research Challenges in Information Science (RCIS).

[18]  Maja Vukovic,et al.  Crowdsourcing for Enterprises , 2009, 2009 Congress on Services - I.

[19]  Schahram Dustdar,et al.  Tweetflows: flexible workflows with twitter , 2011, PESOS '11.

[20]  Francis D. Tuggle,et al.  Fostering innovation with KM 2.0 , 2010 .

[21]  Jorge Gonçalves,et al.  Crowdsourcing on the spot: altruistic use of public displays, feasibility, performance, and behaviours , 2013, UbiComp.

[22]  Paul Shabajee,et al.  Proceedings of the 22nd International Conference on World Wide Web , 2013 .

[24]  M. Petticrew,et al.  Systematic Reviews in the Social Sciences: A Practical Guide , 2005 .

[25]  Shourya Roy,et al.  Form digitization in BPO: from outsourcing to crowdsourcing? , 2013, CHI.

[26]  V. Chanal,et al.  How to invent a new business model based on crowdsourcing: The crowdspirit ® case , 2008 .

[27]  Mark Wexler Reconfiguring the sociology of the crowd: exploring crowdsourcing , 2011 .

[28]  Rajarshi Das,et al.  Emerging theories and models of human computation systems: a brief survey , 2011, UbiCrowd '11.

[29]  Omar Alonso,et al.  Crowdsourcing for relevance evaluation , 2008, SIGF.

[30]  Valter Crescenzi,et al.  Wrapper Generation Supervised by a Noisy Crowd , 2013, DBCrowd.

[31]  Lise Getoor,et al.  Reducing Label Cost by Combining Feature Labels and Crowdsourcing , 2011 .

[32]  Bill Tomlinson,et al.  Who are the crowdworkers?: shifting demographics in mechanical turk , 2010, CHI Extended Abstracts.

[33]  Alessandro Bozzon,et al.  Reactive crowdsourcing , 2013, WWW.

[34]  Lukas Biewald,et al.  Programmatic Gold: Targeted and Scalable Quality Assurance in Crowdsourcing , 2011, Human Computation.

[35]  Gary Hsieh,et al.  Understanding and Designing for Cultural Differences on Crowdsourcing Marketplaces , 2011 .

[36]  Michael Vitale,et al.  The Wisdom of Crowds , 2015, Cell.

[37]  Wolf-Tilo Balke,et al.  Information Extraction Meets Crowdsourcing: A Promising Couple , 2012, Datenbank-Spektrum.

[38]  Pierre André,et al.  Citizen Participation , 1977 .

[39]  Chien-Ju Ho,et al.  Adaptive Task Assignment for Crowdsourced Classification , 2013, ICML.

[40]  Walid Maalej,et al.  User feedback in the appstore: An empirical study , 2013, 2013 21st IEEE International Requirements Engineering Conference (RE).

[41]  Frank Kleemann,et al.  Un(der)paid innovators: the commercial utilization of consumer work through crowdsourcing , 2008 .

[42]  Marta R. Costa-jussà,et al.  Opinion Mining of Spanish Customer Comments with Non-Expert Annotations on Mechanical Turk , 2010, Mturk@HLT-NAACL.

[43]  E. Seltzer,et al.  Citizen Participation, Open Innovation, and Crowdsourcing , 2013 .

[44]  Wei-Tek Tsai,et al.  Creative software crowdsourcing: from components and algorithm development to project concept formations , 2013, Int. J. Creative Comput..

[45]  Christian Borch,et al.  The Politics of Crowds: An Alternative History of Sociology , 2012 .

[46]  Desney S. Tan,et al.  CHI '11 Extended Abstracts on Human Factors in Computing Systems , 2008, CHI 2011.

[47]  Boyang Li,et al.  Learning Sociocultural Knowledge via Crowdsourced Examples , 2012, HCOMP@AAAI.

[48]  Qinghua Zhu,et al.  Evaluation on crowdsourcing research: Current status and future direction , 2012, Information Systems Frontiers.

[49]  Mark Harman,et al.  Pricing crowdsourcing-based software development tasks , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[50]  Fernando González-Ladrón-de-Guevara,et al.  Towards an integrated crowdsourcing definition , 2012, J. Inf. Sci..

[51]  Duncan J. Watts,et al.  Financial incentives and the "performance of crowds" , 2009, HCOMP '09.

[52]  Pavel Korshunov,et al.  Crowdsourcing-based multimedia subjective evaluations: a case study on image recognizability and aesthetic appeal , 2013, CrowdMM '13.

[53]  Hongwei Li,et al.  Error Rate Analysis of Labeling by Crowdsourcing , 2013 .

[54]  Bashar Nuseibeh,et al.  Social Adaptation - When Software Gives Users a Voice , 2012, ENASE.

[55]  Hans Thies,et al.  Dynamic and Goal-Based Quality Management for Human-Based Electronic Services , 2012, Int. J. Cooperative Inf. Syst..

[56]  Daren C. Brabham Crowdsourcing as a Model for Problem Solving , 2008 .

[57]  Matthew Reid,et al.  Quality control mechanisms for crowdsourcing: peer review, arbitration, & expertise at familysearch indexing , 2013, CSCW '13.

[58]  Roman Lukyanenko,et al.  Conceptual modeling principles for crowdsourcing , 2012, CrowdSens '12.

[59]  Max Bader Crowdsourcing election monitoring in the 2011–2012 Russian elections , 2013 .

[60]  Per Runeson,et al.  Software product line testing - A systematic mapping study , 2011, Inf. Softw. Technol..

[61]  Kathryn T. Stolee,et al.  Exploring the use of crowdsourcing to support empirical studies in software engineering , 2010, ESEM '10.

[62]  Christian Heipke,et al.  Crowdsourcing geospatial data , 2010 .

[63]  Bernardo J. Clavijo,et al.  Crowdsourcing genomic analyses of ash and ash dieback – power to the people , 2013, GigaScience.

[64]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[65]  Daren C. Brabham Moving the crowd at iStockphoto: The composition of the crowd and motivations for participation in a crowdsourcing application , 2008, First Monday.

[66]  Tobias Hoßfeld,et al.  Analyzing costs and accuracy of validation mechanisms for crowdsourcing platforms , 2013, Math. Comput. Model..

[67]  Phuoc Tran-Gia,et al.  Anatomy of a Crowdsourcing Platform - Using the Example of Microworkers.com , 2011, 2011 Fifth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing.

[68]  Benjamin B. Bederson,et al.  Web workers unite! addressing challenges of online laborers , 2011, CHI Extended Abstracts.

[69]  T. J. Watson Some Thoughts on a Framework for Crowdsourcing A Position Paper for the CHI 2011 Workshop on Crowdsourcing and Human Computation , 2011 .

[70]  Kyo Chul Kang,et al.  Feature-Oriented Domain Analysis (FODA) Feasibility Study , 1990 .

[71]  Robert Morris,et al.  Crowdsourcing Workshop: The Emergence of Affective Crowdsourcing , 2011 .

[72]  Davor Svetinovic,et al.  CrowdREquire: A Requirements Engineering Crowdsourcing Platform , 2012, AAAI Spring Symposium: Wisdom of the Crowd.

[73]  Tim Kraska,et al.  CrowdDB: answering queries with crowdsourcing , 2011, SIGMOD '11.

[74]  Kai Petersen,et al.  Systematic Mapping Studies in Software Engineering , 2008, EASE.

[75]  Silvia Mara Abrahão,et al.  Usability evaluation methods for the web: A systematic mapping study , 2011, Inf. Softw. Technol..

[76]  Mark A. Musen,et al.  Crowdsourcing the Verification of Relationships in Biomedical Ontologies , 2013, AMIA.

[77]  Marco Bani Crowdsourcing Democracy: The Case of Icelandic Social Constitutionalism , 2012 .

[78]  Pearl Brereton,et al.  Evidence relating to Object-Oriented software design: A survey , 2007, First International Symposium on Empirical Software Engineering and Measurement (ESEM 2007).

[79]  Phuoc Tran-Gia,et al.  Modeling of crowdsourcing platforms and granularity of work organization in Future Internet , 2011, 2011 23rd International Teletraffic Congress (ITC).

[80]  Ittai Abraham,et al.  Crowdsourcing Gold-HIT Creation at Scale: Challenges and Adaptive Exploration Approaches , 2013 .

[81]  Gianluca Demartini,et al.  Mechanical Cheat: Spamming Schemes and Adversarial Techniques on Crowdsourcing Platforms , 2012, CrowdSearch.

[82]  Henning Müller,et al.  Ground truth generation in medical imaging: a crowdsourcing-based iterative approach , 2012, CrowdMM '12.

[83]  Bashar Nuseibeh,et al.  Social sensing: when users become monitors , 2011, ESEC/FSE '11.

[84]  Martha Larson,et al.  Activating the Crowd: Exploiting User-Item Reciprocity for Recommendation , 2013 .

[85]  Omar Alonso,et al.  Perspectives on Infrastructure for Crowdsourcing , 2011 .

[86]  Milan Vojnovic,et al.  Crowdsourcing and all-pay auctions , 2009, EC '09.

[87]  R. Miikkulainen,et al.  Leveraging Human Computation Markets for Interactive Evolution , 2013 .

[88]  David Alan Grier Not for All Markets , 2011, Computer.

[89]  Jerry Brito,et al.  Hack, Mash & Peer: Crowdsourcing Government Transparency , 2007 .

[90]  P. Whitla,et al.  Crowdsourcing and its application in marketing activities , 2009 .

[91]  Eddy Maddalena,et al.  Crowdsourcing to Mobile Users: A Study of the Role of Platforms and Tasks , 2013, DBCrowd.

[92]  Matthew Lease,et al.  Crowdsourcing 101: putting the WSDM of crowds to work for you , 2011, WSDM '11.

[93]  Yaron Singer,et al.  Pricing mechanisms for crowdsourcing markets , 2013, WWW.

[94]  Lisa Buckley,et al.  Value 2.0: eight new rules for creating and capturing value from innovative technologies , 2008 .

[95]  Gabriella Kazai,et al.  In Search of Quality in Crowdsourcing for Search Engine Evaluation , 2011, ECIR.

[96]  Russell L Woods,et al.  Crowdsourcing a Normative Natural Language Dataset: A Comparison of Amazon Mechanical Turk and In-Lab Data Collection , 2013, Journal of medical Internet research.