Just in Time: Controlling Temporal Performance in Crowdsourcing Competitions

Many modern data analytics applications in areas such as crisis management, stock trading, and healthcare, rely on components capable of nearly real-time processing of streaming data produced at varying rates. In addition to automatic processing methods, many tasks involved in those applications require further human assessment and analysis. However, current crowdsourcing platforms and systems do not support stream processing with variable loads. In this paper, we investigate how incentive mechanisms in competition based crowdsourcing can be employed in such scenarios. More specifically, we explore techniques for stimulating workers to dynamically adapt to both anticipated and sudden changes in data volume and processing demand, and we analyze effects such as data processing throughput, peak-to-average ratios, and saturation effects. To this end, we study a wide range of incentive schemes and utility functions inspired by real world applications. Our large-scale experimental evaluation with more than 900 participants and more than 6200 hours of work spent by crowd workers demonstrates that our competition based mechanisms are capable of adjusting the throughput of online workers and lead to substantial on-demand performance boosts.

[1]  Sami Tabbane,et al.  Sigmoid utility function formulation for handoff reducing Access model in cognitive radio , 2013, 2013 13th International Symposium on Communications and Information Technologies (ISCIT).

[2]  E. Douglas Jensen Asynchronous Decentralized Realtime Computer Systems , 1992, NATO ASI RTC.

[3]  Björn Hartmann,et al.  What's the Right Price? Pricing Tasks for Finishing on Time , 2011, Human Computation.

[4]  Panagiotis G. Ipeirotis,et al.  Managing crowdsourced human computation: a tutorial , 2011, WWW.

[5]  M. P. Moghaddam,et al.  Optimal real time pricing in an agent-based retail market using a comprehensive demand response model , 2011 .

[6]  Agripino Gabriel Miranda Damasceno,et al.  A Base Station Workload-Aware Dynamic Pricing Scheme for Mobile Internet Access , 2015, MobiArch.

[7]  Krzysztof Z. Gajos,et al.  Platemate: crowdsourcing nutritional analysis from food photographs , 2011, UIST.

[8]  Wolfgang Nejdl,et al.  Incremental diversification for very large sets: a streaming-based approach , 2011, SIGIR '11.

[9]  Hamed Mohsenian Rad,et al.  Optimal Residential Load Control With Price Prediction in Real-Time Electricity Pricing Environments , 2010, IEEE Transactions on Smart Grid.

[10]  Beng Chin Ooi,et al.  CrowdOp: Query Optimization for Declarative Crowdsourcing Systems , 2015, IEEE Transactions on Knowledge and Data Engineering.

[11]  Elena Paslaru Bontas Simperl,et al.  Improving Paid Microtasks through Gamification and Adaptive Furtherance Incentives , 2015, WWW.

[12]  B. L. Welch THE SIGNIFICANCE OF THE DIFFERENCE BETWEEN TWO MEANS WHEN THE POPULATION VARIANCES ARE UNEQUAL , 1938 .

[13]  Jeffrey S. Simonoff,et al.  Tree Induction Vs Logistic Regression: A Learning Curve Analysis , 2001, J. Mach. Learn. Res..

[14]  Muhammad Imran,et al.  Engineering Crowdsourced Stream Processing Systems , 2013, ArXiv.

[15]  Duncan J. Watts,et al.  Financial incentives and the "performance of crowds" , 2009, HCOMP '09.

[16]  Yong Wang,et al.  Content-based utility function prediction for real-time MPEG-4 video transcoding , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[17]  Binoy Ravindran,et al.  Time-utility function-driven switched Ethernet: packet scheduling algorithm, implementation, and feasibility analysis , 2004, IEEE Transactions on Parallel and Distributed Systems.

[18]  Pratik Mishra,et al.  Incentive design to mould online behavior: a game mechanics perspective , 2014, GamifIR '14.

[19]  Leif Azzopardi,et al.  Studying user browsing behavior through gamified search tasks , 2014, GamifIR '14.

[20]  Gianluca Demartini,et al.  Scaling-Up the Crowd: Micro-Task Pricing Schemes for Worker Retention and Latency Improvement , 2014, HCOMP.

[21]  Aditya G. Parameswaran,et al.  Finish Them!: Pricing Algorithms for Human Computation , 2014, Proc. VLDB Endow..

[22]  Alain Biem,et al.  Real-time analysis and management of big time-series data , 2013, IBM J. Res. Dev..

[23]  Carlos Castillo,et al.  AIDR: artificial intelligence for disaster response , 2014, WWW.

[24]  Shree K. Nayar,et al.  Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[25]  Binoy Ravindran,et al.  Adaptive time-critical resource management using time/utility functions: past, present, and future , 2004, Proceedings of the 28th Annual International Computer Software and Applications Conference, 2004. COMPSAC 2004..

[26]  Theodore Johnson,et al.  Sampling algorithms in a stream operator , 2005, SIGMOD '05.

[27]  Stéphane Roche,et al.  GeoWeb and crisis management: issues and perspectives of volunteered geographic information , 2011, GeoJournal.

[28]  David R. B. Stockwell,et al.  Effects of sample size on accuracy of species distribution models , 2002 .

[29]  Matthew O. Ward,et al.  Summarization and Matching of Density-Based Clusters in Streaming Environments , 2011, Proc. VLDB Endow..

[30]  Rob Miller,et al.  VizWiz: nearly real-time answers to visual questions , 2010, UIST.

[31]  Stefan Siersdorfer,et al.  Competitive Game Designs for Improving the Cost Effectiveness of Crowdsourcing , 2014, CIKM.

[32]  Muhammad Imran,et al.  Coordinating human and machine intelligence to classify microblog communications in crises , 2014, ISCRAM.

[33]  Stefan Siersdorfer,et al.  Groupsourcing: Team Competition Designs for Crowdsourcing , 2015, WWW.

[34]  Vincent W. S. Wong,et al.  Optimal Real-Time Pricing Algorithm Based on Utility Maximization for Smart Grid , 2010, 2010 First IEEE International Conference on Smart Grid Communications.

[35]  Michael S. Bernstein,et al.  Crowds in two seconds: enabling realtime crowd-powered interfaces , 2011, UIST.

[36]  Ness B. Shroff,et al.  A utility-based power-control scheme in wireless cellular systems , 2003, TNET.

[37]  Beng Chin Ooi,et al.  CrowdOp: Query Optimization for Declarative Crowdsourcing Systems , 2015, IEEE Trans. Knowl. Data Eng..

[38]  Aditya G. Parameswaran,et al.  Crowd-powered find algorithms , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[39]  Hideyuki Tokuda,et al.  A Time-Driven Scheduling Model for Real-Time Operating Systems , 1985, RTSS.

[40]  Udo Kruschwitz,et al.  Proceedings of the First International Workshop on Gamification for Information Retrieval , 2014 .