Diminished Control in Crowdsourcing

Obtaining high-quality data from crowds can be difficult if contributors do not give tasks sufficient attention. Attention checks are often used to mitigate this problem, but, because the roots of inattention are poorly understood, checks often compel attentive contributors to complete unnecessary work. We investigated a potential source of inattentiveness during crowdwork: multitasking. We found that workers switched to other tasks every 5 minutes, on average. There were indications that increasing switch frequency negatively affected performance. To address this, we tested an intervention that encouraged workers to stay focused on our task after multitasking was detected. We found that our intervention reduced the frequency of task switching. It also improves on existing attention checks because it does not place additional demands on workers who are already focused. Our approach shows that crowds can help to overcome some of the limitations of laboratory studies by affording access to naturalistic multitasking behavior.

[1]  Anna L. Cox,et al.  What does it mean for an interruption to be relevant? An investigation of relevance as a memory effect , 2013 .

[2]  Víctor M. González,et al.  Why do i keep interrupting myself?: environment, habit and self-interruption , 2011, CHI.

[3]  Vanessa Evers,et al.  Combining social strategies and workload: a new design to reduce the negative effects of task interruptions , 2013, CHI Extended Abstracts.

[4]  Dana Chandler,et al.  Preventing Satisficing in Online Surveys: A "Kapcha" to Ensure Higher Quality Data , 2010 .

[5]  Ann Blandford,et al.  The effect of interruptions on postcompletion and other procedural errors: an account based on the activation-based goal memory model. , 2008, Journal of experimental psychology. Applied.

[6]  Eric Horvitz,et al.  Disruption and recovery of computing tasks: field study, analysis, and directions , 2007, CHI.

[7]  Víctor M. González,et al.  "Constant, constant, multi-tasking craziness": managing multiple working spheres , 2004, CHI.

[8]  Brian P. Bailey,et al.  Understanding changes in mental workload during execution of goal-directed tasks and its application for interruption management , 2008, TCHI.

[9]  Christopher A. Monk,et al.  Recovering From Interruptions: Implications for Driver Distraction Research , 2004, Hum. Factors.

[10]  Jonathan Back,et al.  Recovering from an interruption: investigating speed-accuracy trade-offs in task resumption behavior. , 2013, Journal of experimental psychology. Applied.

[11]  Duncan P. Brumby,et al.  Now Check Your Input: Brief Task Lockouts Encourage Checking, Longer Lockouts Encourage Task Switching , 2016, CHI.

[12]  K. Nakayama,et al.  Is the Web as good as the lab? Comparable performance from Web and lab in cognitive/perceptual experiments , 2012, Psychonomic Bulletin & Review.

[13]  Michael S. Bernstein,et al.  Break It Down: A Comparison of Macro- and Microtasks , 2015, CHI.

[14]  Brian P. Bailey,et al.  Effects of intelligent notification management on users and their tasks , 2008, CHI.

[15]  Anna L. Cox,et al.  Home is Where the Lab is: A Comparison of Online and Lab Data From a Time-sensitive Study of Interruption , 2015, Hum. Comput..

[16]  Deborah A. Boehm-Davis,et al.  What Makes Us Resilient to Interruptions? Understanding the Role of Individual Differences in Resumption , 2011 .

[17]  Amar Cheema,et al.  Data collection in a flat world: the strengths and weaknesses of mechanical turk samples , 2013 .

[18]  Aniket Kittur,et al.  Instrumenting the crowd: using implicit behavioral measures to predict task performance , 2011, UIST.

[19]  Víctor M. González,et al.  Multitasking and interruptions: a SIG on bridging the gap between research on the micro and macro worlds , 2012, CHI Extended Abstracts.

[20]  Studs Terkel,et al.  Working: People Talk About What They Do All Day and How They Feel About What They Do , 1974 .

[21]  Ricardo Matsumura de Araújo,et al.  99designs: An Analysis of Creative Competition in Crowdsourced Design , 2013, HCOMP.

[22]  Schahram Dustdar,et al.  Simulation-Based Modeling and Evaluation of Incentive Schemes in Crowdsourcing Environments , 2013, OTM Conferences.

[23]  A. Acquisti,et al.  Reputation as a sufficient condition for data quality on Amazon Mechanical Turk , 2013, Behavior Research Methods.

[24]  Todd M. Gureckis,et al.  CUNY Academic , 2016 .

[25]  Panagiotis G. Ipeirotis,et al.  Running Experiments on Amazon Mechanical Turk , 2010, Judgment and Decision Making.

[26]  Brian P. Bailey,et al.  Investigating the effectiveness of mental workload as a predictor of opportune moments for interruption , 2005, CHI Extended Abstracts.

[27]  Jonathan Back,et al.  Locked-out: investigating the effectiveness of system lockouts to reduce errors in routine tasks , 2010, CHI Extended Abstracts.

[28]  Raj M. Ratwani,et al.  Recovering from Interruptions: Does Alert Type Matter? , 2009 .

[29]  Brian P. Bailey,et al.  On the need for attention-aware systems: Measuring effects of interruption on task performance, error rate, and affective state , 2006, Comput. Hum. Behav..

[30]  Brian Shackel,et al.  ERGONOMICS IN THE DESIGN OF A LARGE DIGITAL COMPUTER CONSOLE , 1962 .

[31]  P. Sanderson,et al.  Interruption management in the intensive care unit: Predicting resumption times and assessing distributed support. , 2010, Journal of experimental psychology. Applied.

[32]  Niels Taatgen,et al.  What Makes Interruptions Disruptive?: A Process-Model Account of the Effects of the Problem State Bottleneck on Task Interruption and Resumption , 2015, CHI.

[33]  Raj M. Ratwani,et al.  A memory for goals model of sequence errors , 2011, Cognitive Systems Research.

[34]  W. Dunsmuir,et al.  The impact of interruptions on clinical task completion , 2010, Quality and Safety in Health Care.

[35]  Walter S. Lasecki,et al.  Online quality control for real-time crowd captioning , 2012, ASSETS '12.

[36]  Eric Horvitz,et al.  Balancing Awareness and Interruption: Investigation of Notification Deferral Policies , 2005, User Modeling.

[37]  Mark Harman,et al.  Pricing crowdsourcing-based software development tasks , 2013, 2013 35th International Conference on Software Engineering (ICSE).

[38]  Adam Marcus,et al.  The Effects of Sequence and Delay on Crowd Work , 2015, CHI.

[39]  Paul Johns,et al.  Bored mondays and focused afternoons: the rhythm of attention and online activity in the workplace , 2014, CHI.

[40]  Eric Horvitz,et al.  Why Stop Now? Predicting Worker Engagement in Online Crowdsourcing , 2013, HCOMP.

[41]  Colin Camerer,et al.  The Effects of Financial Incentives in Experiments: A Review and Capital-Labor-Production Framework , 1999 .

[42]  K. A. Ericsson,et al.  A deliberate practice account of typing proficiency in everyday typists. , 2007, Journal of experimental psychology. Applied.

[43]  J. Trafton,et al.  The effect of interruption duration and demand on resuming suspended goals. , 2008, Journal of experimental psychology. Applied.

[44]  Raj M. Ratwani,et al.  The Effect of Interruption Modality on Primary Task Resumption , 2008 .

[45]  Jesse Chandler,et al.  Nonnaïveté among Amazon Mechanical Turk workers: Consequences and solutions for behavioral researchers , 2013, Behavior Research Methods.

[46]  Michael S. Bernstein,et al.  Crowds in two seconds: enabling realtime crowd-powered interfaces , 2011, UIST.

[47]  Victor Kaptelinin,et al.  Unpacking the social dimension of external interruptions , 2007, GROUP.

[48]  E. Walther,et al.  The Role of Evaluative Conditioning in Attitude Formation , 2011 .

[49]  Jeffrey Heer,et al.  Crowdsourcing graphical perception: using mechanical turk to assess visualization design , 2010, CHI.

[50]  Duncan P. Brumby,et al.  Integrating knowledge of multitasking and interruptions across different perspectives and research methods , 2015, Int. J. Hum. Comput. Stud..

[51]  W. Cordell,et al.  Emergency department workplace interruptions: are emergency physicians "interrupt-driven" and "multitasking"? , 2000, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.

[52]  Lorrie Faith Cranor,et al.  Are your participants gaming the system?: screening mechanical turk workers , 2010, CHI.

[53]  Christopher A. Monk,et al.  The Effect of Frequent versus Infrequent Interruptions on Primary Task Resumption , 2004 .

[54]  Brian P. Bailey,et al.  Understanding and developing models for detecting and differentiating breakpoints during interactive tasks , 2007, CHI.

[55]  Michael D. Buhrmester,et al.  Amazon's Mechanical Turk , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[56]  E. M. Altmann,et al.  Momentary interruptions can derail the train of thought. , 2014, Journal of experimental psychology. General.

[57]  Raquel Benbunan-Fich,et al.  The effects of positive and negative self-interruptions in discretionary multitasking , 2012, CHI Extended Abstracts.

[58]  Wei Pan,et al.  Supporting task resumption using visual feedback , 2014, CSCW.

[59]  Robert E. Kraut,et al.  Controlling interruptions: awareness displays and social motivation for coordination , 2004, CSCW.

[60]  James Fogarty,et al.  Biases in human estimation of interruptibility: effects and implications for practice , 2007, CHI.

[61]  Dylan M. Jones,et al.  Interruption of the Tower of London task: support for a goal-activation approach. , 2006, Journal of experimental psychology. General.

[62]  Dario D. Salvucci On reconstruction of task context after interruption , 2010, CHI.

[63]  Loukia D. Loukopoulos,et al.  COCKPIT INTERRUPTIONS AND DISTRACTIONS: A LINE OBSERVATION STUDY , 2001 .

[64]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[65]  Dario D. Salvucci,et al.  The effects of time constraints on user behavior for deferrable interruptions , 2011, CHI.

[66]  Duncan P. Brumby,et al.  Task Lockouts Induce Crowdworkers to Switch to Other Activities , 2015, CHI Extended Abstracts.

[67]  Peng Dai,et al.  Inserting Micro-Breaks into Crowdsourcing Workflows , 2013, HCOMP.

[68]  Niels Taatgen,et al.  Toward a unified theory of the multitasking continuum: from concurrent performance to task switching, interruption, and resumption , 2009, CHI.

[69]  K. Gegenfurtner,et al.  Design Issues in Gaze Guidance Under review with ACM Transactions on Computer Human Interaction , 2009 .

[70]  Duncan P. Brumby,et al.  Frequency and Duration of Self-Initiated Task-Switching in an Online Investigation of Interrupted Performance , 2013, HCOMP.

[71]  Laura A. Dabbish,et al.  Self-interruption on the computer: a typology of discretionary task interleaving , 2009, CHI.

[72]  Víctor M. González,et al.  No task left behind?: examining the nature of fragmented work , 2005, CHI.

[73]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[74]  Dario D. Salvucci,et al.  Multitasking and monotasking: the effects of mental workload on deferred task interruptions , 2010, CHI.

[75]  Bipin Indurkhya,et al.  Cognitively inspired task design to improve user performance on crowdsourcing platforms , 2014, CHI.

[76]  Katharina Reinecke,et al.  Crowdsourcing performance evaluations of user interfaces , 2013, CHI.

[77]  Gloria Mark,et al.  "A pace not dictated by electrons": an empirical study of work without email , 2012, CHI.

[78]  Peng Dai,et al.  And Now for Something Completely Different: Improving Crowdsourcing Workflows with Micro-Diversions , 2015, CSCW.

[79]  Michael S. Bernstein,et al.  The future of crowd work , 2013, CSCW.

[80]  Aniket Kittur,et al.  CrowdScape: interactively visualizing user behavior and output , 2012, UIST.

[81]  Frédéric Dandurand,et al.  Comparing online and lab methods in a problem-solving experiment , 2008, Behavior research methods.

[82]  Gloria Mark,et al.  Stress and multitasking in everyday college life: an empirical study of online activity , 2014, CHI.

[83]  Duncan J. Watts,et al.  Financial incentives and the "performance of crowds" , 2009, HCOMP '09.

[84]  Tara S. Behrend,et al.  The viability of crowdsourcing for survey research , 2011, Behavior research methods.

[85]  Ann Blandford,et al.  Further investigations into post-completion error : the effects of interruption position and duration , 2006 .