An Extendable Toolkit for Managing Quality of Human-Based Electronic Services

Micro-task markets like Amazon MTurk enable online workers to provide human intelligence asWeb-based on demand services (so called people services). Businesses facing large amounts of knowledge work can benefit from increased flexibility and scalability of their workforce but need to cope with reduced control of result quality. While this problem is well recognized, it has so far only rudimentarily been addressed by existing platforms and tools. In this paper, we present a flexible research toolkit which enables experiments with advanced quality management mechanisms for generic micro-task markets. The toolkit enables control of correctness and performance of task fulfillment by means of continuous sampling, dynamic majority voting and worker pooling. While we demonstrate its application and performance for an OCR scenario building on Amazon MTurk, the toolkit supports the development of advanced quality management mechanisms for a large variety of people service scenarios and platforms.

[1]  Sudhir Agarwal,et al.  Managing Quality of Human-Based eServices , 2008, ICSOC Workshops.

[2]  H. F. Dodge A Sampling Inspection Plan for Continuous Production , 1943, Journal of Fluids Engineering.

[3]  Lydia B. Chilton,et al.  TurKit: Tools for iterative tasks on mechanical turk , 2009, 2009 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC).

[4]  Hans Thies,et al.  Statistical Quality Control for Human-Based Electronic Services , 2010, ICSOC.

[5]  Hans Thies,et al.  Validating results of human-based electronic services leveraging multiple reviewers , 2010, AMCIS.

[6]  Luis von Ahn Human Computation , 2008, ICDE.

[7]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[8]  Fred Spiring,et al.  Introduction to Statistical Quality Control , 2007, Technometrics.

[9]  Manuel Blum,et al.  reCAPTCHA: Human-Based Character Recognition via Web Security Measures , 2008, Science.