Conducting behavioral research on Amazon’s Mechanical Turk

Amazon’s Mechanical Turk is an online labor market where requesters post jobs and workers choose which jobs to do for pay. The central purpose of this article is to demonstrate how to use this website to conduct behavioral research and to lower the barrier to entry for researchers who could benefit from the platform. We describe general techniques that apply to a variety of types of research and experiments across disciplines. We begin by discussing some of the advantages of doing experiments on Mechanical Turk, including easy access to a large, stable, and diverse subject pool, the low cost of running experiments, and faster iteration between developing theory and executing experiments. While other methods of conducting behavioral research may match or exceed Mechanical Turk on one or more of these axes, we show that, taken as a whole, Mechanical Turk can be a useful tool for many researchers. We discuss how the behavior of workers compares with that of experts and laboratory subjects. We then illustrate the mechanics of putting a task on Mechanical Turk, including recruiting subjects, executing the task, and reviewing the submitted work. Finally, we provide solutions to common problems a researcher may face on this platform, including techniques for conducting synchronous experiments, methods for ensuring high-quality work, ways to keep data private, and ways to maintain code security.
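The mechanics sketched above — posting a task, recruiting workers, and reviewing submissions — are exposed through Mechanical Turk's Requester API. As a minimal illustration (the survey URL, reward amount, and other parameter values below are assumptions for the example, not values from the article), a task hosted on an external site can be described by an ExternalQuestion XML payload plus a handful of parameters for the CreateHIT operation:

```python
# Sketch: assembling the payload for MTurk's CreateHIT operation.
# The survey URL, title, and reward below are illustrative only; in
# practice these parameters would be passed to a client such as
# boto3's "mturk" client, typically against the sandbox endpoint first.

EXTERNAL_QUESTION_XMLNS = (
    "http://mechanicalturk.amazonaws.com/AWSMechanicalTurkDataSchemas/"
    "2006-07-14/ExternalQuestion.xsd"
)

def external_question(url: str, frame_height: int = 600) -> str:
    """Wrap an externally hosted task URL in the ExternalQuestion XML
    that MTurk expects; the task page is shown to workers in a frame."""
    return (
        f'<ExternalQuestion xmlns="{EXTERNAL_QUESTION_XMLNS}">'
        f"<ExternalURL>{url}</ExternalURL>"
        f"<FrameHeight>{frame_height}</FrameHeight>"
        "</ExternalQuestion>"
    )

def hit_parameters(url: str) -> dict:
    """Assemble keyword arguments for CreateHIT (a sketch, not a
    complete or recommended configuration)."""
    return {
        "Title": "A 5-minute decision-making survey",
        "Description": "Answer a short series of choice questions.",
        "Keywords": "survey, research, experiment",
        "Reward": "0.50",                  # US dollars, passed as a string
        "MaxAssignments": 100,             # number of distinct workers
        "AssignmentDurationInSeconds": 600,
        "LifetimeInSeconds": 3 * 24 * 3600,
        "Question": external_question(url),
    }

params = hit_parameters("https://example.org/my-survey")
# A real call would then be roughly:
#   client = boto3.client("mturk", endpoint_url=SANDBOX_ENDPOINT, ...)
#   hit = client.create_hit(**params)
```

Reviewing submitted work follows the same API pattern: `list_assignments_for_hit` returns each worker's submission, which the requester then accepts with `approve_assignment` (releasing payment) or rejects.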
