Is the Web as good as the lab? Comparable performance from Web and lab in cognitive/perceptual experiments

With the increasing sophistication and ubiquity of the Internet, behavioral research is on the cusp of a revolution that will do for population sampling what the computer did for stimulus control and measurement. It remains a common assumption, however, that data from self-selected Web samples must involve a trade-off between participant numbers and data quality. Concerns about data quality are heightened for performance-based cognitive and perceptual measures, particularly those that are timed or that involve complex stimuli. In experiments run with uncompensated, anonymous participants whose motivation for participation is unknown, reduced conscientiousness or lack of focus could produce results that would be difficult to interpret due to decreased overall performance, increased variability of performance, or increased measurement noise. Here, we addressed the question of data quality across a range of cognitive and perceptual tests. For three key performance metrics—mean performance, performance variance, and internal reliability—the results from self-selected Web samples did not differ systematically from those obtained from traditionally recruited and/or lab-tested samples. These findings demonstrate that collecting data from uncompensated, anonymous, unsupervised, self-selected participants need not reduce data quality, even for demanding cognitive and perceptual experiments.

[1]  L. Cronbach Coefficient alpha and the internal structure of tests , 1951 .

[2]  R. Plomin Behavioral genetics. , 1991, Research publications - Association for Research in Nervous and Mental Disease.

[3]  Jacob Cohen,et al.  A power primer. , 1992, Psychological bulletin.

[4]  D. Wechsler Wechsler Adult Intelligence Scale , 2021, Encyclopedia of Evolutionary Psychological Science.

[5]  John H. Krantz,et al.  Comparing the results of laboratory and World-Wide Web samples on the determinants of female attractiveness , 1997 .

[6]  John L. Smith,et al.  Using the Internet for psychological research: personality testing on the World Wide Web. , 1999, British journal of psychology.

[7]  M L Phillips,et al.  An investigation into the perception of dominance from schematic faces: A study using the World-Wide Web , 1999, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[8]  M. Birnbaum Psychological experiments on the internet , 2000 .

[9]  M. Birnbaum Decision Making in the Lab and on the Web , 2000 .

[10]  Reeshad S. Dalal,et al.  Validity of Web-Based Psychological Research , 2000 .

[11]  K. McGraw,et al.  The Integrity of Web-Delivered Experiments: Can You Trust the Data? , 2000, Psychological science.

[12]  S. Baron-Cohen,et al.  The "Reading the Mind in the Eyes" Test revised version: a study with normal adults, and adults with Asperger syndrome or high-functioning autism. , 2001, Journal of child psychology and psychiatry, and allied disciplines.

[13]  J. J. Ryan,et al.  Wechsler Adult Intelligence Scale-III , 2001 .

[14]  P. Graf,et al.  Verbal Paired Associates tests limits on validity and reliability. , 2002, Archives of clinical neuropsychology : the official journal of the National Academy of Neuropsychologists.

[15]  Ulf-Dietrich Reips Standards for Internet-based experimenting. , 2002, Experimental psychology.

[16]  Brian A. Nosek,et al.  Harvesting implicit group attitudes and beliefs from a demonstration web site , 2002 .

[17]  M. Banaji,et al.  Psychological. , 2015, The journals of gerontology. Series B, Psychological sciences and social sciences.

[18]  S. Gosling,et al.  Should we trust web-based studies? A comparative analysis of six preconceptions about internet questionnaires. , 2004, The American psychologist.

[19]  B. D. De Stavola,et al.  The cognitive cost of being a twin: evidence from comparisons within families in the Aberdeen children of the 1950s cohort study , 2005, BMJ : British Medical Journal.

[20]  Tom Buchanan,et al.  Nonequivalence of on-line and paper-and-pencil psychological tests: The case of the prospective memory questionnaire , 2005, Behavior research methods.

[21]  John A. Johnson,et al.  Implementing a five-factor personality inventory for use on the internet , 2005 .

[22]  K. Nakayama,et al.  The Cambridge Face Memory Test: Results for neurologically intact individuals and an investigation of its validity using inverted face stimuli and prosopagnosic participants , 2006, Neuropsychologia.

[23]  Katelyn Y. A. McKenna,et al.  Oxford Handbook of Internet Psychology , 2007 .

[24]  Tom Buchanan,et al.  Personality testing on the Internet: what we know and what we do not , 2007 .

[25]  R. Plomin,et al.  Internet Cognitive Testing of Large Samples Needed in Genetic Research , 2007, Twin Research and Human Genetics.

[26]  K. Nakayama,et al.  Family resemblance: Ten family members with prosopagnosia and within-class object agnosia , 2007, Cognitive neuropsychology.

[27]  W. Sommer,et al.  Toward a comprehensive test battery for face cognition: Assessment of the tasks , 2008, Behavior research methods.

[28]  G. Yovel,et al.  Diagnosing prosopagnosia: Effects of ageing, sex, and participant–stimulus ethnic match on the Cambridge Face Memory Test and Cambridge Face Perception Test , 2009, Cognitive neuropsychology.

[29]  Ulf-Dietrich Reips,et al.  The methodology of Internet-based experiments , 2009 .

[30]  K. Kendler,et al.  A Web-Based Study of Personality, Psychopathology and Substance Use in Twin, Other Relative and Relationship Pairs , 2009, Twin Research and Human Genetics.

[31]  L. Germine,et al.  Face emotion recognition is related to individual differences in psychosis-proneness , 2010, Psychological Medicine.

[32]  K. Nakayama,et al.  Human face recognition ability is specific and highly heritable , 2010, Proceedings of the National Academy of Sciences.

[33]  J. Henrich,et al.  Beyond WEIRD: Towards a broad-based behavioral science , 2010, Behavioral and Brain Sciences.

[34]  Samuel D Gosling,et al.  Age differences in personality traits from 10 to 65: Big Five domains and facets in a large cross-sectional sample. , 2011, Journal of personality and social psychology.

[35]  K. Nakayama,et al.  Where cognitive development and aging meet: Face learning ability peaks after age 30 , 2011, Cognition.

[36]  Emrah Düzel,et al.  A new selective developmental deficit: Impaired object recognition with normal face recognition , 2011, Cortex.