Data Collection in the Digital Age: Innovative Alterantives to Student Samples

Online crowdsourcing markets (OCM) are becoming more popular as a source for data collection. In this paper, we examine the consistency of survey results across student samples, consumer panels, and online crowdsourcing markets (specifically Amazon's Mechanical Turk) both within the United States and outside. We conduct two studies examining the technology acceptance model (TAM) and the expectation-disconfirmation theory (EDT) to explore potential differences in demographics, psychometrics, structural model estimates, and measurement invariances. Our findings indicate that (1) U.S.-based OCM samples provide demographics much more similar to our student and consumer panel samples than the non-U.S.-based OCM samples; (2) both U.S. and non-U.S. OCM samples provide initial psychometric properties (reliability, convergent, and divergent validity) that are similar to those of both student and consumer panels; (3) non-U.S. OCM samples generally provide differences in scale means compared to those of our students, consumer panels, and U.S. OCM samples; and (4) one of the non-U.S. OCM samples refuted the highly replicated and validated TAM model in the relationship of perceived usefulness to behavioral intentions. Although our post hoc analyses isolated some cultural and demographic effects with regard to the non-U.S. samples in Study 1, they did not address the model differences found in Study 2. Specifically, the inclusion of non-U.S. OCM respondents led to statistically significant differences in parameter estimates, and hence to different statistical conclusions. Due to these unexplained differences that exist within the non-U.S. OCM samples, we caution that the inclusion of non-U.S. OCM participants may lead to different conclusions than studies with only U.S. OCM participants. We are unable to conclude whether this is due to of cultural differences, differences in the demographic profiles of non-U.S. OCM participants, or some unexplored factors within the models. Therefore, until further research is conducted to explore these differences in detail, we urge researchers utilizing OCMs with the intention to generalize to U.S. populations focus on U.S.-based participants and exercise caution in using non-U.S. participants. We further recommend that researchers should clearly describe their OCM usage and design (e.g., demographics, participant filters, etc.) procedures. Overall, we find that U.S. OCM samples produced models that lead to similar statistical conclusions as both U.S. students and U.S. consumer panels at a considerably reduced cost.

[1]  John L. Smith,et al.  Research on the Internet: Validation of a World-Wide Web mediated personality scale , 1999, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[2]  Rodney W. Thomas When Student Samples Make Sense in Logistics Research , 2011 .

[3]  Selin A. Malkoc,et al.  Choosing Here and Now vs. There and Later: The Moderating Role of Psychological Distance on Assortment Size Preferences , 2012 .

[4]  A. Burton-Jones Minimizing Method Bias Through Programmatic Research , 2009 .

[5]  Winny Shen,et al.  Samples in applied psychology: over a decade of research in review. , 2011, The Journal of applied psychology.

[6]  Michael D. Buhrmester,et al.  Amazon's Mechanical Turk , 2011, Perspectives on psychological science : a journal of the Association for Psychological Science.

[7]  Miguel A. Vadillo,et al.  Further evidence on the validity of web-based research on associative learning: Augmentation in a predictive learning task , 2011, Comput. Hum. Behav..

[8]  Chih-Ping Chou,et al.  Measurement invariance in longitudinal clinical research assuming change from development and intervention. , 1994, Journal of consulting and clinical psychology.

[9]  Paul Benjamin Lowry,et al.  Profiling the Research Productivity of Tenured Information Systems Faculty at U.S. Institutions , 2011, MIS Q..

[10]  C. Fornell,et al.  Evaluating structural equation models with unobservable variables and measurement error. , 1981 .

[11]  Andrea Carignani,et al.  "We Don't Need no Education": Required Abilities in Online Labor Markets , 2010, ICIS.

[12]  Jacob A. Benfield,et al.  Internet-Based Data Collection: Promises and Realities , 2006 .

[13]  D. Straub,et al.  Editor's comments: a critical look at the use of PLS-SEM in MIS quarterly , 2012 .

[14]  S. Gosling,et al.  Should we trust web-based studies? A comparative analysis of six preconceptions about internet questionnaires. , 2004, The American psychologist.

[15]  Wael Soliman,et al.  Crowdsourcing as a Mobile Service - Case Study: Publishing Photography , 2012, ECIS.

[16]  Silvia Biffignandi,et al.  Handbook of Web Surveys , 2021 .

[17]  William R. King,et al.  A meta-analysis of the technology acceptance model , 2006, Inf. Manag..

[18]  R. Groves,et al.  Survey Errors and Survey Costs. , 1991 .

[19]  Katherine M. White,et al.  Internet versus paper-and-pencil survey methods in psychological experiments : equivalence testing of participant responses to health-related messages , 2009 .

[20]  Kai H. Lim,et al.  Understanding sustained participation in transactional virtual communities , 2012, Decis. Support Syst..

[21]  E. Singer,et al.  The effects of response rate changes on the index of consumer sentiment. , 2000, Public opinion quarterly.

[22]  Fred D. Davis,et al.  User Acceptance of Computer Technology: A Comparison of Two Theoretical Models , 1989 .

[23]  Pieter H. Hartel,et al.  PRIVACY IN ONLINE SOCIAL NETWORKS , 2012, ICIS 2012.

[24]  LimayemMoez,et al.  Data collection in the digital age , 2014 .

[25]  Wynne W. Chin How to Write Up and Report PLS Analyses , 2010 .

[26]  Beibei Li,et al.  International Conference on Information Systems ( ICIS ) 2010 DESIGNING RANKING SYSTEMS FOR HOTELS ON TRAVEL SEARCH ENGINES TO ENHANCE USER EXPERIENCE , 2017 .

[27]  Jae-On Kim,et al.  Introduction to Factor Analysis: What It Is and How To Do It , 1978 .

[28]  R. Bagchi,et al.  Illusionary Progress in Loyalty Programs: Magnitudes, Reward Distances, and Step-Size Ambiguity , 2011 .

[29]  Maurice E. Schweitzer,et al.  The Imbibing Idiot Bias: Consuming Alcohol Can Be Hazardous to Your (Perceived) Intelligence , 2012 .

[30]  Susan M. Broniarczyk,et al.  It’s Not Me, It’s You: How Gift Giving Creates Giver Identity Threat as a Function of Social Closeness , 2011 .

[31]  L.Michael Honaker,et al.  The equivalency of computerized and conventional MMPI administration: A critical review☆☆☆ , 1988 .

[32]  Liang Chen,et al.  Comparing Strategies for Winning Expert-rated and Crowd-rated Crowdsourcing Contests: First Findings , 2012, AMCIS.

[33]  Daniel J. Veit,et al.  More than fun and money. Worker Motivation in Crowdsourcing - A Study on Mechanical Turk , 2011, AMCIS.

[34]  M. Gordon,et al.  The “Science of the Sophomore” Revisited: from Conjecture to Empiricism , 1986 .

[35]  Manoj Thomas,et al.  Psychological Distance and Subjective Experience: How Distancing Reduces the Feeling of Difficulty , 2011 .

[36]  Robert S. Wyer,et al.  The Role of Bolstering and Counterarguing MindSets in Persuasion Author ( s ) : , 2012 .

[37]  Detmar W. Straub,et al.  A Practical Guide To Factorial Validity Using PLS-Graph: Tutorial And Annotated Example , 2005, Commun. Assoc. Inf. Syst..

[38]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[39]  James M. Leonhardt,et al.  Avoiding the Risk of Responsibility by Seeking Uncertainty: Responsibility Aversion and Preference for Indirect Agency When Choosing for Others , 2011 .

[40]  Panagiotis G. Ipeirotis,et al.  Running Experiments on Amazon Mechanical Turk , 2010, Judgment and Decision Making.

[41]  Xianggui Qu,et al.  Multivariate Data Analysis , 2007, Technometrics.

[42]  Dorothy E. Leidner,et al.  Review: A Review of Culture in Information Systems Research: Toward a Theory of Information Technology Culture Conflict , 2006, MIS Q..

[43]  William C. Schmidt,et al.  World-Wide Web survey research: Benefits, potential problems, and solutions , 1997 .

[44]  E. Singer,et al.  Leverage-saliency theory of survey participation: description and an illustration. , 2000, Public opinion quarterly.

[45]  J. Steenkamp,et al.  Assessing Measurement Invariance in Cross-National Consumer Research , 1998 .

[46]  Moez Limayem,et al.  "You Want It When?" How Temporal Dissonance in IT Workers Contributes to Project Failures , 2011, ICIS.

[47]  Yasuaki Sakamoto,et al.  Testing tournament selection in creative problem solving using crowds , 2011, ICIS.

[48]  Jeffrey V. Nickerson,et al.  Generating Creative Ideas Through Crowds: An Experimental Study of Combination , 2011, ICIS.

[49]  Gabriele Paolacci,et al.  The Intermediate Alternative Effect: Considering a Small Tradeoff Increases Subsequent Willingness to Make Large Tradeoffs , 2011 .

[50]  Chong Wang,et al.  Privacy in Online Social Networks , 2011, ICIS.

[51]  John C. Mitchell,et al.  How Good Are Humans at Solving CAPTCHAs? A Large Scale Evaluation , 2010, 2010 IEEE Symposium on Security and Privacy.

[52]  Jonathan J. Koehler,et al.  Safety First? The Role of Emotion in Safety Product Betrayal Aversion , 2011 .

[53]  Siddharth Suri,et al.  Conducting behavioral research on Amazon’s Mechanical Turk , 2010, Behavior research methods.

[54]  Bill Tomlinson,et al.  Who are the crowdworkers?: shifting demographics in mechanical turk , 2010, CHI Extended Abstracts.

[55]  Michael A. Smith,et al.  Virtual subjects: Using the Internet as an alternative source of subjects and research environment , 1997 .

[56]  Gordon W. Cheung,et al.  Evaluating Goodness-of-Fit Indexes for Testing Measurement Invariance , 2002 .

[57]  William M. K. Trochim,et al.  Research methods knowledge base , 2001 .

[58]  Detmar W. Straub,et al.  Toward a Theory-Based Measurement of Culture , 2002, J. Glob. Inf. Manag..

[59]  Daniel M. Oppenheimer,et al.  Instructional Manipulation Checks: Detecting Satisficing to Increase Statistical Power , 2009 .

[60]  J. Henrich,et al.  The weirdest people in the world? , 2010, Behavioral and Brain Sciences.

[61]  J. Bohannon Human subject research. Social science for pennies. , 2011, Science.

[62]  Stephen A. Spiller,et al.  Opportunity Cost Consideration , 2011 .

[63]  Katja Thoring,et al.  Crowdsourcing with Semantic Differentials: A Game to Investigate the Meaning of Form , 2010, AMCIS.

[64]  M. Couper A REVIEW OF ISSUES AND APPROACHES , 2000 .

[65]  Vera Toepoel,et al.  Effects of Design in Web Surveys Comparing Trained and Fresh Respondents , 2008 .

[66]  R M Groves,et al.  Consequences of reducing nonresponse in a national telephone survey. , 2000, Public opinion quarterly.

[67]  Aner Sela,et al.  Decision Quicksand: How Trivial Choices Suck Us In , 2012 .

[68]  G. Loewenstein,et al.  Slow Down! Insensitivity to Rate of Consumption Leads to Avoidable Satiation , 2013 .

[69]  Hans van der Heijden,et al.  A New Look at User Commitment towards Information Systems: Evidence from Amazon's mechanical Turk , 2012, ECIS.

[70]  Birk Diedenhofen,et al.  Seriousness checks are useful to improve data validity in online research , 2012, Behavior Research Methods.

[71]  Moez Limayem,et al.  Obsessive Compulsive Tendencies as Predictors of Dangerous Mobile Phone Usage , 2012, AMCIS.

[72]  Sepandar D. Kamvar,et al.  How Happiness Affects Choice , 2011 .

[73]  Junghwan Kim,et al.  The Effect of Design Characteristics of Mobile Applications on User Retention: An Environmental Psychology Perspective , 2012, AMCIS.

[74]  Don A. Dillman,et al.  Does a Probability-Based Household Panel Benefit from Assignment to Postal Response as an Alternative to Internet-Only? , 2008 .

[75]  Fred D. Davis Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology , 1989, MIS Q..

[76]  Paul Meyerson,et al.  Validating Internet research: A test of the psychometric equivalence of Internet and in-person samples , 2003, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[77]  Hans Thies,et al.  Validating results of human-based electronic services leveraging multiple reviewers , 2010, AMCIS.

[78]  Martin Oberhofer,et al.  A Preliminary Study on Methods for Retaining Data Quality Problems in Automatically Generated Test Data , 2012, AMCIS.

[79]  John W. Daly,et al.  Electronic bulletin board distributed questionnaires for exploratory research , 1996, J. Inf. Sci..

[80]  Varun Grover,et al.  Nurturing a Thriving Information Systems Discipline: A Call to Action , 2011, ICIS.

[81]  Jon Sprouse A validation of Amazon Mechanical Turk for the collection of acceptability judgments in linguistic theory , 2010, Behavior research methods.

[82]  Neil Malhotra,et al.  Completion Time and Response Order Effects in Web Surveys , 2008 .

[83]  Christoph Rosenkranz,et al.  Increasing the Willingness to Collaborate Online: an Analysis of Sentiment-Driven Interactions in Peer Content Production , 2011, ICIS.

[84]  C. Townsend,et al.  Self-Affirmation through the Choice of Highly Aesthetic Products , 2012 .

[85]  Aniket Kittur,et al.  Crowdsourcing user studies with Mechanical Turk , 2008, CHI.

[86]  R. Peterson On the Use of College Students in Social Science Research: Insights from a Second‐Order Meta‐analysis , 2001 .

[87]  Alexander I. Rudnicky,et al.  Using the Amazon Mechanical Turk for transcription of spoken language , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[88]  R. Groves Nonresponse Rates and Nonresponse Bias in Household Surveys , 2006 .

[89]  Siddhartha S. Srinivasa,et al.  People helping robots helping people: Crowdsourcing for grasping novel objects , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[90]  Yongqiang Sun,et al.  Working for one penny: Understanding why people would like to participate in online tasks with low payment , 2011, Comput. Hum. Behav..

[91]  F. Davis,et al.  The Relations among Executive Functions and Users' Perceptions toward Using Technologies to Multitask , 2012 .

[92]  Yongqiang Sun,et al.  Investigating the Non-Linear Relationships in the Expectancy Theory: The Case of Crowdsourcing Marketplace , 2012, AMCIS.

[93]  P. Bentler,et al.  Cutoff criteria for fit indexes in covariance structure analysis : Conventional criteria versus new alternatives , 1999 .

[94]  Norbert Schwarz,et al.  The Presenter's Paradox , 2012 .

[95]  Moez Limayem,et al.  Impact of Online Content on Attitudes and Buying Intentions , 2012, AMCIS.

[96]  Barbara M. Byrne,et al.  Structural equation modeling with AMOS , 2010 .

[97]  M. Birnbaum Human research and data collection via the internet. , 2004, Annual review of psychology.

[98]  Gordon B. Davis,et al.  User Acceptance of Information Technology: Toward a Unified View , 2003, MIS Q..