Detection of Careless Responses in Online Surveys Using Answering Behavior on Smartphone

Some respondents make careless responses due to the “satisficing,” which is an attempt to complete a questionnaire as quickly and easily as possible. To obtain results that reflect a fact, detecting satisficing and excluding the responses with satisficing from the analysis targets are required. One of the devised methods detects satisficing by adding questions that check violations of instructions and inconsistencies. However, this approach may cause respondents to lose their motivation and prompt them to satisficing. Additionally, a deep learning model that automatically answers these questions was reported. This threatens the reliability of the conventional method. To detect careless responses without inserting such screening questions, machine learning (ML) detection using data obtained from answer results was attempted in a previous study, with a detection rate of 55.6%, which is not sufficient from the viewpoint of practicality. Therefore, we hypothesized that a supervised ML model with a higher detection rate could be constructed by using on-screen answering behavior as features. However, (1) no existing questionnaire system can record on-screen answering behavior and (2) even if the answering behavior can be recorded, it is unclear which answering behavior features are associated with satisficing. We developed an answering behavior recording plug-in for LimeSurvey, an online questionnaire system used all over the world, and collected a large amount of data (from 5,692 people) in Japan. Then, a variety of features were examined and generated from answering behavior, and we constructed ML models to detect careless responses. We call this detection method the ML-ABS (ML-based answering behavior scale). Evaluation by cross-validation demonstrated that the detection rate for careless responses was 85.9%, which is much higher than the previous ML method. Among the various features we proposed, we found that reselecting the Likert scale and scrolling particularly contributed to the detection of careless responses.

[1]  Ronald D. Rogge,et al.  Caring about carelessness: Participant inattention and its effects on research. , 2014 .

[2]  Vera Toepoel,et al.  The Use of PCs, Smartphones, and Tablets in a Probability-Based Panel Survey , 2016 .

[3]  Penny Black,et al.  Straightlining: Overview of Measurement, Comparison of Indicators, and Effects in Mail–Web Mixed-Mode Surveys , 2019 .

[4]  A. Nichols,et al.  Why don’t we care more about carelessness? Understanding the causes and consequences of careless participants , 2020 .

[5]  Tie-Yan Liu,et al.  LightGBM: A Highly Efficient Gradient Boosting Decision Tree , 2017, NIPS.

[6]  Daniel M. Oppenheimer,et al.  Instructional Manipulation Checks: Detecting Satisficing to Increase Statistical Power , 2009 .

[7]  Joseph W. Houpt,et al.  Will the Questions Ever End? Person-Level Increases in Careless Responding During Questionnaire Completion , 2020 .

[8]  Eunjin Kim,et al.  A Novel Biometric Identification Based on a User’s Input Pattern Analysis for Intelligent Mobile Devices , 2012 .

[9]  Hans Baumgartner,et al.  Response Styles in Marketing Research: A Cross-National Investigation , 2001 .

[10]  Melanie Revilla,et al.  What are the Links in a Web Survey Among Response Time, Quality, and Auto-Evaluation of the Efforts Done? , 2015 .

[11]  Chuan Yue,et al.  Attention Please: Your Attention Check Questions in Survey Studies Can Be Automatically Answered , 2020, WWW.

[12]  J. Krosnick Response strategies for coping with the cognitive demands of attitude measures in surveys , 1991 .

[13]  Modeling compliance with COVID-19 prevention guidelines: the critical role of trust in science , 2020, Psychology, health & medicine.

[14]  David J. Hauser,et al.  It’s a Trap! Instructional Manipulation Checks Prompt Systematic Thinking on “Tricky” Tasks , 2015 .

[15]  Kevin Warwick,et al.  Non-conventional keystroke dynamics for user authentication , 2017, Pattern Recognit. Lett..

[16]  J. Smetana,et al.  Helicopter Parenting and Perceived Overcontrol by Emerging Adults: A Family-Level Profile Analysis , 2020, Journal of Child and Family Studies.

[17]  Hamed Taherdoost Validity and Reliability of the Research Instrument; How to Test the Validation of a Questionnaire/Survey in a Research , 2016 .

[18]  M. Malesza,et al.  Predictors of anxiety during the COVID-19 pandemic in Poland , 2020, Personality and Individual Differences.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Takashi Suzuki,et al.  Using Machine Learning to Predict Inappropriate Respondents , 2019, Kodo Keiryogaku (The Japanese Journal of Behaviormetrics).

[21]  Asako Miura,et al.  Survey Satisficing Inflates Stereotypical Responses in Online Experiment: The Case of Immigration Study , 2016, Front. Psychol..

[22]  Timo Gnambs,et al.  A Meta-Analysis of Test Scores in Proctored and Unproctored Ability Assessments , 2020 .

[23]  Georgina M. Gross,et al.  Development and psychometric properties of the Multidimensional Schizotypy Scale: A new measure for assessing positive, negative, and disorganized schizotypy , 2017, Schizophrenia Research.

[24]  Sanjay Goel,et al.  Shared Benefits and Information Privacy: What Determines Smart Meter Technology Adoption? , 2017, J. Assoc. Inf. Syst..

[25]  H. Simon,et al.  Rational choice and the structure of the environment. , 1956, Psychological review.

[26]  Takuya Akiba,et al.  Optuna: A Next-generation Hyperparameter Optimization Framework , 2019, KDD.

[27]  Adam J. Berinsky,et al.  Can we turn shirkers into workers , 2016 .

[28]  Carol Galais,et al.  Answering Without Reading: IMCs and Strong Satisficing in Online Surveys , 2016 .

[29]  Ulrich Schroeders,et al.  Detecting Careless Responding in Survey Data Using Stochastic Gradient Boosting , 2021, Educational and Psychological Measurement.

[30]  A. Miura,et al.  Survey Satisficing Biases the Estimation of Moderation Effects , 2018, Japanese Psychological Research.

[31]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[32]  Roger Tourangeau,et al.  Web Surveys by Smartphones and Tablets , 2018 .