Traditional versus Facebook-based surveys: Evaluation of biases in self-reported demographic and psychometric information

Background: Social media in scientific research offers a unique digital observatory of human behaviours and hence great opportunities to conduct research at large scale, answering complex sociodemographic questions. We focus on the identification and assessment of biases in social-media-administered surveys. Objective: This study aims to shed light on population, self-selection, and behavioural biases, empirically comparing the consistency between self-reported information collected traditionally versus social-media-administered questionnaires, including demographic and psychometric attributes. Methods: We engaged a demographically representative cohort of young adults in Italy (approximately 4,000 participants) in taking a traditionally administered online survey and then, after one year, we invited them to use our ad hoc Facebook application (988 accepted) where they filled in part of the initial survey. We assess the statistically significant differences indicating population, self-selection, and behavioural biases due to the different context in which the questionnaire is administered. Results: Our findings suggest that surveys administered on Facebook do not exhibit major biases with respect to traditionally administered surveys in terms of neither demographics nor personality traits. Loyalty, authority, and social binding values were higher in the Facebook platform, probably due to the platform’s intrinsic social character. Conclusions: We conclude that Facebook apps are valid research tools for administering demographic and psychometric surveys, provided that the entailed biases are taken into consideration. Contribution: We contribute to the characterisation of Facebook apps as a valid scientific tool to administer demographic and psychometric surveys, and to the assessment of population, self-selection, and behavioural biases in the collected data.

[1]  Venkata Rama Kiran Garimella,et al.  Studying Migrant Assimilation Through Facebook Interests , 2018, SocInfo.

[2]  Sarah Vieweg,et al.  Processing Social Media Messages in Mass Emergency , 2014, ACM Comput. Surv..

[3]  J. Knodel,et al.  What does living alone really mean for older persons? A comparative study of Myanmar, Vietnam, and Thailand , 2015 .

[4]  F. Wilcoxon,et al.  Individual comparisons of grouped data by ranking methods. , 1946, Journal of economic entomology.

[5]  G. vanRossum Python reference manual , 1995 .

[6]  D. Ruths,et al.  Social media for large studies of behavior , 2014, Science.

[7]  Morteza Dehghani,et al.  Moralization in social networks and the emergence of violence during protests , 2018, Nature Human Behaviour.

[8]  Robert E. Kraut,et al.  Using facebook after losing a job: differential benefits of strong and weak ties , 2013, CSCW.

[9]  A. Komter,et al.  Family ties and depressive mood in Eastern and Western Europe , 2012 .

[10]  Zeynep Cemalcilar,et al.  Age differences in privacy attitudes, literacy and privacy management on Facebook , 2016 .

[11]  A. Vespignani Predicting the Behavior of Techno-Social Systems , 2009, Science.

[12]  Matthew J. Salganik,et al.  Bit by bit: social research in the digital age , 2019, The Journal of mathematical sociology.

[13]  F. Conrad,et al.  Social Media Analyses for Social Measurement. , 2016, Public opinion quarterly.

[14]  S. Gosling,et al.  A very brief measure of the Big-Five personality domains , 2003 .

[15]  Daniela Paolotti,et al.  How Search Engine Data Enhance the Understanding of Determinants of Suicide in India and Inform Prevention: Observational Study , 2019, Journal of medical Internet research.

[16]  Manuel Cebrián,et al.  Social Media Fingerprints of Unemployment , 2014, PloS one.

[17]  Hans-Peter Kohler,et al.  Social Networks and Fertility , 2001 .

[18]  Robert M. Groves,et al.  Total Survey Error: Past, Present, and Future , 2010 .

[19]  Panagiotis Takis Metaxas,et al.  How (Not) to Predict Elections , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[20]  Zeynep Tufekci,et al.  Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls , 2014, ICWSM.

[21]  Chareen Snelson,et al.  Qualitative and Mixed Methods Social Media Research , 2016 .

[22]  Fabian Flöck,et al.  A Total Error Framework for Digital Traces of Humans , 2019, ArXiv.

[23]  Emre Kıcıman,et al.  Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries , 2018, Front. Big Data.

[24]  Abbe Mowshowitz,et al.  Bias on the web , 2002, CACM.

[25]  Paul T. Costa,et al.  The Five-Factor Model of Personality and Its Relevance to Personality Disorders , 1992 .

[26]  D. Kerby The Simple Difference Formula: An Approach to Teaching Nonparametric Correlation1: , 2014 .

[27]  Kyriaki Kalimeri,et al.  Predicting Demographics, Moral Foundations, and Human Values from Digital Behaviors , 2017, Comput. Hum. Behav..

[28]  Fabrício Benevenuto,et al.  Using Facebook Ads Audiences for Global Lifestyle Disease Surveillance: Promises and Limitations , 2017, WebSci.

[29]  Jeong-Hwa Ho The problem group? Psychological wellbeing of unmarried people living alone in the Republic of Korea , 2015 .

[30]  Ulf-Dietrich Reips Standards for Internet-based experimenting. , 2002, Experimental psychology.

[31]  Duncan J. Watts,et al.  Who says what to whom on twitter , 2011, WWW.

[32]  A. Pentland,et al.  Life in the network: The coming age of computational social science: Science , 2009 .

[33]  W. Clark,et al.  Loss aversion and duration of residence , 2016 .

[34]  N. Parr Satisfaction with life as an antecedent of fertility: Partner + happiness = children? , 2010 .

[35]  J. Haidt,et al.  Intuitive ethics: how innately prepared intuitions generate culturally variable virtues , 2004, Daedalus.

[36]  Lindsay T. Graham,et al.  A Review of Facebook Research in the Social Sciences , 2012, Perspectives on psychological science : a journal of the Association for Psychological Science.

[37]  Joseph Murphy,et al.  Social Media, Sociality, and Survey Research , 2013 .

[38]  N. Cliff Dominance statistics: Ordinal analyses to answer ordinal questions. , 1993 .

[39]  J. Burkell,et al.  Facebook: public space, or private space? , 2014 .

[40]  Kyriaki Kalimeri,et al.  Human Values and Attitudes towards Vaccination in Social Media , 2019, WWW.