Twitter Users’ Privacy Concerns: What do Their Accounts’ First Names Tell Us?

Abstract Purpose In this paper, we describe how gender recognition on Twitter can be used as an intelligent business tool to determine the privacy concerns among users, and ultimately offer a more personalized service for customers who are more likely to respond positively to targeted advertisements. Design/methodology/approach We worked with two different data sets to examine whether Twitter users’ gender, inferred from the first name of the account and the profile description, correlates with the privacy setting of the account. We also used a set of features including the inferred gender of Twitter users to develop classifiers that predict user privacy settings. Findings We found that the inferred gender of Twitter users correlates with the account’s privacy setting. Specifically, females tend to be more privacy concerned than males. Users whose gender cannot be inferred from their provided first names tend to be more privacy concerned. In addition, our classification performance suggests that inferred gender can be used as an indicator of the user’s privacy preference. Research limitations It is known that not all twitter accounts are real user accounts, and social bots tweet as well. A major limitation of our study is the lack of consideration of social bots in the data. In our study, this implies that at least some percentage of the undefined accounts, that is, accounts that had names non-existent in the name dictionary, are social bots. It will be interesting to explore the privacy setting of social bots in the Twitter space. Practical implications Companies are investing large amounts of money in business intelligence tools that allow them to know the preferences of their consumers. Due to the large number of consumers around the world, it is very difficult for companies to have direct communication with each customer to anticipate market changes. For this reason, the social network Twitter has gained relevance as one ideal tool for information extraction. On the other hand, users’ privacy preference needs to be considered when companies consider leveraging their publicly available data. This paper suggests that gender recognition of Twitter users, based on Twitter users’ provided first names and their profile descriptions, can be used to infer the users’ privacy preference. Originality/value This study explored a new way of inferring Twitter user’s gender, that is, to recognize the user’s gender based on the provided first name and the user’s profile description. The potential of this information for predicting the user’s privacy preference is explored.

[1]  D. Ruths,et al.  What's in a Name? Using First Names as Features for Gender Inference in Twitter , 2013, AAAI Spring Symposium: Analyzing Microtext.

[2]  Robert E. Mercer,et al.  Privacy Behaviour and Profile Configuration in Twitter , 2016, WWW.

[3]  Alison E. Adam,et al.  Gender and computer ethics , 2000, CSOC.

[4]  Calton Pu,et al.  Large Online Social Footprints--An Emerging Threat , 2009, 2009 International Conference on Computational Science and Engineering.

[5]  Daniela Moctezuma,et al.  Features combination for gender recognition on Twitter users , 2016, 2016 IEEE International Autumn Meeting on Power, Electronics and Computing (ROPEC).

[6]  Sonali R. Mishra,et al.  A Fair Exchange: Exploring How Online Privacy is Valued , 2016, 2016 49th Hawaii International Conference on System Sciences (HICSS).

[7]  Amac Herdagdelen,et al.  Twitter n-gram corpus with demographic metadata , 2013, Language Resources and Evaluation.

[8]  Sushil Jajodia,et al.  Who is tweeting on Twitter: human, bot, or cyborg? , 2010, ACSAC '10.

[9]  Isabel P. Riquelme,et al.  Is the influence of privacy and security on online trust the same for all type of consumers? , 2014, Electron. Mark..

[10]  ティモシー モリー,et al.  Customer Data : Designing for Transparency and Trust , 2015 .

[11]  Alessandro Acquisti,et al.  Information revelation and privacy in online social networks , 2005, WPES '05.

[12]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[13]  Sajjad Nazir,et al.  How Online Shopping Is Affecting Consumers Buying Behavior in Pakistan , 2012 .

[14]  Julien Lusson About our site , 2013 .

[15]  John D. Burger,et al.  Discriminating Gender on Twitter , 2011, EMNLP.

[16]  Sebastian Möller,et al.  Gender differences in the perception of security of mobile phones , 2012, Mobile HCI.

[17]  Joseph S. Fulda,et al.  The internet as an engine of scholarship , 2000, CSOC.

[18]  Michael Zimmer,et al.  A topology of Twitter research: disciplines, methods, and ethics , 2014, Aslib J. Inf. Manag..

[19]  Richard B. Parker A Definition of Privacy , 2017 .

[20]  Robert E. Mercer,et al.  Privacy Preference Inference via Collaborative Filtering , 2016, ICWSM.

[21]  Arthur D. Fisk,et al.  Privacy and technology: folk definitions and perspectives , 2008, CHI Extended Abstracts.

[22]  Anat Rachel Shimoni,et al.  Gender, genre, and writing style in formal written texts , 2003 .

[23]  Aaron C. Kay,et al.  Journal of Personality and Social Psychology Evidence That Gendered Wording in Job Advertisements Exists and Sustains Gender Inequality , 2011 .