On Predicting Sociodemographic Traits and Emotions from Communications in Social Networks and Their Implications to Online Self-Disclosure

Social media services such as Twitter and Facebook are virtual environments where people express their thoughts, emotions, and opinions and where they reveal themselves to their peers. We analyze a sample of 123,000 Twitter users and 25 million of their tweets to investigate the relation between the opinions and emotions that users express and their predicted psychodemographic traits. We show that the emotions that we express on online social networks reveal deep insights about ourselves. Our methodology is based on building machine learning models for inferring coarse-grained emotions and psychodemographic profiles from user-generated content. We examine several user attributes, including gender, income, political views, age, education, optimism, and life satisfaction. We correlate these predicted demographics with the emotional profiles emanating from user tweets, as captured by Ekman's emotion classification. We find that some users tend to express significantly more joy and significantly less sadness in their tweets, such as those predicted to be in a relationship, with children, or with a higher than average annual income or educational level. Users predicted to be women tend to be more opinionated, whereas those predicted to be men tend to be more neutral. Finally, users predicted to be younger and liberal tend to project more negative opinions and emotions. We discuss the implications of our findings to online privacy concerns and self-disclosure behavior.

[1]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[2]  R. Eisler,et al.  Masculine gender-role stress: predictor of anger, anxiety, and health-risk behaviors. , 1988, Journal of personality assessment.

[3]  Vaibhavi N Patodkar,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2016 .

[4]  Richard E. Lucas,et al.  Higher Income Is Associated With Less Daily Sadness but not More Daily Happiness , 2015 .

[5]  Ana-Maria Popescu,et al.  A Machine Learning Approach to Twitter User Classification , 2011, ICWSM.

[6]  Dong Nguyen,et al.  "How Old Do You Think I Am?" A Study of Language and Age in Twitter , 2013, ICWSM.

[7]  Yoram Bachrach,et al.  Studying User Income through Language, Behaviour and Affect in Social Media , 2015, PloS one.

[8]  Marja Kokkonen,et al.  Factors contributing to verbal self-disclosure , 2007 .

[9]  B. Farber,et al.  Self-Disclosure in Psychotherapy , 2006 .

[10]  Katelyn Y. A. McKenna,et al.  Plan 9 From Cyberspace: The Implications of the Internet for Personality and Social Psychology , 2000 .

[11]  S. Berg Snowball Sampling—I , 2006 .

[12]  P. Cozby Self-disclosure: a literature review. , 1973, Psychological bulletin.

[13]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[14]  Elena Kolesnikova,et al.  "It Won't Happen To Me!": Self-Disclosure in Online Social Networks , 2009, AMCIS.

[15]  Margaret L. Kern,et al.  Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[16]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[17]  David Yarowsky,et al.  Exploring Demographic Language Variations to Improve Multilingual Sentiment Analysis in Social Media , 2013, EMNLP.

[18]  J. Forgas Affective influences on self-disclosure: mood effects on the intimacy and reciprocity of disclosing personal information. , 2011, Journal of personality and social psychology.

[19]  Saif Mohammad,et al.  NRC-Canada: Building the State-of-the-Art in Sentiment Analysis of Tweets , 2013, *SEMEVAL.

[20]  Jacob Ratkiewicz,et al.  Political Polarization on Twitter , 2011, ICWSM.

[21]  Wenyaw Chan,et al.  Anger in Adolescents: Sex, Ethnicity, Age Differences, and Psychometric Properties , 2003, Nursing research.

[22]  Zeynep Tufekci Can You See Me Now? Audience and Disclosure Regulation in Online Social Network Sites , 2008 .

[23]  Teresa Correa,et al.  Who interacts on the Web?: The intersection of users' personality and social media use , 2010, Comput. Hum. Behav..

[24]  Adam D. I. Kramer,et al.  Detecting Emotional Contagion in Massive Social Networks , 2014, PloS one.

[25]  Svitlana Volkova,et al.  Inferring Latent User Properties from Texts Published in Social Media , 2015, AAAI.

[26]  Svitlana Volkova,et al.  Inferring User Political Preferences from Streaming Communications , 2014, ACL.

[27]  David Yarowsky,et al.  Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter , 2013, NAACL.

[28]  Anita Sharma,et al.  Personality and Patterns of Facebook Usage , 2016 .

[29]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[30]  Kazuya Hara,et al.  Computer-Mediated Relationship Development: A Cross-Cultural Comparison , 2005, J. Comput. Mediat. Commun..

[31]  Alecia Wolf,et al.  Emotional Expression Online: Gender Differences in Emoticon Use , 2000, Cyberpsychology Behav. Soc. Netw..

[32]  Sanda M. Harabagiu,et al.  EmpaTweet: Annotating and Detecting Emotions on Twitter , 2012, LREC.

[33]  Natalya N. Bazarova,et al.  Self‐Disclosure in Social Media: Extending the Functional Approach to Disclosure Motivations and Characteristics on Social Network Sites , 2014 .

[34]  Derek Ruths,et al.  Classifying Political Orientation on Twitter: It's Not Easy! , 2013, ICWSM.

[35]  Leonard Reinecke,et al.  The reciprocal effects of social network site use and the disposition for self-disclosure: A longitudinal study , 2013, Comput. Hum. Behav..

[36]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[37]  Nina Wacholder,et al.  Identifying Sarcasm in Twitter: A Closer Look , 2011, ACL.

[38]  David Bamman,et al.  Gender identity and lexical variation in social media , 2012, 1210.4567.

[39]  M. Snyder Self-monitoring of expressive behavior. , 1974 .

[40]  Hua Qian,et al.  Anonymity and Self-Disclosure on Weblogs , 2007, J. Comput. Mediat. Commun..

[41]  Chris Callison-Burch,et al.  Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk , 2009, EMNLP.

[42]  Alice H. Oh,et al.  Do You Feel What I Feel? Social Aspects of Emotions in Twitter Conversations , 2012, ICWSM.

[43]  Clark McCauley,et al.  Individual differences in sensitivity to disgust: A scale sampling seven domains of disgust elicitors , 1994 .

[44]  Amit P. Sheth,et al.  Harnessing Twitter "Big Data" for Automatic Emotion Identification , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[45]  M. Allen,et al.  Sex differences in self-disclosure: a meta-analysis. , 1992, Psychological bulletin.

[46]  M. Dunnam,et al.  Two's Company: Self-Disclosure and Reciprocity in Triads Versus Dyads , 1985 .

[47]  Wendy Liu,et al.  Homophily and Latent Attribute Inference: Inferring Latent Attributes of Twitter Users from Neighbors , 2012, ICWSM.

[48]  Pavica Sheldon "I'll poke you. You'll poke me!" Self-disclosure, social attraction, predictability and trust as important predictors of Facebook relationships , 2009 .

[49]  John O'nolan,et al.  Designing Emotion , 2012 .

[50]  D. R. Shaffer,et al.  Self-monitoring as a determinant of self-disclosure reciprocity during the acquaintance process. , 1982 .

[51]  Cameron Marlow,et al.  Social network activity and social well-being , 2010, CHI.

[52]  Derek Ruths,et al.  Gender Inference of Twitter Users in Non-English Contexts , 2013, EMNLP.

[53]  U. Sailer,et al.  The affective profiles in the USA: happiness, depression, life satisfaction, and happiness-increasing strategies , 2013, PeerJ.

[54]  John Scott What is social network analysis , 2010 .

[55]  P. Ekman An argument for basic emotions , 1992 .

[56]  Jon M. Kleinberg,et al.  Romantic partnerships and the dispersion of social ties: a network analysis of relationship status on facebook , 2013, CSCW.

[57]  Saif Mohammad,et al.  Using Hashtags to Capture Fine Emotion Categories from Tweets , 2015, Comput. Intell..

[58]  Harith Alani,et al.  Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold , 2013, ESSEM@AI*IA.

[59]  Javier Velasco-Martin,et al.  Self-disclosure in social media , 2011, CHI Extended Abstracts.

[60]  John D. Burger,et al.  Discriminating Gender on Twitter , 2011, EMNLP.

[61]  Benjamin Van Durme Streaming Analysis of Discourse Participants , 2012, EMNLP-CoNLL.

[62]  Alessandro Acquisti,et al.  Predicting Social Security numbers from public data , 2009, Proceedings of the National Academy of Sciences.

[63]  V. Derlega,et al.  Self-disclosure : theory, research, and therapy , 1987 .

[64]  Noah A. Smith,et al.  Log-Linear Models , 2004 .

[65]  Saif Mohammad,et al.  Using Nuances of Emotion to Identify Personality , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[66]  David Yarowsky,et al.  Classifying latent user attributes in twitter , 2010, SMUC '10.

[67]  Jennifer Golbeck,et al.  Predicting Personality from Twitter , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[68]  Kathryn Greene,et al.  Social Penetration Theory , 2015 .

[69]  Whitney P. Special,et al.  Self-disclosure and student satisfaction with Facebook , 2012, Comput. Hum. Behav..

[70]  Saif Mohammad,et al.  Tracking Sentiment in Mail: How Genders Differ on Emotional Axes , 2011, WASSA@ACL.

[71]  Gregory J. Park,et al.  From "Sooo excited!!!" to "So proud": using language to study development. , 2014, Developmental psychology.

[72]  Rui Chen,et al.  Self-disclosure at social networking sites: An exploration through relational capitals , 2011, Information Systems Frontiers.

[73]  Axel Schäfer,et al.  Gender differences in the processing of disgust- and fear-inducing pictures: an fMRI study , 2005, Neuroreport.

[74]  D. Fuqua,et al.  Sex Differences in the Relationship of Anger and Depression: An Empirical Study , 1999 .