Personality Traits Classification on Twitter

Personality traits have been shown to strongly influence important aspects of life such as success in the workplace, political temperament, and general emotional stability. Computer-based personality assessments that use information from social networking platforms have been shown to be more accurate than judgments made by people close to the subject. This paper presents a personality traits classification system that combines language-based features, derived from count-based vectorization (TF-IDF) and the GloVe word embedding technique, with an ensemble prediction system consisting of gradient-boosted decision trees and an SVM classifier. This combination makes it possible to reliably estimate certain personality traits using only the latest 50 tweets from a user's profile. The performance of the proposed system is validated on a large, publicly available dataset and compares favourably with other state-of-the-art methods.
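The feature and ensemble design described above can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the function names, the toy two-dimensional embedding table (a stand-in for pretrained GloVe vectors), the smoothed IDF formula, and the equal-weight averaging of the two classifiers' probabilities are all assumptions made for illustration, since the abstract does not specify these details.

```python
import math
from collections import Counter

# Hypothetical toy embeddings standing in for pretrained GloVe vectors.
TOY_EMBEDDINGS = {
    "party": [1.0, 0.0],
    "friends": [0.8, 0.2],
    "book": [0.0, 1.0],
}


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF rows) for a list of tokenized documents."""
    vocab = sorted({tok for doc in docs for tok in doc})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each token.
    df = Counter(tok for doc in docs for tok in set(doc))
    # Smoothed inverse document frequency (one common variant, assumed here).
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}
    rows = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc) or 1  # avoid division by zero for empty documents
        rows.append([counts[tok] / total * idf[tok] for tok in vocab])
    return vocab, rows


def mean_embedding(doc, embeddings, dim):
    """Average the embedding vectors of the tokens that have one."""
    vecs = [embeddings[tok] for tok in doc if tok in embeddings]
    if not vecs:
        return [0.0] * dim
    return [sum(col) / len(vecs) for col in zip(*vecs)]


def build_features(docs, embeddings, dim):
    """Concatenate TF-IDF features with the mean embedding of each document."""
    _, tfidf_rows = tfidf_vectors(docs)
    return [row + mean_embedding(doc, embeddings, dim)
            for row, doc in zip(tfidf_rows, docs)]


def soft_vote(prob_a, prob_b, weight_a=0.5):
    """Combine two classifiers' probabilities for a trait by weighted averaging."""
    return weight_a * prob_a + (1.0 - weight_a) * prob_b


# Each document is one user's recent tweets, concatenated and tokenized.
users = [["party", "friends", "party"], ["book", "friends"]]
features = build_features(users, TOY_EMBEDDINGS, dim=2)
```

In a full system, `features` would be passed to a gradient-boosted tree model and an SVM, and `soft_vote` would merge their per-trait probabilities; the equal weighting used by default here is only a placeholder.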
