Personality Recognition from Facebook Text

This work presents a Natural Language Processing study on recognising personality traits from text written in Portuguese. To this end, we first built a corpus of Facebook status updates labelled with the personality traits of their authors, and then used it to train a number of computational models of personality recognition. The models range from a standard approach relying on lexical knowledge from the LIWC dictionary and other resources to purely text-based methods such as bag of words and word embeddings, among others. Results suggest that word embedding models slightly outperform the alternatives under consideration, with the added advantage of not requiring any language-specific lexical resources.
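To make the contrast between the two text-based representations concrete, the following is a minimal sketch, not the implementation used in the study. It builds a bag-of-words count vector and an averaged word-embedding vector for a single status update. The function names, the toy vocabulary, and the two-dimensional embedding values are all hypothetical and chosen only for illustration; real embeddings would be learned from a large corpus.

```python
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def bag_of_words(text, vocabulary):
    """Return one count per vocabulary word, in vocabulary order."""
    counts = Counter(tokenize(text))
    return [counts[word] for word in vocabulary]


def mean_embedding(text, embeddings, dim):
    """Average the vectors of all tokens that have an embedding.

    Tokens missing from `embeddings` are skipped; if none are found,
    a zero vector of length `dim` is returned.
    """
    vectors = [embeddings[t] for t in tokenize(text) if t in embeddings]
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


# Hypothetical toy data (assumption: not taken from the corpus in the paper).
vocabulary = ["festa", "amigos", "livro", "casa"]
toy_embeddings = {
    "festa": [1.0, 0.0],
    "amigos": [0.8, 0.2],
    "livro": [0.0, 1.0],
    "casa": [0.2, 0.8],
}

status = "Festa com amigos e mais amigos"
bow = bag_of_words(status, vocabulary)
emb = mean_embedding(status, toy_embeddings, dim=2)
print(bow)
print(emb)
```

Either vector could then be passed to any standard classifier, with one model per personality trait. The bag-of-words vector grows with the vocabulary, whereas the averaged embedding has a fixed, small dimensionality and needs no hand-built lexical resource.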
