Personality Profiling of Fictional Characters using Sense-Level Links between Lexical Resources

This study focuses on personality prediction of protagonists in novels based on the Five-Factor Model of personality. We present and publish a novel collaboratively built dataset of fictional character personality and design our task as a text classification problem. We incorporate a range of semantic features, including WordNet and VerbNet sense-level information and word vector representations. We evaluate three machine learning models based on the speech, actions and predicatives of the main characters, and show that especially the lexical-semantic features significantly outperform the baselines. The most predictive features correspond to reported findings in personality psychology.

[1]  Marilyn A. Walker,et al.  Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[2]  S. Srivastava,et al.  The Big Five Trait taxonomy: History, measurement, and theoretical perspectives. , 1999 .

[3]  Mark A. Hall,et al.  Correlation-based Feature Selection for Machine Learning , 2003 .

[4]  Kathleen McKeown,et al.  Extracting Social Networks from Literary Fiction , 2010, ACL.

[5]  A. Furnham,et al.  Extraversion: The Unloved Variable in Applied Linguistic Research , 1999 .

[6]  Alexandre Passant,et al.  dbrec - Music Recommendations Using DBpedia , 2010, SEMWEB.

[7]  A. Tellegen,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES An Alternative "Description of Personality": The Big-Five Factor Structure , 2022 .

[8]  Nathanael Chambers,et al.  Event Schema Induction with a Probabilistic Entity-Driven Model , 2013, EMNLP.

[9]  Yair Neuman,et al.  A Vectorial Semantics Approach to Personality Assessment , 2014, Scientific Reports.

[10]  Jean-Marc Dewaele,et al.  Variation in the Contextuality of Language: An Empirical Measure , 2002 .

[11]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[12]  Taniya Mishra,et al.  From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children’s Stories , 2014, CLfL@EACL.

[13]  Daniel J. Kruger,et al.  Portrayal of Personality in Victorian Novels Reflects Modern Research Findings but Amplifies the Significance of Agreeableness , 2011 .

[14]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[15]  Jon Oberlander,et al.  Weblogs, genres and individual differences , 2005 .

[16]  George M. Mohay,et al.  Gender-preferential text mining of e-mail discourse , 2002, 18th Annual Computer Security Applications Conference, 2002. Proceedings..

[17]  Martha Palmer,et al.  Verbnet: a broad-coverage, comprehensive verb lexicon , 2005 .

[18]  Daniel S. Dotson Portrayal of Physicists in Fictional Works , 2009 .

[19]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[20]  T. Chartrand,et al.  The chameleon effect: the perception-behavior link and social interaction. , 1999, Journal of personality and social psychology.

[21]  Daniel Gatica-Perez,et al.  The YouTube Lens: Crowdsourced Personality Impressions and Audiovisual Analysis of Vlogs , 2013, IEEE Transactions on Multimedia.

[22]  Brendan T. O'Connor,et al.  Learning Latent Personas of Film Characters , 2013, ACL.

[23]  Daniel J Levitin,et al.  The structure of musical preferences: a five-factor model. , 2011, Journal of personality and social psychology.

[24]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[25]  Pushmeet Kohli,et al.  Manifestations of user personality in website choice and behaviour on online social networks , 2013, Machine Learning.

[26]  Markus Zanker,et al.  Linked open data to support content-based recommender systems , 2012, I-SEMANTICS '12.

[27]  Denilson Barbosa,et al.  Identification of Speakers in Novels , 2013, ACL.

[28]  Subramanian Ramanathan,et al.  Employing social gaze and speaking activity for automatic determination of the Extraversion trait , 2010, ICMI-MLMI '10.

[29]  S. Gosling,et al.  Personality in its natural habitat: manifestations and implicit folk theories of personality in daily life. , 2006, Journal of personality and social psychology.

[30]  Mats Malm,et al.  Character Profiling in 19th Century Fiction , 2011 .

[31]  T. Graepel,et al.  Private traits and attributes are predictable from digital records of human behavior , 2013, Proceedings of the National Academy of Sciences.

[32]  Keith Oatley,et al.  Exploring the link between reading fiction and empathy: Ruling out individual differences and examining outcomes , 2009 .

[33]  David J. Pittenger,et al.  Cautionary comments regarding the Myers-Briggs Type Indicator. , 2005 .

[34]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[35]  Jon Oberlander,et al.  Whose Thumb Is It Anyway? Classifying Author Personality from Weblog Text , 2006, ACL.

[36]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[37]  I. B. Myers Manual: A Guide to the Development and Use of the Myers-Briggs Type Indicator , 1985 .

[38]  Saif Mohammad,et al.  Using Nuances of Emotion to Identify Personality , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[39]  Fabio Pianesi,et al.  Workshop on Computational Personality Recognition: Shared Task , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[40]  William C. Tirre,et al.  Reading interests: Their dimensionality and correlation with personality and cognitive factors , 1995 .

[41]  Jon Oberlander,et al.  Identifying more bloggers: Towards large scale personality classification of personal weblogs , 2007, ICWSM.

[42]  Simone Paolo Ponzetto,et al.  Exploiting FrameNet for Content-Based Book Recommendation , 2014, CBRecSys@RecSys.

[43]  Peter D. Turney,et al.  Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon , 2010, HLT-NAACL 2010.

[44]  Daniel Gatica-Perez,et al.  Cross-domain personality prediction: from video blogs to small group meetings , 2013, ICMI '13.

[45]  Alastair J. Gill,et al.  Taking Care of the Linguistic Features of Extraversion , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[46]  Max Mühlhäuser,et al.  Darmstadt Knowledge Processing Repository Based on UIMA , 2007 .

[47]  L. R. Goldberg THE DEVELOPMENT OF MARKERS FOR THE BIG-FIVE FACTOR STRUCTURE , 1992 .

[48]  D. MacDonald,et al.  Examination of the Relationship between the Myers-Briggs Type Indicator and the Neo Personality Inventory , 1994 .

[49]  Oliver Ferschke,et al.  DKPro TC: A Java-based Framework for Supervised Learning Experiments on Textual Data , 2014, ACL.

[50]  James R. Curran,et al.  A Sequence Labelling Approach to Quote Attribution , 2012, EMNLP.

[51]  P. Costa,et al.  Validation of the five-factor model of personality across instruments and observers. , 1987, Journal of personality and social psychology.

[52]  Fabio Pianesi,et al.  The Workshop on Computational Personality Recognition 2014 , 2014, ACM Multimedia.

[53]  David Bamman,et al.  A Bayesian Mixed Effects Model of Literary Character , 2014, ACL.

[54]  Robert R. McCrae,et al.  The Five‐Factor Model in Fact and Fiction , 2012 .

[55]  Iryna Gurevych,et al.  UBY - A Large-Scale Unified Lexical-Semantic Resource Based on LMF , 2012, EACL.

[56]  Geoff F. Kaufman,et al.  Changing beliefs and behavior through experience-taking. , 2012, Journal of personality and social psychology.

[57]  D. Byrne Interpersonal attraction and attitude similarity. , 1961, Journal of abnormal and social psychology.

[58]  John A. Johnson,et al.  The international personality item pool and the future of public-domain personality measures ☆ , 2006 .

[59]  Corinna E Löckenhoff,et al.  Five-Factor Model personality profiles of drug users , 2008, BMC psychiatry.

[60]  P. Costa,et al.  Reinterpreting the Myers-Briggs Type Indicator from the perspective of the five-factor model of personality. , 1989, Journal of personality.

[61]  Conor Hayes,et al.  Using Linked Data to Build Open, Collaborative Recommender Systems , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[62]  R. Horton,et al.  Is actual similarity necessary for attraction? A meta-analysis of actual and perceived similarity , 2008 .

[63]  Michael Wilson MRC Psycholinguistic Database , 2001 .