Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text

It is well known that utterances convey a great deal of information about the speaker in addition to their semantic content. One such type of information consists of cues to the speaker's personality traits, the most fundamental dimension of variation between humans. Recent work explores the automatic detection of other types of pragmatic variation in text and conversation, such as emotion, deception, speaker charisma, dominance, point of view, subjectivity, opinion and sentiment. Personality affects these other aspects of linguistic production, and thus personality recognition may be useful for these tasks, in addition to many other potential applications. However, to date, there is little work on the automatic recognition of personality traits. This article reports experimental results for recognition of all Big Five personality traits, in both conversation and text, utilising both self and observer ratings of personality. While other work reports classification results, we experiment with classification, regression and ranking models. For each model, we analyse the effect of different feature sets on accuracy. Results show that for some traits, any type of statistical model performs significantly better than the baseline, but ranking models perform best overall. We also present an experiment suggesting that ranking models are more accurate than multi-class classifiers for modelling personality. In addition, recognition models trained on observed personality perform better than models trained using self-reports, and the optimal feature set depends on the personality trait. A qualitative analysis of the learned models confirms previous findings linking language and personality, while revealing many new linguistic markers.

[1]  Swapna Somasundaran,et al.  Detecting Arguing and Sentiment in Meetings , 2007, SIGdial.

[2]  Dirk Heylen,et al.  Dominance Detection in Meetings Using Easily Obtainable Features , 2005, MLMI.

[3]  Thomas Rist,et al.  Integrating Models of Personality and Emotions into Lifelike Characters , 1999, IWAI.

[4]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[5]  Marilyn A. Walker,et al.  PERSONAGE: Personality Generation for Dialogue , 2007, ACL.

[6]  J. Pennebaker,et al.  Psychological aspects of natural language. use: our words, our selves. , 2003, Annual review of psychology.

[7]  W. T. Norman,et al.  Toward an adequate taxonomy of personality attributes: replicated factors structure in peer nomination personality ratings. , 1963, Journal of abnormal and social psychology.

[8]  Yorick Wilks,et al.  Error Analysis of Dialogue Act Classification , 2005, TSD.

[9]  K. Vogel,et al.  L'interlangue et la personnalité de l'apprenant , 1986 .

[10]  J. Pennebaker,et al.  Lying Words: Predicting Deception from Linguistic Styles , 2003, Personality & social psychology bulletin.

[11]  Bruce L. Smith,et al.  Effects of Speech Rate on Personality Perception , 1975, Language and speech.

[12]  Alastair J. Gill,et al.  Taking Care of the Linguistic Features of Extraversion , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[13]  Claire Cardie,et al.  Multi-Perspective Question Answering Using the OpQA Corpus , 2005, HLT.

[14]  Jon Oberlander,et al.  Whose Thumb Is It Anyway? Classifying Author Personality from Weblog Text , 2006, ACL.

[15]  G. A. Mishne,et al.  Expiriments with mood classification in blog posts , 2005, SIGIR 2005.

[16]  Ellen Riloff,et al.  Creating Subjective and Objective Sentence Classifiers from Unannotated Texts , 2005, CICLing.

[17]  S. Lilienfeld,et al.  Is antisocial personality disorder continuous or categorical? A taxometric analysis , 2006, Psychological Medicine.

[18]  Jon Oberlander,et al.  Identifying more bloggers: Towards large scale personality classification of personal weblogs , 2007, ICWSM.

[19]  Julia Hirschberg,et al.  Acoustic/prosodic and lexical correlates of charismatic speech , 2005, INTERSPEECH.

[20]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[21]  Frank J. Bernieri,et al.  Personality perception: A developmental study , 2006 .

[22]  Marilyn A. Walker,et al.  Automatic Recognition of Personality in Conversation , 2006, NAACL.

[23]  Klaus R. Scherer,et al.  Vocal communication of emotion: A review of research paradigms , 2003, Speech Commun..

[24]  S. Srivastava,et al.  The Big Five Trait taxonomy: History, measurement, and theoretical perspectives. , 1999 .

[25]  H. Bonner Language and personality. , 1961 .

[26]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[27]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[28]  Allen L. Edwards,et al.  The Relationship Between the Judged Desirability of a Trait and the Probability That the Trait Will Be Endorsed , 1953 .

[29]  Douglas Biber,et al.  Variation across speech and writing: Methodology , 1988 .

[30]  Adrian Furnham,et al.  Personality, needs, social skills and academic achievement: A longitudinal study. , 1991 .

[31]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[32]  Pierre-Yves Oudeyer,et al.  Novel Useful Features and Algorithms for the Recognition of Emotions in Human Speech , 2002 .

[33]  A. Furnham,et al.  Personality, learning style and work performance , 1999 .

[34]  Steven J. Karau,et al.  THE RELATIONSHIP BETWEEN THE BIG FIVE PERSONALITY TRAITS AND ACADEMIC MOTIVATION , 2005 .

[35]  T. Judge,et al.  Personality and transformational and transactional leadership: a meta-analysis. , 2004, The Journal of applied psychology.

[36]  P. Borkenau,et al.  Deception and deception detection: the role of cross-modal inconsistency. , 1998, Journal of personality.

[37]  Marilyn A. Walker,et al.  Mixed Initiative in Dialogue: An Investigation into Discourse Segmentation , 1990, ACL.

[38]  Claire Cardie,et al.  Identifying Expressions of Opinion in Context , 2007, IJCAI.

[39]  G. Āllport,et al.  Trait-names: A psycho-lexical study. , 1936 .

[40]  D. Watson,et al.  On traits and temperament: general and specific factors of emotional experience and their relation to the five-factor model. , 1992, Journal of personality.

[41]  Andreas Stolcke,et al.  Combining Prosodic Lexical and Cepstral Systems for Deceptive Speech Detection , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[42]  H. Eysenck Dimensions of personality : 16, 5 or 3? ― Criteria for a taxonomic paradigm , 1991 .

[43]  A. Tellegen,et al.  An alternative "description of personality": the big-five factor structure. , 1990, Journal of personality and social psychology.

[44]  Thore Graepel,et al.  Large Margin Rank Boundaries for Ordinal Regression , 2000 .

[45]  Alastair J. Gill,et al.  Language With Character: A Stratified Corpus Comparison of Individual Differences in E-Mail Communication , 2006 .

[46]  D. Funder On the accuracy of personality judgment: a realistic approach. , 1995, Psychological review.

[47]  Michael Wilson MRC Psycholinguistic Database , 2001 .

[48]  E. Brunswik Perception and the Representative Design of Psychological Experiments , 1957 .

[49]  Julia Hirschberg,et al.  Classifying subject ratings of emotional speech using acoustic features , 2003, INTERSPEECH.

[50]  Ning Wang,et al.  The Politeness Effect: Pedagogical Agents and Learning Gains , 2005, AIED.

[51]  Pat Langley,et al.  A Personalized System for Conversational Recommendations , 2011, J. Artif. Intell. Res..

[52]  J Hogan,et al.  What we know about leadership. Effectiveness and personality. , 1994, The American psychologist.

[53]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[54]  D N Jackson,et al.  What is beyond the big five? Plenty! , 2000, Journal of personality.

[55]  Jean-Marc Dewaele,et al.  Variation in the Contextuality of Language: An Empirical Measure , 2002 .

[56]  Andreas Stolcke,et al.  Distinguishing deceptive from non-deceptive speech , 2005, INTERSPEECH.

[57]  Alastair J. Gill,et al.  Perception of e-mail personality at zero-acquaintance: Extraversion takes care of itself; Neuroticism is a worry , 2003 .

[58]  D. Byrne,et al.  ATTRACTION AS A LINEAR FUNCTION OF PROPORTION OF POSITIVE REINFORCEMENTS. , 1965, Journal of personality and social psychology.

[59]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[60]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[61]  Janyce Wiebe,et al.  Learning Subjective Language , 2004, CL.

[62]  Steve Whittaker,et al.  Accessing Multimodal Meeting Data: Systems, Problems and Possibilities , 2004, MLMI.

[63]  A. Furnham,et al.  Extraversion: The Unloved Variable in Applied Linguistic Research , 1999 .

[64]  Marilyn A. Walker,et al.  Improvising linguistic style: social and affective bases for agent personality , 1997, AGENTS '97.

[65]  Justine Cassell,et al.  Negotiated Collusion: Modeling Social Language and its Relationship Effects in Intelligent Agents , 2003, User Modeling and User-Adapted Interaction.

[66]  D. Funder,et al.  Personality as manifest in word use: correlations with self-report, acquaintance report, and behavior. , 2008, Journal of personality and social psychology.

[67]  J. Pennebaker,et al.  The Electronically Activated Recorder (EAR): A device for sampling naturalistic daily activities and conversations , 2001, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[68]  Lewis R. Goldberg,et al.  Some determinants of factor structures from personality-trait descriptors. , 1989 .

[69]  Ingrid Zukerman,et al.  Natural Language Processing and User Modeling: Synergies and Limitations , 2001, User Modeling and User-Adapted Interaction.

[70]  M. Brent Donnellan,et al.  The Big Five and enduring marriages , 2004 .

[71]  L. R. Goldberg,et al.  Some determinants of factor structures from personality-trait descriptors. , 1989, Journal of personality and social psychology.

[72]  D. Funder,et al.  Behavioral manifestations of personality: an ecological approach to judgmental accuracy. , 1993, Journal of personality and social psychology.

[73]  J. P. Rushton,et al.  Combining trait consistency and learning specificity approaches to personality, with illustrative data on faculty teaching performance , 1987 .

[74]  S. Gosling,et al.  Personality in its natural habitat: manifestations and implicit folk theories of personality in daily life. , 2006, Journal of personality and social psychology.

[75]  E. B. Mallory,et al.  A possible basis for the association of voice characteristics and personality traits , 1958 .

[76]  R Hogan,et al.  A socioanalytic theory of personality. , 1983, Nebraska Symposium on Motivation. Nebraska Symposium on Motivation.

[77]  Ian Witten,et al.  Data Mining , 2000 .

[78]  Jennifer Chu-Carroll,et al.  A Plan-Based Model for Response Generation in Collaborative Task-Oriented Dialogues , 1994, AAAI.

[79]  Ronald E. Riggio,et al.  Personality and deception ability , 1988 .

[80]  James C. Lester,et al.  Achieving Affective Impact: Visual Emotive Communication in Lifelike Pedagogical Agents , 1999 .

[81]  Sam Nunn Preventing the Next Terrorist Attack: The Theory and Practice of Homeland Security Information Systems , 2005 .

[82]  C. Nass,et al.  Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction. , 2001, Journal of experimental psychology. Applied.

[83]  Julia Hirschberg,et al.  Personality factors in human deception detection: comparing human to machine performance , 2006, INTERSPEECH.

[84]  M. Walker,et al.  Words Mark the Nerds: Computational Models of Personality Recognition through Language , 2006 .

[85]  Ehud Reiter,et al.  Contextual Influences on Near-Synonym Choice , 2004, INLG.

[86]  Alastair J. Gill Personality and language: the projection and perception of personality in computer-mediated communication , 2004 .

[87]  J. Sigurdsson,et al.  Computer experience, attitudes toward computers and personality characteristics in psychology undergraduates , 1991 .

[88]  Janyce Wiebe,et al.  Just How Mad Are You? Finding Strong and Weak Opinion Clauses , 2004, AAAI.