A Review of Personality in Voice-Based Man Machine Interaction

In this paper, we will discuss state-of-the-art techniques for personality-aware user interfaces, and summarize recent work in automatically recognizing and synthesizing speech with "personality". We present an overview of personality "metrics", and show how they can be applied to the perception of voices, not only the description of personally known individuals. We present use cases for personality-aware speech input and/ or output, and discuss approaches at defining "personality" in this context. We take a middle-of-the-road approach, i.e. we will not try to uncover all fundamental aspects of personality in speech, but we'll also not aim for ad-hoc solutions that serve a single purpose, for example to create a positive attitude in a user, but do not generate transferable knowledge for other interfaces.

[1]  W. B. Arndt Theories of personality , 1974 .

[2]  L. Streeter,et al.  Effects of Pitch and Speech Rate on Personal Attributions , 1979 .

[3]  V. Drapela A Review of Personality Theories , 1987 .

[4]  P. Costa,et al.  Revised NEO Personality Inventory (NEO-PI-R) and NEO-Five-Factor Inventory (NEO-FFI) , 1992 .

[5]  L. R. Goldberg The structure of phenotypic personality traits. , 1993, The American psychologist.

[6]  B. J. Fogg,et al.  Can computer personalities be human personalities? , 1995, Int. J. Hum. Comput. Stud..

[7]  R. R. Abidin Psychological Assessment Resources , 1995 .

[8]  Alan W. Black,et al.  Unit selection in a concatenative speech synthesis system using a large speech database , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Clifford Nass,et al.  The media equation - how people treat computers, television, and new media like real people and places , 1996 .

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[11]  Ian H. Witten,et al.  Weka: Practical machine learning tools and techniques with Java implementations , 1999 .

[12]  J. Cassell,et al.  Embodied conversational agents , 2000 .

[13]  C. Nass,et al.  Does computer-synthesized speech manifest personality? Experimental tests of recognition, similarity-attraction, and consistency-attraction. , 2001, Journal of experimental psychology. Applied.

[14]  E. Vesterinen,et al.  Affective Computing , 2009, Encyclopedia of Biometrics.

[15]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[16]  Raimo Bakis,et al.  Multilayered extensions to the speech synthesis markup language for describing expressiveness , 2003, INTERSPEECH.

[17]  Alastair J. Gill,et al.  Individual differences and implicit language: personality, parts-of-speech and pervasiveness , 2004 .

[18]  J. Cassell,et al.  Social Dialongue with Embodied Conversational Agents , 2005 .

[19]  Jason W. Osborne,et al.  Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. , 2005 .

[20]  Alastair J. Gill,et al.  Level of representation and semantic distance: Rating author personality from texts , 2006 .

[21]  Khalil Sima'an,et al.  Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship , 2006, Computational Linguistics.

[22]  Heiga Zen,et al.  Statistical Parametric Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[23]  Shrikanth S. Narayanan,et al.  A Statistical Approach for Modeling Prosody Features using POS Tags for Emotional Speech Synthesis , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[24]  Marilyn A. Walker,et al.  Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[25]  A. Pentland Social Signal Processing [Exploratory DSP] , 2007, IEEE Signal Processing Magazine.

[26]  Tanja Schultz,et al.  Is voice transformation a threat to speaker identification? , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27]  Maja Pantic,et al.  Social signal processing: Survey of an emerging domain , 2009, Image Vis. Comput..

[28]  Björn W. Schuller,et al.  The INTERSPEECH 2009 emotion challenge , 2009, INTERSPEECH.

[29]  Yuting Chen,et al.  Behavior and preference in minimal personality: a study on embodied conversational agents , 2010, ICMI-MLMI '10.

[30]  Marc Schröder,et al.  Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[31]  Ann K. Syrdal,et al.  Speech acts and dialog TTS , 2010, SSW.

[32]  P. Costa,et al.  NEO inventories for the NEO Personality Inventory-3 (NEO-PI-3), NEO Five-Factor Inventory-3 (NEO-FFI-3), NEO Personality Inventory-Revised (NEO PI-R) : professional manual , 2010 .

[33]  Tim Polzehl,et al.  Automatically assessing acoustic manifestations of personality in speech , 2010, 2010 IEEE Spoken Language Technology Workshop.

[34]  Tim Polzehl,et al.  Anger recognition in speech using acoustic and linguistic cues , 2011, Speech Commun..

[35]  Richard Catrambone,et al.  Anthropomorphic Agents as a User Interface Paradigm: Experimental Findings and a Framework for Research , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.