Rapport with Virtual Agents: What Do Human Social Cues and Personality Explain?

Rapport is recognized as an important aspect of relationship building. While rapport in human-human interaction has been widely studied, how it can be established and maintained in human-agent interaction has received attention only recently. Our study investigates how the social cues and personality of a human interacting with an agent can be used to automatically predict rapport in this context. We conduct experiments with two emotional virtual agents. Alongside the audio-visual data, we collect human personality measures and two measures of rapport: self-reported rapport and rapport judged by observers. Social cues, such as turn-taking patterns and facial expressions, are extracted from the audio-visual data. Our results show that the cues most significantly associated with the rapport judgments are the number of turn-taking cues and pauses. Some of the significant social cues related to rapport are similar to those reported in previous psychology literature. We also confirm previous findings that human personality plays an important role in how the interaction with agents is perceived: people who score high in extraversion and agreeableness report higher rapport with both agents. Finally, the rapport prediction results suggest that automatic analysis of social phenomena in human-agent interaction could be a feasible method for agent evaluation.
