Navigating the Combinatorics of Virtual Agent Design Space to Maximize Persuasion

Designers of virtual agents have a combinatorically large space of choices for different media that comprise the look and behavior of their characters. We explore the systematic manipulation of animation quality, speech quality, and rendering style, and its impact on the perceptions of virtual agents in terms of naturalness, engagement, trust, credibility, and persuasion within a health counseling domain. The agent’s counseling behavior was based on live video footage of a human counselor. We conducted a between-subjects study that had 12 conditions. Character animation was varied between a static image, procedural animation using a gestuary, and manually rotoscoped animation. Voice quality was varied between recorded audio of the human counselor and synthetic speech. Character rendering style was varied between 3D-shaded realistic and toon-shaded. Prior studies indicate that people prefer and attribute more sociality to other people and agents when modalities are consistent in their level of quality. Thus, we hypothesize that people will be most affected by agents whose animation, voice, and rendering style are consistent, rather than the effects of channel quality being purely additive. Results indicate that natural animations and recorded voice are more appropriate for general acceptance, trust, and credibility of the agent, and how appropriate she seems for the task. However, our results indicate that for a brief health counseling task, animation might actually be distracting from the persuasive message, with the highest levels of persuasion found when the amount of agent animation is minimized.

[1]  Lawrence W. Stark,et al.  The Effects of Pictorial Realism, Delay of Visual Feedback, and Observer Interactivity on the Subjective Sense of Presence , 1996, Presence: Teleoperators & Virtual Environments.

[2]  Tatsuya Kawahara,et al.  Recent Development of Open-Source Speech Recognition Engine Julius , 2009 .

[3]  Larry F. Hodges,et al.  Can a virtual cat persuade you?: the role of gender and realism in speaker persuasiveness , 2006, CHI.

[4]  Sabarish V. Babu,et al.  Empirical Evaluation of Virtual Human Conversational and Affective Animations on Visual Attention in Inter-Personal Simulations , 2018, 2018 IEEE Conference on Virtual Reality and 3D User Interfaces (VR).

[5]  Timothy W. Bickmore,et al.  The Right Agent for the Job? - The Effects of Agent Visual Appearance on Task Domain , 2014, IVA.

[6]  Kristinn R. Thórisson,et al.  The Power of a Nod and a Glance: Envelope Vs. Emotional Feedback in Animated Conversational Agents , 1999, Appl. Artif. Intell..

[7]  Lawrence R. Wheeless,et al.  THE MEASUREMENT OF TRUST AND ITS RELATIONSHIP TO SELF‐DISCLOSURE , 1977 .

[8]  Mark Grimshaw,et al.  Uncanny behaviour in survival horror games , 2010 .

[9]  Daniel Hirst,et al.  SPeech Phonetization Alignment and Syllabification (SPPAS): a tool for the automatic analysis of speech prosody , 2012 .

[10]  B. J. Fogg,et al.  Web credibility research: a method for online experiments and early study results , 2001, CHI Extended Abstracts.

[11]  Karl F. MacDorman,et al.  The Uncanny Valley [From the Field] , 2012, IEEE Robotics Autom. Mag..

[12]  Justine Cassell,et al.  BEAT: the Behavior Expression Animation Toolkit , 2001, Life-like characters.

[13]  Jeremy N. Bailenson,et al.  A meta-analysis of the impact of the inclusion and realism of human-like faces on user experiences in interfaces , 2007, CHI.

[14]  Jeffrey W. Bertrand,et al.  Effects of Virtual Human Animation on Emotion Contagion in Simulated Inter-Personal Experiences , 2014, IEEE Transactions on Visualization and Computer Graphics.

[15]  Hung-Hsuan Huang,et al.  Embodied Conversational Agents , 2009 .

[16]  Timothy W. Bickmore,et al.  Establishing and maintaining long-term human-computer relationships , 2005, TCHI.

[17]  Rachel McDonnell,et al.  Does render style affect perception of personality in virtual humans? , 2014, SAP.

[18]  T. Bickmore,et al.  A Randomized Controlled Trial of an Automated Exercise Coach for Older Adults , 2013, Journal of the American Geriatrics Society.

[19]  Sabarish V. Babu,et al.  Effects of Virtual Human Appearance Fidelity on Emotion Contagion in Affective Inter-Personal Simulations , 2016, IEEE Transactions on Visualization and Computer Graphics.

[20]  Steven E. Stern,et al.  The Persuasiveness of Synthetic Speech versus Human Speech , 1999, Hum. Factors.

[21]  Everlyne Kimani,et al.  A Smartphone-Based Virtual Agent for Atrial Fibrillation Education and Counseling , 2016, IVA.

[22]  L. Shams,et al.  Crossmodal influences on visual perception. , 2010, Physics of life reviews.

[23]  Mel Slater,et al.  Building Characters: Lessons Drawn from Virtual Environments , 2005 .

[24]  J. J. Higgins,et al.  The aligned rank transform for nonparametric factorial analyses using only anova procedures , 2011, CHI.

[25]  Heinrich H. Bülthoff,et al.  Render me real? , 2012, ACM Trans. Graph..

[26]  Matthias Scheutz,et al.  A mismatch in the human realism of face and voice produces an uncanny valley , 2011, i-Perception.

[27]  Heloir,et al.  The Uncanny Valley , 2019, The Animation Studies Reader.

[28]  Li Gong,et al.  Shall we mix synthetic speech and human speech?: impact on users' performance, perception, and attitude , 2001, CHI.

[29]  Mark G. Core,et al.  Learning intercultural communication skills with virtual humans: Feedback and fidelity. , 2013 .