Computational models of social and emotional turn-taking for embodied conversational agents: a review

The emotional involvement of participants in a conversation not only shows in the words they speak and in the way they speak and gesture but also in their turn-taking behavior. This paper reviews research into computational models of embodied conversational agents. We focus on models for turn-taking management and (social) emotions. We are particularly interested in how in these models emotions of the agent itself and those of the others in uence the agent's turn-taking behavior and vice versa how turn-taking behavior of the partner is perceived by the agent itself. The system of turn-taking rules presented by Sacks, Schegloff and Jefferson (1974) is often a starting point for computational turn-taking models of conversational agents. But emotions have their own rules besides the "one-at-a-time" paradigm of the SSJ system. It turns out that almost without exception computational models of turn-taking behavior that allow "continuous interaction" and "natural turntaking" do not model the underlying psychological, affective, attentional and cognitive processes. They are restricted to rules in terms of a number of supercially observable cues. On the other hand computational models for virtual humans that are based on a functional theory of social emotion do not contain explicit rules on how social emotions affect turn-taking behavior or how the emotional state of the agent is affected by turn-taking behavior of its interlocutors. We conclude with some preliminary ideas on what an architecture for emotional turn-taking should look like and we discuss the challenges in building believable emotional turn-taking agents.

[1]  David Harel,et al.  Statecharts: A Visual Formalism for Complex Systems , 1987, Sci. Comput. Program..

[2]  Andrew Ortony,et al.  The Cognitive Structure of Emotions , 1988 .

[3]  Elisabetta Bevacqua,et al.  Engagement Capabilities for ECAs , 2005 .

[4]  Anne Berry,et al.  Spanish and American Turn-Taking Styles: A Comparative Study. , 1994 .

[5]  Matthias Scheutz,et al.  The Architectural Basis of Affective States and Processes , 2005, Who Needs Emotions?.

[6]  David R. Traum,et al.  Fight, Flight, or Negotiate: Believable Strategies for Conversing Under Crisis , 2005, IVA.

[7]  Dirk Heylen,et al.  ON THE NATURE OF ENGINEERING SOCIAL ARTIFICIAL COMPANIONS , 2011, Appl. Artif. Intell..

[8]  Dirk Heylen,et al.  Turn Management or Impression Management? , 2009, IVA.

[9]  H. Simon,et al.  Motivational and emotional controls of cognition. , 1967, Psychological review.

[10]  Jens Allwood,et al.  An activity-based approach to pragmatics , 2000, Abduction, Belief and Context in Dialogue.

[11]  D. O’connell,et al.  Turn-taking: A critical analysis of the research tradition , 1990 .

[12]  Liisa Vilkki Politeness, face and facework: Current issues , 2006 .

[13]  Dirk Heylen,et al.  Flipper: An Information State Component for Spoken Dialogue Systems , 2011, IVA.

[14]  Michael A. Arbib,et al.  Who Needs Emotions? - The brain meets the robot , 2004, Who Needs Emotions?.

[15]  Harry Bunt,et al.  Abduction, Belief and Context in Dialogue , 2000, Natural Language Processing.

[16]  Gene H. Lerner Selecting next speaker: The context-sensitive operation of a context-free organization , 2003, Language in Society.

[17]  Rosalind W. Picard What does it mean for a computer to “ have ” emotions ? , 2001 .

[18]  H. H. Clark,et al.  Speaking while monitoring addressees for understanding , 2004 .

[19]  S. Duncan,et al.  Some Signals and Rules for Taking Speaking Turns in Conversations , 1972 .

[20]  Dirk Heylen,et al.  An Action Selection Architecture for an Emotional Agent , 2003, FLAIRS.

[21]  Y. Wilks,et al.  Book Review: Close Engagements with Artificial Companions: Key Social, Psychological, Ethical, and Design Issues edited by Yorick Wilks , 2010, CL.

[22]  C. Castelfranchi,et al.  From Automaticity to Autonomy: The Frontier of Artificial Agents , 2003 .

[23]  Fredrik Kronlid Steps towards Multi-Party Dialogue Management , 2008 .

[24]  Catherine Pelachaud,et al.  Emotion-Oriented Systems , 2011 .

[25]  Eric Horvitz,et al.  Computational Models for Multiparty Turn-Taking , 2010 .

[26]  C. Pelachaud,et al.  GRETA. A BELIEVABLE EMBODIED CONVERSATIONAL AGENT , 2005 .

[27]  David R. Traum,et al.  Embodied agents for multi-party dialogue in immersive virtual worlds , 2002, AAMAS '02.

[28]  付伶俐 打磨Using Language,倡导新理念 , 2014 .

[29]  Carole Edelsky Who's got the floor? , 1981, Language in Society.

[30]  Julian Togelius,et al.  Siren: Towards adaptive serious games for teaching conflict resolution , 2010 .

[31]  Siobhan Chapman Logic and Conversation , 2005 .

[32]  Kristinn R. Thórisson,et al.  The Power of a Nod and a Glance: Envelope Vs. Emotional Feedback in Animated Conversational Agents , 1999, Appl. Artif. Intell..

[33]  A. Sloman Beyond Shallow Models of Emotion , 2001 .

[34]  Stacy Marsella,et al.  EMA: A process model of appraisal dynamics , 2009, Cognitive Systems Research.

[35]  Fredrik Kronlid,et al.  Implementing the Information-State Update Approach to Dialogue Management in a Slightly Extended SCXML , 2007 .

[36]  David R. Traum,et al.  Multi-party, Multi-issue, Multi-strategy Negotiation for Multi-modal Virtual Agents , 2008, IVA.

[37]  Stacy Marsella,et al.  Towards More Comprehensive Listening Behavior: Beyond the Bobble Head , 2011, IVA.

[38]  P. Kay,et al.  Universals and cultural variation in turn-taking in conversation , 2009, Proceedings of the National Academy of Sciences.

[39]  G. Beattie Interruption in conversational interaction, and its relation to the sex and status of the interactants* , 1981 .

[40]  J. Stainer,et al.  The Emotions , 1922, Nature.

[41]  Joseph E LeDoux The Emotional Brain: The Mysterious Underpinnings of Emotional Life , 1996 .

[42]  David G. Novick,et al.  Coordinating turn-taking with gaze , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[43]  Patrick Olivier,et al.  Exploring Persuasive Potential of Embodied Conversational Agents Utilizing Synthetic Embodied Conversational Agents , 2007, PERSUASIVE.

[44]  Kristinn R. Thórisson,et al.  A Multiparty Multimodal Architecture for Realtime Turntaking , 2010, IVA.

[45]  B. Parkinson,et al.  Emotions in social interactions: Unfolding emotional experience , 2011 .

[46]  Renaud Blanch Facilitating post-WIMP Interaction Programming using the Hierarchical State Machine Toolkit , 2005 .

[47]  S. Paradiso The Emotional Brain: The Mysterious Underpinnings of Emotional Life , 1998 .

[48]  Stefan Kopp,et al.  Why emotions should be integrated into conversational agents , 2007 .

[49]  Mitsuru Ishizuka,et al.  SCREAM: scripting emotion-based agent minds , 2002, AAMAS '02.

[50]  Thomas Rist,et al.  Adding the Emotional Dimension to Scripting Character Dialogues , 2003, IVA.

[51]  Timothy W. Bickmore,et al.  Establishing and maintaining long-term human-computer relationships , 2005, TCHI.

[52]  Mitsuru Ishizuka,et al.  Social role awareness in animated agents , 2001, AGENTS '01.

[53]  Maxine Eskénazi,et al.  A multi-layer architecture for semi-synchronous event-driven dialogue management , 2007, 2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU).

[54]  Fredrik Kronlid,et al.  Turn Taking for Artificial Conversational Agents , 2006, CIA.

[55]  Eric Horvitz,et al.  Multiparty Turn Taking in Situated Dialog: Study, Lessons, and Directions , 2011, SIGDIAL Conference.

[56]  Anne Cutler,et al.  Why is Mrs Thatcher interrupted so often? , 1982, Nature.

[57]  V. Yngve On getting a word in edgewise , 1970 .

[58]  Mark ter Maat,et al.  Response Selection and Turn-taking for a Sensitive Artificial Listening Agent , 2011 .

[59]  Dirk Heylen,et al.  Emotional Characters for Automatic Plot Creation , 2004, TIDSE.

[60]  Dennis Reidsma,et al.  Continuous interaction with a virtual human , 2011, Journal on Multimodal User Interfaces.

[61]  P. Lang Behavioral treatment and bio-behavioral assessment: computer applications , 1980 .

[62]  R. Hayashi,et al.  Floor structure of English and Japanese conversation , 1991 .

[63]  J. Oberlander,et al.  Abduction, Belief and Context in Dialogue , 2000 .

[64]  Jonathan Klein,et al.  Computers that recognise and respond to user emotion: theoretical and practical implications , 2002, Interact. Comput..

[65]  Harry T. Reis,et al.  The effects of interruption, gender, and status on interpersonal perceptions , 1989 .

[66]  C. Pelachaud,et al.  Generating Listening Behaviour , 2011 .

[67]  Oliver Lemon,et al.  DIPPER: Description and Formalisation of an Information-State Update Dialogue System Architecture , 2003, SIGDIAL Workshop.

[68]  E. Schegloff Overlapping talk and the organization of turn-taking for conversation , 2000, Language in Society.

[69]  J. Goldberg Interrupting the discourse on interruptions , 1990 .

[70]  E. Schegloff Sequencing in Conversational Openings , 1968 .

[71]  E. Goffman Interaction Ritual: Essays on Face-To-Face Behavior , 1967 .

[72]  J. Eccles The emotional brain. , 1980, Bulletin et memoires de l'Academie royale de medecine de Belgique.

[73]  C. Pelachaud,et al.  Emotion-Oriented Systems: The Humaine Handbook , 2011 .

[74]  Rieks op den Akker,et al.  The organisation of floor in meetings and the relation with speaker addressee patterns , 2010, SSPW '10.

[75]  Louis ten Bosch,et al.  On temporal aspects of turn taking in conversational dialogues , 2005, Speech Commun..

[76]  Thomas Rist,et al.  CrossTalk: An Interactive Installation with Animated Presentation Agents , 2002 .

[77]  Rieks op den Akker,et al.  Natural interaction with a virtual guide in a virtual environment , 2010, Journal on Multimodal User Interfaces.

[78]  Joseph Bates,et al.  The role of emotion in believable agents , 1994, CACM.

[79]  Kaius Sinnemäki,et al.  A man of measure : Festschrift in Honour of Fred Karlsson on His 60th Birthday , 2006 .

[80]  S. Cowley Of Timing, Turn-Taking, and Conversations , 1998 .

[81]  E. Schegloff,et al.  A simplest systematics for the organization of turn-taking for conversation , 1974 .

[82]  Maxine Eskénazi,et al.  A Finite-State Turn-Taking Model for Spoken Dialog Systems , 2009, NAACL.

[83]  Dirk Heylen,et al.  Feedback Loops in Communication and Human Computing , 2007, Artifical Intelligence for Human Computing.

[84]  Erik Hollnagel Is affective computing an oxymoron? , 2003, Int. J. Hum. Comput. Stud..

[85]  Frank Dignum,et al.  A Generic Architecture for a Companion Robot , 2018, ICINCO-RA.

[86]  Penelope Brown,et al.  Politeness: Some Universals in Language Usage , 1989 .

[87]  B. Granström,et al.  NATURAL TURN-TAKING NEEDS NO MANUAL : COMPUTATIONAL THEORY AND MODEL , FROM PERCEPTION TO ACTION , 2002 .

[88]  M. Bradley,et al.  Measuring emotion: the Self-Assessment Manikin and the Semantic Differential. , 1994, Journal of behavior therapy and experimental psychiatry.