Detection of Listener Uncertainty in Robot-Led Second Language Conversation Practice

Uncertainty is a frequently occurring affective state that learners experience during the acquisition of a second language. This state can constitute both a learning opportunity and a source of learner frustration. An appropriate detection could therefore benefit the learning process by reducing cognitive instability. In this study, we use a dyadic practice conversation between an adult second-language learner and a social robot to elicit events of uncertainty through the manipulation of the robot's spoken utterances (increased lexical complexity or prosody modifications). The characteristics of these events are then used to analyze multi-party practice conversations between a robot and two learners. Classification models are trained with multimodal features from annotated events of listener (un)certainty. We report the performance of our models on different settings, (sub)turn segments and multimodal inputs.

[1]  Gabriel Skantze Error Handling in Spoken Dialogue Systems : Managing Uncertainty, Grounding and Miscommunication , 2007 .

[2]  Louis-Philippe Morency,et al.  OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[3]  Sanne H. G. van der Ven,et al.  Social Robots for Language Learning: A Review , 2018, Review of Educational Research.

[4]  Stuart M. Shieber,et al.  Recognizing Uncertainty in Speech , 2011, EURASIP J. Adv. Signal Process..

[5]  Barbara Schuppler,et al.  Automatic detection of uncertainty in spontaneous German dialogue , 2015, INTERSPEECH.

[6]  Ryan Shaun Joazeiro de Baker,et al.  Sequences of Frustration and Confusion, and Learning , 2013, EDM.

[7]  Olov Engwall,et al.  Uncertainty in Robot Assisted Second Language Conversation Practice , 2020, HRI.

[8]  E. Schegloff,et al.  The preference for self-correction in the organization of repair in conversation , 1977 .

[9]  A. Graesser,et al.  Confusion can be beneficial for learning. , 2014 .

[10]  Gabriel Skantze,et al.  Furhat: A Back-Projected Human-Like Robot Head for Multiparty Human-Machine Interaction , 2011, COST 2102 Training School.

[11]  Colin Raffel,et al.  librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.

[12]  Dimosthenis Kontogiorgos,et al.  Estimating Uncertainty in Task-Oriented Dialogue , 2019, ICMI.

[13]  Olov Engwall,et al.  Robot Interaction Styles for Conversation Practice in Second Language Learning , 2021, Int. J. Soc. Robotics.

[14]  Xiaofei Lu The Relationship of Lexical Richness to the Quality of ESL Learners' Oral Narratives. , 2012 .

[15]  Samuel B. Williams,et al.  ASSOCIATION FOR COMPUTING MACHINERY , 2000 .

[16]  Diane Horton,et al.  Repairing conversational misunderstandings and non-understandings , 1994, Speech Communication.

[17]  Sidney K. D'Mello,et al.  It's Written on Your Face: Detecting Affective States from Facial Expressions while Learning Computer Programming , 2014, Intelligent Tutoring Systems.

[18]  M. Swerts,et al.  Audiovisual prosody and feeling of knowing , 2005 .

[19]  Catherine J. Stevens,et al.  A review of the applicability of robots in education , 2013 .

[20]  Lori Lockyer,et al.  Eye tracking and early detection of confusion in digital learning environments: proof of concept , 2016 .

[21]  Carolyn Penstein Rosé,et al.  Identification of confusion and surprise in spoken dialog using prosodic features , 2006, INTERSPEECH.

[22]  Arthur C. Graesser,et al.  Mind and Body: Dialogue and Posture for Affect Detection in Learning Environments , 2007, AIED.

[23]  Dirk Heylen,et al.  Detecting Uncertainty in Spoken Dialogues An explorative research to the automatic detection of a speakers' uncertainty by using prosodic markers , 2011 .

[24]  J. Hart,et al.  Memory and the feeling-of-knowing experience. , 1965, Journal of educational psychology.

[25]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..