Turn-Taking Cues in a Human Tutoring Corpus

Most spoken dialogue systems still cannot accurately model the complex process of human turn-taking. This research analyzes a human-human tutoring corpus to identify prosodic turn-taking cues, in the hope that intelligent tutoring systems can use them to predict student turn boundaries. Results show that, although individual subjects varied, three features were significant turn-yielding cues overall. In addition, the probability of a turn yield increased with the number of cues present.
