Prosodic cues for interaction control in spoken dialogue systems

This paper discusses the feasibility of using prosodic features for interaction control in spoken dialogue systems. It points to experimental evidence that automatically extracted prosodic features can improve the efficiency of identifying relevant places at which a machine can legitimately begin to talk to a human interlocutor, and can shorten system response times.
