/nailon/ - Software for Online Analysis of Prosody

This paper presents /nailon/ – a software package for online real-time prosodic analysis that captures a number of prosodic features relevant for interaction control in spoken dialogue systems. The current implementation captures silence durations; voicing, intensity, and pitch; pseudo-syllable durations; and intonation patterns. The paper provides detailed information on how this is achieved. As an example application of /nailon/, we demonstrate how it is used to improve the efficiency of identifying relevant places at which a machine can legitimately begin to talk to a human interlocutor, as well as to shorten system response times.

Index Terms: automatic extraction of prosodic features, dialogue systems, interaction control
