Collaborative Signaling of Informational Structures by Dynamic Speech Rate

The research reported in this paper is an attempt to elucidate the functions of dynamic speech rates as contextualization cues in conversational Japanese. We examine five spontaneous task-oriented dialogs conducted in Japanese and analyze the potential of speech rate changes in signaling the structure of the information being exchanged in the dialogs. A correlation is found between speech decelerations and the openings of new information, and another is found between speech accelerations and the absence of information openings. These correlations hold not only in the case of a single speaker's speech, but also in the case of multiple speakers' sequential utterances, both with and without turn shifts. On the basis of these findings, we examine the potential of dynamic speech rates as cues to information structures in dialogs, in terms of their precision, recall, and primacy. We claim that changes in the speech rate in conversational Japanese have a definite potential for cuing the structure of information collaboratively constructed by participants of a conversation.
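As an illustration of how the precision and recall of a speech-rate cue might be computed, the following is a minimal sketch under stated assumptions, not the procedure used in the paper. It assumes each unit of talk has already been annotated with two boolean labels: whether it shows a deceleration and whether it opens new information. The function name `cue_precision_recall` and the toy labels are hypothetical and are not taken from the dialogs analyzed here.

```python
def cue_precision_recall(cue_present, opening_present):
    """Return (precision, recall) of a cue as a predictor of information openings.

    cue_present[i]     -- True if unit i shows the cue (e.g. a deceleration)
    opening_present[i] -- True if unit i opens new information
    Both sequences are hypothetical annotations of the same units.
    """
    if len(cue_present) != len(opening_present):
        raise ValueError("annotation sequences must have the same length")

    # Units where the cue occurs and an opening actually occurs.
    hits = sum(1 for c, o in zip(cue_present, opening_present) if c and o)
    cue_total = sum(1 for c in cue_present if c)
    opening_total = sum(1 for o in opening_present if o)

    # Precision: of the units carrying the cue, how many are openings.
    precision = hits / cue_total if cue_total else None
    # Recall: of the openings, how many carry the cue.
    recall = hits / opening_total if opening_total else None
    return precision, recall


# Toy annotations, invented purely for illustration.
decelerated = [True, False, True, True, False, False]
opens_info = [True, False, False, True, True, False]

precision, recall = cue_precision_recall(decelerated, opens_info)
print(precision, recall)
```

In this reading, high precision would mean that a deceleration rarely occurs without an information opening, and high recall would mean that openings are rarely left unmarked by a deceleration. The same function could be applied to accelerations and the absence of openings by swapping in the corresponding labels.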
