Evaluation of Different Segmentation Techniques for Dialogue Turns

In dialogue systems, it is necessary to decode the user input into semantically meaningful units. These semantical units, usually Dialogue Acts (DA), are used by the system to produce the most appropriate response. The user turns can be segmented into utterances, which are meaningful segments from the dialogue viewpoint. In this case, a single DA is associated to each utterance. Many previous works have used DA assignation models on segmented dialogue corpora, but only a few have tried to perform the segmentation and assignation at the same time. The knowledge of the segmentation of turns into utterances is not common in dialogue corpora, and knowing the quality of the segmentations provided by the models that simultaneously perform segmentation and assignation would be interesting. In this work, we evaluate the accuracy of the segmentation offered by this type of model. The evaluation is done on a Spanish dialogue system on a railway information task. The results reveal that one of these techniques provides a high quality segmentation for this corpus.

[1]  R. Morante Review of the book Current and New Directions in Discourse and Dialogue, J. van Kuppevelt & R.W. Smith, 2003 , 2004 .

[2]  Nigel Gilbert,et al.  Simulating speech systems , 1991 .

[3]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Steve J. Young,et al.  USING POMDPS FOR DIALOG MANAGEMENT , 2006, 2006 IEEE Spoken Language Technology Workshop.

[5]  Carlos D. Martínez-Hinarejos,et al.  Automatic Annotation of Dialogues Using n-Grams , 2006, TSD.

[6]  Francisco Casacuberta,et al.  Inference of finite-state transducers from regular languages , 2005, Pattern Recognit..

[7]  Lori S. Levin,et al.  Tagging of Speech Acts and Dialogue Games in Spanish Call Home , 1999 .

[8]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[9]  Shalom Lappin,et al.  Current and New Directions in Discourse and Dialogue , 2003 .

[10]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[11]  R. Granell,et al.  Acquisition and Labelling of a Spontaneous Speech Dialogue Corpus ∗ , 2005 .

[12]  Carlos D. Martínez-Hinarejos,et al.  Segmented and Unsegmented Dialogue-Act Annotation with Statistical Dialogue Models , 2006, ACL.

[13]  Camino de Vera,et al.  Segmented and unsegmented dialogue-act annotation with statistical dialogue models , 2006 .