Evaluation of HMM-based Models for the Annotation of Unsegmented Dialogue Turns

Corpus-based dialogue systems rely on statistical models, whose parameters are inferred from annotated dialogues. The dialogues are usually annotated in terms of Dialogue Acts (DA), and the manual annotation is difficult (as annotation rule are hard to define), error-prone and time-consuming. Therefore, several semi-automatic annotation processes have been proposed to speed-up the process and consequently obtain a dialogue system in less total time. These processes are usually based on statistical models. The standard statistical annotation model is based on Hidden Markov Models (HMM). In this work, we explore the impact of different types of HMM, with different number of states, on annotation accuracy. We performed experiments using these models on two dialogue corpora (Dihana and SwitchBoard) of dissimilar features. The results show that some types of models improve standard HMM in a human-computer task-oriented dialogue corpus (Dihana corpus), but their impact is lower in a human-human non-task-oriented dialogue corpus (SwitchBoard corpus).

[1]  Mark G. Core,et al.  Coding Dialogs with the DAMSL Annotation Scheme , 1997 .

[2]  Detlef Koll,et al.  Probabilistic dialogue act extraction for concept based multilingual translation systems , 1998, ICSLP.

[3]  Elizabeth Shriberg,et al.  Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual , 1997 .

[4]  Carlos D. Martínez-Hinarejos,et al.  Statistical framework for a Spanish spoken dialogue corpus , 2008, Speech Commun..

[5]  Nigel Gilbert,et al.  Simulating speech systems , 1991 .

[6]  R. Granell,et al.  Acquisition and Labelling of a Spontaneous Speech Dialogue Corpus ∗ , 2005 .

[7]  Laila Dybkjær,et al.  Recent trends in discourse and dialogue , 2008 .

[8]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[9]  Carlos D. Martínez-Hinarejos,et al.  Improving Unsegmented Dialogue Turns Annotation with N-gram Transducers , 2009, PACLIC.

[10]  Carlos D. Martínez-Hinarejos,et al.  Improving Unsegmented Statistical Dialogue Act Labelling , 2009, RANLP.

[11]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Elizabeth Shriberg,et al.  Automatic dialog act segmentation and classification in multiparty meetings , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[13]  Min Wu,et al.  Dialogue Management for an Automated Multilingual Call Center , 2003, HLT-NAACL 2003.

[14]  Carlos D. Martínez-Hinarejos,et al.  A Study of a Segmentation Technique for Dialogue Act Assignation (short paper) , 2009, IWCS.

[15]  Yorick Wilks,et al.  Error Analysis of Dialogue Act Classification , 2005, TSD.

[16]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[17]  Harry Bunt,et al.  Context and Dialogue Control , 1994 .

[18]  Stephen Young Probabilistic methods in spoken–dialogue systems , 2000, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.