Automatic Dialog Acts Recognition Based on Sentence Structure

This paper deals with automatic dialog acts (DAs) recognition in Czech. Our work focuses on two applications: a multimodal reservation system and an animated talking head for hearing-impaired people. In that context, we consider the following DAs: statements, orders, investigation questions and other questions. The main goal of this paper is to propose, implement and evaluate new approaches to automatic DAs recognition based on sentence structure and prosody. Our system is tested on a Czech corpus that simulates a task of train tickets reservation. With lexical-only information, the classification accuracy is 91%. We proposed two methods to include sentence structure information, which respectively give 94% and 95%. When prosodic information is further considered, the recognition accuracy reaches 96%

[1]  E. Maier,et al.  Dialogue Acts in VERBMOBIL , 1995 .

[2]  A. van den Bosch,et al.  Finding Classes of Dialogue Utterances with Kohonen Networks , 1997 .

[3]  Norbert Reithinger,et al.  Utilizing Statistical Dialogue Act Processing in Verbrnobil , 1995, ACL.

[4]  Jeff A. Bilmes,et al.  Factored Language Models and Generalized Parallel Backoff , 2003, NAACL.

[5]  Pavel Král,et al.  Combination of classifiers for automatic recognition of dialog acts , 2005, INTERSPEECH.

[6]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[7]  Lori Lamel,et al.  Automatic detection of dialog acts based on multilevel information , 2004, INTERSPEECH.

[8]  Andreas Stolcke,et al.  Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? , 1998, Language and speech.

[9]  JurafskyDaniel,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000 .

[10]  Elmar Nöth,et al.  Automatic classification of dialog acts with semantic classification trees and polygrams , 1995, Learning for Natural Language Processing.

[11]  Andrei Popescu-Belis,et al.  Multi-level Dialogue Act Tags , 2004, SIGDIAL Workshop.

[12]  Volker Strom,et al.  Detection of accents, phrase boundaries and sentence modality in German with prosodic features , 1995, EUROSPEECH.

[13]  Jeff A. Bilmes,et al.  Dialog act tagging using graphical models , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[14]  A. Stolcke,et al.  Automatic detection of discourse structure for speech recognition and understanding , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[15]  Anton Nijholt,et al.  Dialogue Act Recognition with Bayesian Networks for Dutch Dialogues , 2002, SIGDIAL Workshop.

[16]  James F. Allen,et al.  Draft of DAMSL Dialog Act Markup in Several Layers , 2007 .