Machine Learning for Shallow Interpretation of User Utterances in Spoken Dialogue Systems

We investigate to what extent automatic learning techniques can be used for shallow interpretation of user utterances in spoken dialogue systems. This task involves dialogue act classification, shallow understanding and problem detection simultaneously. For this purpose we train both a rule-induction and a memory-based learning algorithm on a large set of surface features obtained by affordable means from an annotated corpus of human-machine dialogues. Using a pseudo-exhaustive search, the parameters of both algorithms are optimized. The shallow interpretation task turns out to be a difficult one, partly since there are 94 types of user answers. The best overall accuracy (exact match) obtained was 73.5%, which is a significant improvement over the baseline. The best average precision and recall for dialogue act classification was 91.2%, for classifying slot types 86.8% and for detecting communication problems 91.0%.

[1]  Elisabeth Maier,et al.  Dialogue Processing in Spoken Language Systems , 1996, Lecture Notes in Computer Science.

[2]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[3]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[4]  Emiel Krahmer,et al.  The dual of denial: Two uses of disconfirmations in dialogue and their prosodic correlates , 2002, Speech Commun..

[5]  Mikio Nakano,et al.  Understanding Unsegmented User Utterances in Real-Time Spoken Dialogue Systems , 1999, ACL.

[6]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[7]  Khalil Sima'an,et al.  Evaluation of the NLP Components of the OVIS2 Spoken Dialogue System , 1999, ArXiv.

[8]  Jungyun Seo,et al.  Analysis System of Speech Acts and Discourse Structures Using Maximum Entropy Model , 1999, ACL.

[9]  Julia Hirschberg,et al.  Identifying User Corrections Automatically in Spoken Dialogue Systems , 2001, NAACL.

[10]  Ken Samuel,et al.  Computing Dialogue Acts from Features with Transformation-Based Learning , 1998, ArXiv.

[11]  Emiel Krahmer,et al.  Improving machine-learned detection of miscommunications in human-machine dialogues through informed data splitting , 2002 .

[12]  Emiel Krahmer,et al.  Multi-feature Error Detection in Spoken Dialogue Systems , 2001, CLIN.

[13]  Emiel Krahmer,et al.  Detecting Problematic Turns in Human-Machine Interactions: Rule-induction Versus Memory-based Learning Approaches , 2001, ACL.

[14]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[15]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[16]  Mark-Jan Nederhof,et al.  Robust grammatical analysis for spoken dialogue systems , 1999, Natural Language Engineering.

[17]  Anton Nijholt,et al.  Dialogue Act Recognition with Bayesian Networks for Dutch Dialogues , 2002, SIGDIAL Workshop.

[18]  Norbert Reithinger,et al.  Utilizing Statistical Dialogue Act Processing in Verbrnobil , 1995, ACL.