论文信息 - Comparative Error Analysis of Dialog State Tracking

Comparative Error Analysis of Dialog State Tracking

A primary motivation of the Dialog State Tracking Challenge (DSTC) is to allow for direct comparisons between alternative approaches to dialog state tracking. While results from DSTC 1 mention performance limitations, an examination of the errors made by dialog state trackers was not discussed in depth. For the new challenge, DSTC 2, this paper describes several techniques for examining the errors made by the dialog state trackers in order to refine our understanding of the limitations of various approaches to the tracking process. The results indicate that no one approach is universally superior, and that different approaches yield different error type distributions. Furthermore, the results show that a pairwise comparative analysis of tracker performance is a useful tool for identifying dialogs where differential behavior is observed. These dialogs can provide a data source for a more careful analysis of the source of errors.

Ronnie W. Smith

[1] Antoine Raux,et al. The Dialog State Tracking Challenge , 2013, SIGDIAL Conference.

[2] Robert D. Rodman,et al. The effects of restricted vocabulary size on voice interactive discourse structure , 1988 .

[3] Yi Ma,et al. Efficient Probabilistic Tracking of User Goal and Dialog History for Spoken Dialog Systems , 2011, INTERSPEECH.

[4] Siobhan Chapman. Logic and Conversation , 2005 .

[5] Alan W. Biermann,et al. An Architecture for Voice Dialog Systems Based on Prolog-Style Theorem Proving , 1995, CL.

[6] Ronnie W. Smith,et al. An evaluation of strategies for selectively verifying utterance meanings in spoken natural language dialog , 1998, Int. J. Hum. Comput. Stud..

[7] Ronnie W. Smith,et al. Effects of Variable Initiative on Linguistic Behavior in Human-Computer Spoken Natural Language Dialogue , 1997, Comput. Linguistics.

[8] Matthew Henderson,et al. The Second Dialog State Tracking Challenge , 2014, SIGDIAL Conference.