GALATEA: A Discourse Modeller Supporting Concept-Level Error Handling in Spoken Dialogue Systems

In this chapter, a discourse modeller for conversational spoken dialogue systems, called GALATEA, is presented. Apart from handling the resolution of ellipses and anaphora, it tracks the “grounding status” of concepts that are mentioned during the discourse, i.e., information about who said what when. This grounding information also contains concept confidence scores that are derived from the speech recogniser word confidence scores. The discourse model may then be used for concept-level error handling, i.e., grounding of concepts, fragmentary clarification requests, and detection of erroneous concepts in the model at later stages in the dialogue. An evaluation of GALATEA, used in a complete spoken dialogue system with naive users, is also presented.

[1]  Jaime G. Carbonell,et al.  Discourse Pragmatics and Ellipsis Resolution in Task-Oriented Natural Language Interfaces , 1983, ACL.

[2]  David Schlangen,et al.  Causes and Strategies for Requesting Clarification in Dialogue , 2004, SIGDIAL Workshop.

[3]  Manny Rayner,et al.  Comparing grammar-based and robust approaches to speech understanding: a case study , 2001, INTERSPEECH.

[4]  Joakim Nivre,et al.  On the Semantics and Pragmatics of Linguistic Feedback , 1992, J. Semant..

[5]  Eric K. Ringger,et al.  Robust Understanding in a Dialogue System , 1996 .

[6]  Staffan Larsson,et al.  Information state and dialogue management in the TRINDI dialogue move engine toolkit , 2000, Natural Language Engineering.

[7]  Jonathan Ginzburg,et al.  On the Means for Clarification in Dialogue , 2001, SIGDIAL Workshop.

[8]  Malte Gabsdil,et al.  Clarification in Spoken Dialogue Systems , 2003 .

[9]  Staffan Larsson Interactive Communication Management in an Issue-based Dialogue System , 2005 .

[10]  Diane Horton,et al.  Repairing conversational misunderstandings and non-understandings , 1994, Speech Communication.

[11]  Emiel Krahmer,et al.  Error Detection in Spoken Human-Machine Interaction , 2001, Int. J. Speech Technol..

[12]  Jens Edlund,et al.  Robust interpretation in the Higgins spoken dialogue system , 2004 .

[13]  E. Schegloff Repair After Next Turn: The Last Structurally Provided Defense of Intersubjectivity in Conversation , 1992, American Journal of Sociology.

[14]  Michael F. McTear,et al.  Handling errors and determining confirmation strategies - An object-based approach , 2003, Speech Commun..

[15]  Jens Edlund,et al.  Early error detection on word level , 2004 .

[16]  Scott McGlashan,et al.  Units of dialogue management: an example , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[17]  Wolfgang Wahlster,et al.  A resource-adaptive mobile navigation system , 2002, IUI '02.

[18]  David R Traum,et al.  Towards a Computational Theory of Grounding in Natural Language Conversation , 1991 .

[19]  James F. Allen,et al.  An architecture for more realistic conversational systems , 2001, IUI '01.

[20]  Jens Edlund,et al.  Higgins - a spoken dialogue system for investigating error handling techniques , 2004, INTERSPEECH.

[21]  Staffan Larsson,et al.  Issue-based Dialogue Management , 2002 .

[22]  Verena Rieser Confidence-Based Fragmentary Clarifications on Several Levels for Robust Dialogue Systems , 2004 .

[23]  Gabriel Skantze,et al.  Exploring human error recovery strategies: Implications for spoken dialogue systems , 2005, Speech Communication.

[24]  David Schlangen,et al.  The interpretation of non-sentential utterances in dialogue , 2003, SIGDIAL Workshop.

[25]  Jenny Brusk,et al.  Computer Assisted Conversation Training for Second Language Learners , 2007 .

[26]  Pekka Kilpeläinen,et al.  Tree Matching Problems with Applications to Structured Text Databases , 2022 .

[27]  Jens Edlund,et al.  Applications of distributed dialogue systems : the KTH Connector , 2005 .

[28]  A.I. Rudnicky,et al.  Constructing accurate beliefs in spoken dialog systems , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[29]  Johan Bos,et al.  Combining Acoustic Confidence Scores with Deep Semantic Analysis for Clarification Dialogues , 2003 .

[30]  Thierry Dutoit,et al.  The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[31]  Jens Edlund,et al.  The effects of prosodic features on the interpretation of clarification ellipses , 2005, INTERSPEECH.

[32]  Gunnar Evermann,et al.  Large vocabulary decoding and confidence estimation using word posterior probabilities , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[33]  Anne H. Anderson,et al.  The Hcrc Map Task Corpus , 1991 .

[34]  Oliver Lemon,et al.  Combining Acoustic and Pragmatic Features to Predict Recognition Performance in Spoken Dialogue Systems , 2004, ACL.