A model for incremental grounding in spoken dialogue systems

We present a computational model of incremental grounding, including state updates and action selection. The model is inspired by corpus-based examples of overlapping utterances of several sorts, including backchannels and completions. The model has also been partially implemented within a virtual human system that includes incremental understanding, and can be used to track grounding and provide overlapping verbal and non-verbal behaviors from a listener, before a speaker has completed her utterance.

[1]  Stacy Marsella,et al.  A Virtual Human Dialogue Model for Non-Team Interaction , 2008 .

[2]  D. Traum,et al.  Coding Discourse Structure in Dialogue (Version 1.0) , 1999 .

[3]  Lenhart K. Schubert,et al.  Knowledge Representation in the TRAINS-93 Conversation System , 1996 .

[4]  Stefan Kopp,et al.  The analysis of embodied communicative feedback in multimodal corpora: a prerequisite for behavior simulation , 2007, Lang. Resour. Evaluation.

[5]  付伶俐 打磨Using Language,倡导新理念 , 2014 .

[6]  David R. Traum,et al.  Multi-party, Multi-issue, Multi-strategy Negotiation for Multi-modal Virtual Agents , 2008, IVA.

[7]  Stefan Kopp,et al.  Middleware for Incremental Processing in Conversational Agents , 2010, SIGDIAL Conference.

[8]  Nigel Ward,et al.  A Responsive Dialog System , 1999 .

[9]  Stefan Kopp,et al.  Modeling Embodied Feedback with Virtual Humans , 2006, ZiF Workshop.

[10]  David R. Traum,et al.  Conversational Actions and Discourse Situations , 1997, Comput. Intell..

[11]  Gabriel Skantze,et al.  A General, Abstract Model of Incremental Dialogue Processing , 2011 .

[12]  David R Traum,et al.  Towards a Computational Theory of Grounding in Natural Language Conversation , 1991 .

[13]  Stephen T. Wu,et al.  A Framework for Fast Incremental Interpretation during Speech Decoding , 2009, Computational Linguistics.

[14]  David R. Traum,et al.  Negotiation over tasks in hybrid human-agent teams for simulation-based training , 2003, AAMAS '03.

[15]  Sarah Brown-Schmidt,et al.  Language processing in the natural world , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[16]  David DeVault,et al.  Detecting the Status of a Predictive Incremental Speech Understanding Model for Real-Time Decision-Making in a Spoken Dialogue System , 2011, INTERSPEECH.

[17]  David DeVault,et al.  Can I Finish? Learning When to Respond to Incremental Interpretation Results in Interactive Dialogue , 2009, SIGDIAL Conference.

[18]  David DeVault,et al.  Toward Rapid Development of Multi-Party Virtual Human Negotiation Scenarios , 2011 .

[19]  Herbert H. Clark,et al.  Contributing to Discourse , 1989, Cogn. Sci..

[20]  David R. Traum,et al.  Modelling Grounding and Discourse Obligations Using Update Rules , 2000, ANLP.

[21]  David Schlangen,et al.  Collaborating on Utterances with a Spoken Dialogue System Using an ISU-based Approach to Incremental Dialogue Management , 2010, SIGDIAL Conference.

[22]  Gabriel Skantze,et al.  Incremental Dialogue Processing in a Micro-Domain , 2009, EACL.

[23]  Louis-Philippe Morency,et al.  Integration of Visual Perception in Dialogue Understanding for Virtual Humans in Multi-Party interaction , 2010, AAMAS 2010.

[24]  Stacy Marsella,et al.  Virtual Rapport , 2006, IVA.

[25]  Philip R. Cohen,et al.  Discourse structure and performance efficiency in interactive and non-interactive spoken modalities☆ , 1991 .

[26]  Gabriel Skantze,et al.  Towards Incremental Speech Generation in Dialogue Systems , 2010, SIGDIAL Conference.

[27]  David Traum,et al.  Semantics and Pragmatics of Questions and Answers for Dialogue Agents , 2003 .

[28]  Anton Leuski,et al.  All Together Now - Introducing the Virtual Human Toolkit , 2013, IVA.

[29]  Louis-Philippe Morency,et al.  Virtual Rapport 2.0 , 2011, IVA.

[30]  Louis-Philippe Morency,et al.  A probabilistic multimodal approach for predicting listener backchannels , 2009, Autonomous Agents and Multi-Agent Systems.

[31]  David DeVault,et al.  Incremental Dialogue Understanding and Feedback for Multiparty, Multimodal Conversation , 2012, IVA.

[32]  David DeVault,et al.  A method for the approximation of incremental understanding of explicit utterance meaning using predictive models in finite domains , 2013, HLT-NAACL.

[33]  David R. Traum,et al.  Degrees of Grounding Based on Evidence of Understanding , 2008, SIGDIAL Workshop.

[34]  Stacy Marsella,et al.  Towards More Comprehensive Listening Behavior: Beyond the Bobble Head , 2011, IVA.

[35]  Jean Carletta,et al.  Unleashing the killer corpus: experiences in creating the multi-everything AMI Meeting Corpus , 2007, Lang. Resour. Evaluation.

[36]  David Milward Dynamics, Dependency Grammar And Incremental Interpretation , 1992, COLING.

[37]  Jason D. Williams,et al.  Stability and Accuracy in Incremental Speech Recognition , 2011, SIGDIAL Conference.

[38]  David Traum,et al.  Dialogue management in spoken dialogue systems with degrees of grounding , 2009 .

[39]  Christian Husodo-Schulz,et al.  Exploring Features and Classifiers for Dialogue Act Segmentation , 2008, MLMI.

[40]  David DeVault,et al.  Incremental interpretation and prediction of utterance meaning for interactive dialogue , 2011, Dialogue Discourse.

[41]  Eric Horvitz,et al.  Learning to Predict Engagement with a Spoken Dialog System in Open-World Settings , 2009, SIGDIAL Conference.