A statistical prediction model of speakers' intentions using multi-level features in a goal-oriented dialog system

A dialog system is an intelligent program that helps users easily access information stored in a knowledge base by formulating requests in their natural language. A dialog system needs an intention prediction module for use as a preprocessor to reduce the search space of an automatic speech recognizer. To satisfy these needs, we propose a statistical model to predict speakers' intentions. The proposed model represents a dialog history, with various levels of linguistic features. The proposed model predicts the user's next intention by giving the linguistic features as inputs to a statistical machine learning model. In experiments conducted in a schedule management domain, the proposed model showed a higher average precision than the previous model.

[1]  Jlfnm Fpoli,et al.  Training a Sentence Planner for Spoken Dialogue Using Boosting , 2002 .

[2]  David R. Traum,et al.  Book Reviews: Spoken Natural Language Dialogue Systems: A Practical Approach , 1996, CL.

[3]  Alon Lavie,et al.  Domain Specific Speech Acts for Spoken Language Translation , 2003, SIGDIAL Workshop.

[4]  Dirk Heylen,et al.  DIALOGUE-ACT TAGGING USING SMART FEATURE SELECTION; RESULTS ON MULTIPLE CORPORA , 2006, 2006 IEEE Spoken Language Technology Workshop.

[5]  Norbert Reithinger,et al.  Utilizing Statistical Dialogue Act Processing in Verbrnobil , 1995, ACL.

[6]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Approach to Identifying Sentence Boundaries , 1997, ANLP.

[7]  Gary Geunbae Lee,et al.  Using Utterance and Semantic Level Confidence for Interactive Spoken Dialog Clarification , 2008, J. Comput. Sci. Eng..

[8]  Norbert Reithinger,et al.  Dialogue act classification using language models , 1997, EUROSPEECH.

[9]  Jungyun Seo,et al.  Efficient Domain Action Classification Using Neural Networks , 2006, ICONIP.

[10]  James F. Allen,et al.  A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[11]  Manuel Palomar,et al.  A Maximum Entropy-based Word Sense Disambiguation System , 2002, COLING.

[12]  Jungyun Seo,et al.  A Dialogue-Based Information Retrieval Assistant Using Shallow NLP Techniques in Online Sales Domains , 2005, IEICE Trans. Inf. Syst..

[13]  Joseph Polifroni,et al.  A form-based dialogue manager for spoken language applications , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[14]  Nicholas Roy,et al.  Efficient model learning for dialog management , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[15]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[16]  Lynn Lambert,et al.  A Tripartite Plan-Based Model of Dialogue , 1991, ACL.

[17]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[18]  Mark G. Core,et al.  Coding Dialogs with the DAMSL Annotation Scheme , 1997 .

[19]  Yorick Wilks,et al.  Dialogue Act Classification Based on Intra-Utterance Features∗ , 2005 .

[20]  Gary Geunbae Lee,et al.  Recent Approaches to Dialog Management for Spoken Dialog Systems , 2010, J. Comput. Sci. Eng..

[21]  Eric Sven Ristad,et al.  Maximum Entropy Modeling Toolkit , 1996, ArXiv.

[22]  Sandra Carberry A Pragmatics-Based Approach to Ellipsis Resolution , 1989, Comput. Linguistics.