Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access

Most prior work on task-oriented dialogue systems is restricted to a limited coverage of domain APIs, while users often have domain-related requests that are not covered by the APIs. In this paper, we propose to expand the coverage of task-oriented dialogue systems by incorporating external unstructured knowledge sources. We define three sub-tasks: knowledge-seeking turn detection, knowledge selection, and knowledge-grounded response generation, which can be modeled individually or jointly. We introduce an augmented version of MultiWOZ 2.1, which includes new out-of-API-coverage turns and responses grounded on external knowledge sources. We present baselines for each sub-task using both conventional and neural approaches. Our experimental results demonstrate the need for further research in this direction to enable more informative conversational systems.

Dilek Z. Hakkani-Tür | Yang Liu | Seokhwan Kim | Mihail Eric | Karthik Gopalakrishnan | Behnam Hedayatnia

[1] Quoc V. Le,et al. ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators , 2020, ICLR.

[2] Xiaodong Liu,et al. Unified Language Model Pre-training for Natural Language Understanding and Generation , 2019, NeurIPS.

[3] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.

[4] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[5] Gokhan Tur,et al. Spoken Language Understanding: Systems for Extracting Semantic Information from Speech , 2011 .

[6] Dilek Z. Hakkani-Tür,et al. Topical-Chat: Towards Knowledge-Grounded Open-Domain Conversations , 2019, INTERSPEECH.

[7] Yang Feng,et al. Knowledge Diffusion for Neural Dialogue Generation , 2018, ACL.

[8] Christopher D. Manning,et al. Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[9] Eunsol Choi,et al. QuAC: Question Answering in Context , 2018, EMNLP.

[10] Quoc V. Le,et al. A Neural Conversational Model , 2015, ArXiv.

[11] Parma Nand,et al. GENERATION : A SURVEY AND CLASSIFICATION OF THE EMPIRICAL LITERATURE , 2017 .

[12] Thomas Wolf,et al. TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents , 2019, ArXiv.

[13] Jian Zhang,et al. SQuAD: 100,000+ Questions for Machine Comprehension of Text , 2016, EMNLP.

[14] Yue Zhao,et al. PyOD: A Python Toolbox for Scalable Outlier Detection , 2019, J. Mach. Learn. Res..

[15] Jason Weston,et al. Wizard of Wikipedia: Knowledge-Powered Conversational agents , 2018, ICLR.

[16] Xiaoyan Zhu,et al. Commonsense Knowledge Aware Conversation Generation with Graph Attention , 2018, IJCAI.

[17] S. Singh,et al. Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System , 2011, J. Artif. Intell. Res..

[18] Matthew Richardson,et al. MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text , 2013, EMNLP.

[19] Joelle Pineau,et al. A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues , 2016, AAAI.

[20] Hans-Peter Kriegel,et al. LOF: identifying density-based local outliers , 2000, SIGMOD 2000.

[21] Ming-Wei Chang,et al. A Knowledge-Grounded Neural Conversation Model , 2017, AAAI.

[22] Omer Levy,et al. RoBERTa: A Robustly Optimized BERT Pretraining Approach , 2019, ArXiv.

[23] Vaibhava Goel,et al. Minimum Bayes-risk automatic speech recognition , 2000, Comput. Speech Lang..

[24] Roberto Pieraccini,et al. A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..

[25] Danqi Chen,et al. CoQA: A Conversational Question Answering Challenge , 2018, TACL.

[26] Hugo Zaragoza,et al. The Probabilistic Relevance Framework: BM25 and Beyond , 2009, Found. Trends Inf. Retr..

[27] Jianfeng Gao,et al. DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation , 2020, ACL.

[28] Dilek Z. Hakkani-Tür,et al. Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study , 2020, INTERSPEECH.

[29] Kyunghyun Cho,et al. Passage Re-ranking with BERT , 2019, ArXiv.

[30] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[31] Mihail Eric,et al. MultiWOZ 2. , 2019 .

[32] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[33] Dilek Z. Hakkani-Tür,et al. MultiWOZ 2.1: Multi-Domain Dialogue State Corrections and State Tracking Baselines , 2019, ArXiv.

[34] Rémi Louf,et al. HuggingFace's Transformers: State-of-the-art Natural Language Processing , 2019, ArXiv.

[35] Yejin Choi,et al. The Curious Case of Neural Text Degeneration , 2019, ICLR.

[36] Alan Ritter,et al. Data-Driven Response Generation in Social Media , 2011, EMNLP.

[37] Bill Dolan,et al. Grounded Response Generation Task at DSTC7 , 2019 .

[38] C. Spearman. The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[39] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .