
Soloist: Building Task Bots at Scale with Transfer Learning and Machine Teaching

Abstract: We present a new method, Soloist, that uses transfer learning and machine teaching to build task bots at scale. We parameterize classical modular task-oriented dialog systems using a Transformer-based auto-regressive language model, which subsumes different dialog modules into a single neural model. We pre-train, on heterogeneous dialog corpora, a task-grounded response generation model, which can generate dialog responses grounded in user goals and real-world knowledge for task completion. The pre-trained model can be efficiently adapted to accomplish new tasks with a handful of task-specific dialogs via machine teaching, where training samples are generated by human teachers interacting with the system. Experiments show that (i) Soloist achieves new state-of-the-art results on well-studied task-oriented dialog benchmarks, including CamRest676 and MultiWOZ; (ii) in few-shot fine-tuning settings, Soloist significantly outperforms existing methods; and (iii) the use of machine teaching substantially reduces the labeling cost of fine-tuning. The pre-trained models and code are available at https://aka.ms/soloist.

Baolin Peng | Jinchao Li | Jianfeng Gao | Shahin Shayandeh | Lars Liden | Chunyuan Li
