PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning

To build a high-quality open-domain chatbot, we introduce the effective training process of PLATO-2 via curriculum learning, which involves two stages. In the first stage, a coarse-grained generation model is trained to learn response generation under the simplified framework of one-to-one mapping. In the second stage, a fine-grained generation model and an evaluation model are further trained to learn diverse response generation and response coherence estimation, respectively. PLATO-2 is trained on both Chinese and English data, and its effectiveness and superiority are verified through comprehensive evaluations, achieving new state-of-the-art results.
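The two-stage curriculum can be sketched roughly as follows. The snippet below is a minimal, illustrative PyTorch sketch, not PLATO-2's actual code: the toy backbone, the K-way discrete latent variable (which PLATO-2, like its predecessor PLATO, uses to model the one-to-many context-response relation), and all module and loss shapes are assumptions made for illustration.

import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, D, K = 1000, 64, 20  # toy vocab size, hidden size, number of discrete latent values

class Backbone(nn.Module):
    """Stand-in for the shared transformer backbone (bidirectional for brevity;
    the real models use UniLM-style masks so generation stays autoregressive)."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, D)
        layer = nn.TransformerEncoderLayer(D, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.lm_head = nn.Linear(D, VOCAB)

    def forward(self, tokens):
        h = self.encoder(self.embed(tokens))
        return self.lm_head(h), h

def lm_loss(logits, targets):
    return F.cross_entropy(logits.reshape(-1, VOCAB), targets.reshape(-1))

# Stage 1: coarse-grained generation, plain next-token prediction of the
# response given the context (one-to-one mapping).
def stage1_loss(model, ctx, resp):
    inp = torch.cat([ctx, resp], dim=1)
    logits, _ = model(inp[:, :-1])
    return lm_loss(logits[:, ctx.size(1) - 1:], resp)

# Stage 2a: fine-grained generation with a K-way discrete latent variable z.
# z is inferred from the (context, response) pair and prepended as an extra
# conditioning embedding, so one context can map to many responses.
class LatentGenerator(nn.Module):
    def __init__(self, backbone):
        super().__init__()
        self.backbone = backbone
        self.z_head = nn.Linear(D, K)      # posterior over latent values
        self.z_embed = nn.Embedding(K, D)

    def loss(self, ctx, resp):
        pair = torch.cat([ctx, resp], dim=1)
        _, h = self.backbone(pair)
        z = F.gumbel_softmax(self.z_head(h[:, 0]), hard=True)  # differentiable sample
        z_emb = (z @ self.z_embed.weight).unsqueeze(1)         # (B, 1, D)
        hh = self.backbone.encoder(
            torch.cat([z_emb, self.backbone.embed(pair)], dim=1))
        logits = self.backbone.lm_head(hh)
        return lm_loss(logits[:, ctx.size(1):-1], resp)        # predict resp tokens

# Stage 2b: evaluation model, a binary classifier scoring whether a response
# is coherent with its context (negatives come from randomly paired responses).
class CoherenceScorer(nn.Module):
    def __init__(self, backbone):
        super().__init__()
        self.backbone = backbone
        self.cls = nn.Linear(D, 1)

    def loss(self, ctx, resp, label):
        _, h = self.backbone(torch.cat([ctx, resp], dim=1))
        score = self.cls(h[:, 0]).squeeze(-1)
        return F.binary_cross_entropy_with_logits(score, label)

Stage 1 provides a well-trained initialization for the stage-2 models; at inference time, the fine-grained generator can sample multiple candidate responses under different latent values, with the evaluation model selecting the most coherent one.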
