论文信息 - AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models - 字舞流文

AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models

Attention-based pre-trained language models such as GPT-2 brought considerable progress to end-to-end dialogue modelling. However, they also present considerable risks for task-oriented dialogue, such as lack of knowledge grounding or diversity. To address these issues, we introduce modified training objectives for language model finetuning, and we employ massive data augmentation via back-translation to increase the diversity of the training data. We further examine the possibilities of combining data from multiples sources to improve performance on the target dataset. We carefully evaluate our contributions with both human and automatic methods. Our model substantially outperforms the baseline on the MultiWOZ data and shows competitive performance with state of the art in both automatic and human evaluation.

Ondvrej Duvsek | Jon'avs Kulh'anek | Vojtvech Hudevcek | Tom'avs Nekvinda | Ondrej Dusek | Jon'avs Kulh'anek | Tom'avs Nekvinda | Vojtvech Hudevcek

[1] Nitin Madnani,et al. Generating Phrasal and Sentential Paraphrases: A Survey of Data-Driven Methods , 2010, CL.

[2] Raffaella Bernardi,et al. Psycholinguistics Meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering , 2019, ACL.

[3] Arash Einolghozati,et al. Improving Robustness of Task Oriented Dialog Systems , 2019, ArXiv.

[4] Ondvrej Bojar,et al. ELITR Non-Native Speech Translation at IWSLT 2020 , 2020, IWSLT.

[5] Yejin Choi,et al. Neural AMR: Sequence-to-Sequence Models for Parsing and Generation , 2017, ACL.

[6] Nurul Lubis,et al. LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization , 2020, COLING.

[7] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.

[8] Jianfeng Gao,et al. Multi-Domain Task-Completion Dialog Challenge , 2019 .

[9] Jason Weston,et al. Don't Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training , 2020, ACL.

[10] David King,et al. Using Paraphrasing and Memory-Augmented Models to Combat Data Sparsity in Question Interpretation with a Virtual Patient Dialogue System , 2018, BEA@NAACL-HLT.

[11] Rico Sennrich,et al. Improving Neural Machine Translation Models with Monolingual Data , 2015, ACL.

[12] Ondvrej Bojar,et al. Backtranslation Feedback Improves User Confidence in MT, Not Quality , 2021, NAACL.

[13] Zhijian Ou,et al. A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning , 2020, EMNLP.

[14] Kee-Eung Kim,et al. End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2 , 2020, ACL.

[15] David Vandyke,et al. A Network-based End-to-End Trainable Task-oriented Dialogue System , 2016, EACL.

[16] Bill Byrne,et al. Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset , 2019, EMNLP.

[17] Zhijian Ou,et al. Task-Oriented Dialog Systems that Consider Multiple Appropriate Responses under the Same Context , 2019, AAAI.

[18] Jianfeng Gao,et al. ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems , 2020, ACL.

[19] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[20] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[21] Jennifer Foster,et al. Shape of synth to come: Why we should use synthetic data for English surface realization , 2020, ACL.

[22] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[23] Mihail Eric,et al. MultiWOZ 2. , 2019 .

[24] Christian Federmann,et al. Multilingual Whispers: Generating Paraphrases with Translation , 2019, W-NUT@EMNLP.

[25] Richard Socher,et al. A Simple Language Model for Task-Oriented Dialogue , 2020, NeurIPS.

[26] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[27] Christopher D. Manning,et al. Key-Value Retrieval Networks for Task-Oriented Dialogue , 2017, SIGDIAL Conference.

[28] Jianfeng Gao,et al. Neural Approaches to Conversational AI: Question Answering, Task-oriented Dialogues and Social Chatbots , 2019 .

[29] Richard Socher,et al. TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue , 2020, EMNLP.

[30] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .

[31] Wanxiang Che,et al. Dynamic Fusion Network for Multi-Domain End-to-end Task-Oriented Dialog , 2020, ACL.

[32] Yejin Choi,et al. The Curious Case of Neural Text Degeneration , 2019, ICLR.

[33] Ivan Vulić,et al. Hello, It’s GPT-2 - How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems , 2019, EMNLP.

[34] Min-Yen Kan,et al. Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures , 2018, ACL.

[35] Jianfeng Gao,et al. DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation , 2020, ACL.

[36] Frank Hutter,et al. Decoupled Weight Decay Regularization , 2017, ICLR.

[37] Myle Ott,et al. Understanding Back-Translation at Scale , 2018, EMNLP.

[38] Stefan Ultes,et al. MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling , 2018, EMNLP.

[39] Baolin Peng,et al. Soloist: Building Task Bots at Scale with Transfer Learning and Machine Teaching , 2021, Transactions of the Association for Computational Linguistics.

[40] Dilek Z. Hakkani-Tür,et al. Overview of the Ninth Dialog System Technology Challenge: DSTC9 , 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[41] Raghav Gupta,et al. Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset , 2020, AAAI.

[42] David Vandyke,et al. Stochastic Language Generation in Dialogue using Recurrent Neural Networks with Convolutional Sentence Reranking , 2015, SIGDIAL Conference.

[43] Jason Weston,et al. Neural Text Generation with Unlikelihood Training , 2019, ICLR.

[44] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.