论文信息 - STAR: A Schema-Guided Dialog Dataset for Transfer Learning - 字舞流文

STAR: A Schema-Guided Dialog Dataset for Transfer Learning

We present STAR, a schema-guided task-oriented dialog dataset consisting of 127,833 utterances and knowledge base queries across 5,820 task-oriented dialogs in 13 domains that is especially designed to facilitate task and domain transfer learning in task-oriented dialog. Furthermore, we propose a scalable crowd-sourcing paradigm to collect arbitrarily large datasets of the same quality as STAR. Moreover, we introduce novel schema-guided dialog models that use an explicit description of the task(s) to generalize from known to unknown tasks. We demonstrate the effectiveness of these models, particularly for zero-shot generalization across tasks and domains.

Shikib Mehri | Johannes E. M. Mosig | Thomas Kober | Thomas Kober | Shikib Mehri

[1] Stefan Ultes,et al. MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling , 2018, EMNLP.

[2] Wenhu Chen,et al. Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention , 2019, ACL.

[3] Bill Byrne,et al. Taskmaster-1: Toward a Realistic and Diverse Dialog Dataset , 2019, EMNLP.

[4] David Vandyke,et al. A Network-based End-to-End Trainable Task-oriented Dialogue System , 2016, EACL.

[5] Hannes Schulz,et al. Frames: a corpus for adding memory to goal-oriented dialogue systems , 2017, SIGDIAL Conference.

[6] Mary Williamson,et al. Recipes for Building an Open-Domain Chatbot , 2020, EACL.

[7] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .

[8] Maxine Eskénazi,et al. Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational Autoencoders , 2017, ACL.

[9] Maxine Eskénazi,et al. Structured Fusion Networks for Dialog , 2019, SIGdial.

[10] Vladimir Vlasov,et al. Where is the context? - A critique of recent dialogue datasets , 2020, ArXiv.

[11] Jianfeng Gao,et al. SOLOIST: Few-shot Task-Oriented Dialog with A Single Pre-trained Auto-regressive Model , 2020, ArXiv.

[12] Jason Weston,et al. ParlAI: A Dialog Research Software Platform , 2017, EMNLP.

[13] Rafael E. Banchs,et al. The Fourth Dialog State Tracking Challenge , 2016, IWSDS.

[14] Jianfeng Gao,et al. DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation , 2020, ACL.

[15] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[16] Dilek Z. Hakkani-Tür,et al. MultiWOZ 2.1: Multi-Domain Dialogue State Corrections and State Tracking Baselines , 2019, ArXiv.

[17] Alexander I. Rudnicky,et al. The RavenClaw dialog management framework: Architecture and systems , 2009, Comput. Speech Lang..

[18] Raghav Gupta,et al. Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset , 2020, AAAI.

[19] Quoc V. Le,et al. Towards a Human-like Open-Domain Chatbot , 2020, ArXiv.

[20] Mary Williamson,et al. Can You Put it All Together: Evaluating Conversational Agents’ Ability to Blend Skills , 2020, ACL.

[21] Lihong Li,et al. Neural Approaches to Conversational AI , 2019, Found. Trends Inf. Retr..

[22] Christopher D. Manning,et al. Towards Ecologically Valid Research on Language User Interfaces , 2020, ArXiv.

[23] Geoffrey Zweig,et al. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning , 2017, ACL.

[24] Anoop Cherian,et al. The Eighth Dialog System Technology Challenge , 2019, ArXiv.

[25] Kevin Crowston,et al. Amazon Mechanical Turk: A Research Tool for Organizations and Information Systems Scholars , 2012, Shaping the Future of ICT Research.

[26] J. F. Kelley,et al. An iterative design methodology for user-friendly natural language office information applications , 1984, TOIS.

[27] Vladimir Vlasov,et al. DIET: Lightweight Language Understanding for Dialogue Systems , 2020, ArXiv.

[28] A. M. Turing,et al. Computing Machinery and Intelligence , 1950, The Philosophy of Artificial Intelligence.

[29] Jason Weston,et al. Personalizing Dialogue Agents: I have a dog, do you have pets too? , 2018, ACL.

[30] Matthew Henderson,et al. ConveRT: Efficient and Accurate Conversational Representations from Transformers , 2020, EMNLP.

[31] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.