Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial Training
Chengming Li | Ying Shen | Ruifeng Xu | Min Yang | Rui Yan | Wanwei He
[1] Hannes Schulz,et al. Relevance of Unsupervised Metrics in Task-Oriented Dialogue for Evaluating Natural Language Generation , 2017, ArXiv.
[2] David Vandyke,et al. Conditional Generation and Snapshot Learning in Neural Dialogue Systems , 2016, EMNLP.
[3] David D. Cox,et al. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures , 2013, ICML.
[4] Bowen Zhou,et al. Pointing the Unknown Words , 2016, ACL.
[5] Vaibhava Goel,et al. Self-Critical Sequence Training for Image Captioning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[6] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.
[7] Danish Contractor,et al. 2019 Formatting Instructions for Authors Using LaTeX , 2018.
[8] Yoshua Bengio,et al. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.
[9] Christopher D. Manning,et al. Key-Value Retrieval Networks for Task-Oriented Dialogue , 2017, SIGDIAL Conference.
[10] Pascale Fung,et al. Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems , 2018, ACL.
[11] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.
[12] Richard Socher,et al. Global-to-local Memory Pointer Networks for Task-Oriented Dialogue , 2019, ICLR.
[13] J. Fleiss. Measuring nominal scale agreement among many raters , 1971.
[14] Jaime G. Carbonell,et al. Discourse Pragmatics and Ellipsis Resolution in Task-Oriented Natural Language Interfaces , 1983, ACL.
[15] Min-Yen Kan,et al. Sequicity: Simplifying Task-oriented Dialogue Systems with Single Sequence-to-Sequence Architectures , 2018, ACL.
[16] Bo Xu,et al. A Working Memory Model for Task-oriented Dialog Response Generation , 2019, ACL.
[17] Nojun Kwak,et al. FEED: Feature-level Ensemble for Knowledge Distillation , 2019, ArXiv.
[18] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[19] Dacheng Tao,et al. Adversarial Learning of Portable Student Networks , 2018, AAAI.
[20] Dit-Yan Yeung,et al. Dynamic Key-Value Memory Networks for Knowledge Tracing , 2016, WWW.
[21] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.
[22] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang.
[23] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[24] Dacheng Tao,et al. Learning from Multiple Teacher Networks , 2017, KDD.
[25] Yangming Li,et al. Entity-Consistent End-to-end Task-Oriented Dialogue System with KB Retriever , 2019, EMNLP.
[26] Nikhil Gupta,et al. Disentangling Language and Knowledge in Task-Oriented Dialogs , 2018, NAACL.
[27] Jason Weston,et al. Learning End-to-End Goal-Oriented Dialog , 2016, ICLR.
[28] Christopher D. Manning,et al. A Copy-Augmented Sequence-to-Sequence Architecture Gives Good Performance on Task-Oriented Dialogue , 2017, EACL.
[29] Rich Caruana,et al. Model compression , 2006, KDD '06.
[30] Jonathan Le Roux,et al. Student-teacher network learning with enhanced features , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[31] Geoffrey E. Hinton,et al. Distilling the Knowledge in a Neural Network , 2015, ArXiv.
[32] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.