暂无分享,去创建一个
Joachim Bingel | Isabelle Augenstein | Pawel Budzianowski | Anders Søgaard | Victor Petrén Bach Hansen | Ana Valeria González-Garduño | Joachim Bingel | Anders Søgaard | Isabelle Augenstein | Paweł Budzianowski | Ana Valeria González
[1] Matthew Henderson,et al. Machine Learning for Dialog State Tracking: A Review , 2015 .
[2] Lihong Li,et al. Reinforcement learning for dialog management using least-squares Policy iteration and fast feature selection , 2009, INTERSPEECH.
[3] Ivan Vulić,et al. Fully Statistical Neural Belief Tracking , 2018, ACL.
[4] Sham M. Kakade,et al. A Natural Policy Gradient , 2001, NIPS.
[5] Zhi Chen,et al. Policy Adaptation for Deep Reinforcement Learning-Based Dialogue Management , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] David Vandyke,et al. Multi-domain Dialog State Tracking using Recurrent Neural Networks , 2015, ACL.
[7] Jing Peng,et al. Function Optimization using Connectionist Reinforcement Learning Algorithms , 1991 .
[8] Lu Chen,et al. A generalized rule based tracker for dialogue state tracking , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[9] S. Singh,et al. Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System , 2011, J. Artif. Intell. Res..
[10] Jianfeng Gao,et al. Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access , 2016, ACL.
[11] Antoine Raux,et al. The Dialog State Tracking Challenge , 2013, SIGDIAL Conference.
[12] Oliver Lemon,et al. A Simple and Generic Belief Tracking Mechanism for the Dialog State Tracking Challenge: On the believability of observed information , 2013, SIGDIAL Conference.
[13] Matthew Henderson,et al. Word-Based Dialog State Tracking with Recurrent Neural Networks , 2014, SIGDIAL Conference.
[14] Lu Chen,et al. Towards Universal Dialogue State Tracking , 2018, EMNLP.
[15] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[16] Eduardo F. Morales,et al. An Introduction to Reinforcement Learning , 2011 .
[17] Matthew Henderson,et al. The Second Dialog State Tracking Challenge , 2014, SIGDIAL Conference.
[18] Lu Chen,et al. Hybrid Dialogue State Tracking for Real World Human-to-Human Dialogues , 2016, INTERSPEECH.
[19] Maxine Eskénazi,et al. Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning , 2016, SIGDIAL Conference.
[20] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[21] Jason Williams,et al. Multi-domain learning and generalization in dialog state tracking , 2013, SIGDIAL Conference.
[22] David Vandyke,et al. Dialogue manager domain adaptation using Gaussian process reinforcement learning , 2016, Comput. Speech Lang..
[23] Daniel Marcu,et al. Domain Adaptation for Statistical Classifiers , 2006, J. Artif. Intell. Res..
[24] John Blitzer,et al. Domain Adaptation with Structural Correspondence Learning , 2006, EMNLP.
[25] Dilek Z. Hakkani-Tür,et al. Scalable multi-domain dialogue state tracking , 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[26] Dongho Kim,et al. POMDP-based dialogue manager adaptation to extended domains , 2013, SIGDIAL Conference.
[27] Dilek Z. Hakkani-Tür,et al. Dialogue Learning with Human Teaching and Feedback in End-to-End Trainable Task-Oriented Dialogue Systems , 2018, NAACL.
[28] Anna Korhonen,et al. Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints , 2017, TACL.
[29] Tsung-Hsien Wen,et al. Neural Belief Tracker: Data-Driven Dialogue State Tracking , 2016, ACL.
[30] Hannes Schulz,et al. Frames: a corpus for adding memory to goal-oriented dialogue systems , 2017, SIGDIAL Conference.
[31] Pawel Budzianowski,et al. Large-Scale Multi-Domain Belief Tracking with Knowledge Sharing , 2018, ACL.
[32] Lex Weaver,et al. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.
[33] Ehsan Hosseini-Asl,et al. Toward Scalable Neural Dialogue State Tracking Model , 2018, ArXiv.
[34] Manuela M. Veloso,et al. Rational and Convergent Learning in Stochastic Games , 2001, IJCAI.
[35] Stefan Ultes,et al. MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling , 2018, EMNLP.
[36] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[37] Marcello Restelli,et al. Adaptive Batch Size for Safe Policy Gradients , 2017, NIPS.
[38] ChengXiang Zhai,et al. Instance Weighting for Domain Adaptation in NLP , 2007, ACL.