Feudal Dialogue Management with Jointly Learned Feature Extractors
暂无分享,去创建一个
Stefan Ultes | Iñigo Casanueva | Pawel Budzianowski | Milica Gasic | Yen-Chen Wu | Bo-Hsiang Tseng | Florian Kreyssig
[1] David Vandyke,et al. PyDial: A Multi-domain Statistical Dialogue System Toolkit , 2017, ACL.
[2] Geoffrey E. Hinton,et al. Feudal Reinforcement Learning , 1992, NIPS.
[3] Nando de Freitas,et al. Sample Efficient Actor-Critic with Experience Replay , 2016, ICLR.
[4] Stefan Ultes,et al. Feudal Reinforcement Learning for Dialogue Management in Large Domains , 2018, NAACL.
[5] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[6] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.
[7] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[8] Matthieu Geist,et al. Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences , 2011, IJCAI.
[9] Dongho Kim,et al. POMDP-based dialogue manager adaptation to extended domains , 2013, SIGDIAL Conference.
[10] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[11] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[12] Kallirroi Georgila,et al. Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets , 2008, CL.
[13] Pei-Hao Su,et al. Sample Efficient Deep Reinforcement Learning for Dialogue Systems With Large Action Spaces , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[14] Roberto Pieraccini,et al. Using Markov decision process for learning dialogue strategies , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[15] Yannis Stylianou,et al. Learning Domain-Independent Dialogue Policies via Ontology Parameterisation , 2015, SIGDIAL Conference.
[16] Stefan Ultes,et al. A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management , 2017, ArXiv.