Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
暂无分享,去创建一个
Kam-Fai Wong | Sungjin Lee | Jianfeng Gao | Xiujun Li | Lihong Li | Asli Çelikyilmaz | Baolin Peng | Lihong Li | Xiujun Li | Jianfeng Gao | Kam-Fai Wong | Asli Celikyilmaz | Baolin Peng | Sungjin Lee
[1] Ralph Weischedel,et al. A STUDY OF TRANSLATION ERROR RATE WITH TARGETED HUMAN ANNOTATION , 2005 .
[2] Steve J. Young,et al. The Hidden Agenda User Simulation Model , 2009, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[4] Stefan Lee,et al. Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).
[5] Olivier Pietquin,et al. End-to-end optimization of goal-driven and visually grounded dialogue systems , 2017, IJCAI.
[6] Heriberto Cuayáhuitl,et al. SimpleDS: A Simple Deep Reinforcement Learning Dialogue System , 2016, IWSDS.
[7] Joelle Pineau,et al. How NOT To Evaluate Your Dialogue System: An Empirical Study of Unsupervised Evaluation Metrics for Dialogue Response Generation , 2016, EMNLP.
[8] Verena Rieser,et al. The E2E Dataset: New Challenges For End-to-End Generation , 2017, SIGDIAL Conference.
[9] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[10] David Vandyke,et al. Policy committee for adaptation in multi-domain spoken dialogue systems , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[11] Philipp Koehn,et al. Re-evaluating the Role of Bleu in Machine Translation Research , 2006, EACL.
[12] David Vandyke,et al. Multi-domain Neural Network Language Generation for Spoken Dialogue Systems , 2016, NAACL.
[13] Oliver Lemon,et al. Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[14] Ondrej Dusek,et al. Training a Natural Language Generator From Unaligned Data , 2015, ACL.
[15] Geoffrey Zweig,et al. Spoken language understanding using long short-term memory neural networks , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).
[16] Joshua B. Tenenbaum,et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.
[17] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[18] Anja Belz,et al. An Investigation into the Validity of Some Metrics for Automatically Evaluating Natural Language Generation Systems , 2009, CL.
[19] Gökhan Tür,et al. Multi-Domain Joint Semantic Frame Parsing Using Bi-Directional RNN-LSTM , 2016, INTERSPEECH.
[20] Konrad Scheffler,et al. Probabilistic simulation of human-machine dialogues , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).
[21] Heriberto Cuayáhuitl,et al. Hierarchical Reinforcement Learning for Spoken Dialogue Systems , 2009 .
[22] Seunghak Yu,et al. Scaling up deep reinforcement learning for multi-domain dialogue systems , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).
[23] Hannes Schulz,et al. Frames: a corpus for adding memory to goal-oriented dialogue systems , 2017, SIGDIAL Conference.
[24] Philip Bachman,et al. Natural Language Generation in Dialogue using Lexicalized and Delexicalized Data , 2016, ICLR.
[25] Min-Yen Kan,et al. Applying Natural Language Generation to Indicative Summarization , 2001, EWNLG@ACL.
[26] David Vandyke,et al. Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems , 2015, EMNLP.
[27] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[28] Ondrej Dusek,et al. A Context-aware Natural Language Generator for Dialogue Systems , 2016, SIGDIAL Conference.
[29] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[30] Roberto Pieraccini,et al. A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..
[31] Hui Ye,et al. Agenda-Based User Simulation for Bootstrapping a POMDP Dialogue System , 2007, NAACL.
[32] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[33] Rudolf Franz Flesch,et al. How to write plain English : a book for lawyers and consumers , 1979 .
[34] Frank Keller,et al. Comparing Automatic Evaluation Measures for Image Description , 2014, ACL.
[35] Ondrej Dusek,et al. Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings , 2016, ACL.
[36] Satinder P. Singh,et al. Reinforcement Learning with a Hierarchy of Abstract Models , 1992, AAAI.
[37] Jianfeng Gao,et al. A User Simulator for Task-Completion Dialogues , 2016, ArXiv.
[38] Salim Roukos,et al. Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.
[39] Verena Rieser,et al. Referenceless Quality Estimation for Natural Language Generation , 2017, ArXiv.
[40] Jing He,et al. A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems , 2016, INTERSPEECH.
[41] Dimitra Gkatzia,et al. Natural Language Generation enhances human decision-making with uncertain information , 2016, ACL.
[42] Matthew R. Walter,et al. What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment , 2015, NAACL.
[43] Jianfeng Gao,et al. End-to-End Task-Completion Neural Dialogue Systems , 2017, IJCNLP.
[44] N. Draper,et al. Applied Regression Analysis , 1966 .
[45] Aoife Cahill. Correlating Human and Automatic Evaluation of a German Surface Realiser , 2009, ACL/IJCNLP.
[46] Maxine Eskénazi,et al. Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning , 2016, SIGDIAL Conference.
[47] Oliver Lemon,et al. Crowd-sourcing NLG Data: Pictures Elicit Better Data. , 2016, INLG.
[48] J. R. Landis,et al. The measurement of observer agreement for categorical data. , 1977, Biometrics.
[49] Joel R. Tetreault,et al. There’s No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction , 2016, EMNLP.
[50] Andreas Vlachos,et al. Imitation learning for language generation from unaligned data , 2016, COLING.
[51] Doina Precup,et al. Intra-Option Learning about Temporally Abstract Actions , 1998, ICML.
[52] Mert Kilickaya,et al. Re-evaluating Automatic Metrics for Image Captioning , 2016, EACL.
[53] Jianfeng Gao,et al. deltaBLEU: A Discriminative Metric for Generation Tasks with Intrinsically Diverse Targets , 2015, ACL.
[54] Jianfeng Gao,et al. Investigation of Language Understanding Impact for Reinforcement Learning Based Dialogue Systems , 2017, ArXiv.
[55] Geoffrey Zweig,et al. Hybrid Code Networks: practical and efficient end-to-end dialog control with supervised and reinforcement learning , 2017, ACL.
[56] Matthew Marge,et al. Evaluating Evaluation Methods for Generation in the Presence of Variation , 2005, CICLing.
[57] Jianfeng Gao,et al. Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access , 2016, ACL.
[58] Milica Gasic,et al. POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.
[59] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[60] Dongho Kim,et al. Distributed dialogue policies for multi-domain statistical dialogue management , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[61] Pierre Lison,et al. Multi-Policy Dialogue Management , 2011, SIGDIAL Conference.
[62] David Vandyke,et al. Continuously Learning Neural Dialogue Management , 2016, ArXiv.
[63] Steve J. Young,et al. Still talking to machines (cognitively speaking) , 2010, INTERSPEECH.
[64] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[65] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.