Gaussian Processes for POMDP-Based Dialogue Manager Optimization
[1] Fabrice Lefèvre,et al. Semantic Graph Clustering for POMDP-Based Spoken Dialog Systems , 2011, INTERSPEECH.
[2] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[3] Marilyn A. Walker,et al. Reinforcement Learning for Spoken Dialogue Systems , 1999, NIPS.
[4] Filip Jurčíček. Natural Actor and Belief Critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs , 2010 .
[5] S. Young,et al. Scaling POMDPs for Spoken Dialog Management , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[6] S. Singh,et al. Optimizing Dialogue Management with Reinforcement Learning: Experiments with the NJFun System , 2011, J. Artif. Intell. Res..
[7] Kallirroi Georgila,et al. Hybrid Reinforcement/Supervised Learning of Dialogue Policies from Fixed Data Sets , 2008, CL.
[8] Stephen Young. Probabilistic methods in spoken-dialogue systems , 2000, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.
[9] Milica Gasic,et al. The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management , 2010, Comput. Speech Lang..
[10] Milica Gasic,et al. Gaussian Processes for Fast Policy Optimisation of POMDP-based Dialogue Managers , 2010, SIGDIAL Conference.
[11] Roberto Pieraccini,et al. Using Markov decision process for learning dialogue strategies , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98.
[12] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[13] Blaise Roger Marie Thomson,et al. Statistical methods for spoken dialogue management , 2013 .
[14] Matthias Hein,et al. Hilbertian Metrics and Positive Definite Kernels on Probability Measures , 2005, AISTATS.
[15] Joelle Pineau,et al. Fast reinforcement learning of dialog strategies , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings.
[16] Milica Gasic,et al. Training and Evaluation of the HIS POMDP Dialogue System in Noise , 2008, SIGDIAL Workshop.
[17] Milica Gasic,et al. Real User Evaluation of Spoken Dialogue Systems Using Amazon Mechanical Turk , 2011, INTERSPEECH.
[18] Kallirroi Georgila,et al. Adaptive Multimodal Dialogue Management based on the Information State Update Approach , 2004 .
[19] Anthony Widjaja,et al. Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.
[20] Steve J. Young,et al. Natural actor and belief critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs , 2011, TSLP.
[21] David A. Cohn,et al. Improving generalization with active learning , 1994, Machine Learning.
[22] Matthieu Geist,et al. Off-policy learning in large-scale POMDP-based dialogue systems , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[23] Shie Mannor,et al. Reinforcement learning with Gaussian processes , 2005, ICML.
[24] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.
[25] Yaakov Engel,et al. Algorithms and representations for reinforcement learning , 2005 .
[26] Roberto Pieraccini,et al. Where do we go from here? Research and Commercial Spoken Dialog Systems , 2005, SIGDIAL.
[27] Marilyn A. Walker,et al. Automatic Optimization of Dialogue Management , 2000, COLING.
[28] Baining Guo,et al. Spoken dialogue management as planning and acting under uncertainty , 2001, INTERSPEECH.
[29] Milica Gasic,et al. On-line policy optimisation of spoken dialogue systems via live interaction with human subjects , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.
[30] Tony Jebara,et al. Probability Product Kernels , 2004, J. Mach. Learn. Res..
[31] Thomas Gärtner,et al. Graph kernels and Gaussian processes for relational reinforcement learning , 2006, Machine Learning.
[32] Joelle Pineau,et al. Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.
[33] Matthieu Geist,et al. Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System , 2011, INTERSPEECH.
[34] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[35] Milica Gasic,et al. Parameter estimation for agenda-based user simulation , 2010, SIGDIAL Conference.
[36] Roberto Pieraccini,et al. A stochastic model of human-machine interaction for learning dialog strategies , 2000, IEEE Trans. Speech Audio Process..
[37] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[38] Matthew Henderson,et al. N-best error simulation for training spoken dialogue systems , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).
[39] Drew McDermott,et al. Planning and Acting , 1978, Cogn. Sci..
[40] Carl E. Rasmussen,et al. A Unifying View of Sparse Approximate Gaussian Process Regression , 2005, J. Mach. Learn. Res..
[41] Shie Mannor,et al. Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning , 2003, ICML.
[42] Lihong Li,et al. Reinforcement learning for dialog management using least-squares Policy iteration and fast feature selection , 2009, INTERSPEECH.
[43] Lehel Csató,et al. Reinforcement learning with guided policy search using Gaussian processes , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).
[44] Oliver Lemon,et al. Lossless Value Directed Compression of Complex User Goal States for Statistical Spoken Dialogue Systems , 2011, INTERSPEECH.
[45] Joelle Pineau,et al. Spoken Dialogue Management Using Probabilistic Reasoning , 2000, ACL.
[46] Zoubin Ghahramani,et al. Sparse Gaussian Processes using Pseudo-inputs , 2005, NIPS.
[47] Steve J. Young,et al. Talking to machines (statistically speaking) , 2002, INTERSPEECH.
[48] Steve J. Young,et al. Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems , 2010, Comput. Speech Lang..
[49] Carl E. Rasmussen,et al. Sparse Spectrum Gaussian Process Regression , 2010, J. Mach. Learn. Res..
[50] Carl E. Rasmussen,et al. Gaussian Processes in Reinforcement Learning , 2003, NIPS.
[51] Carl E. Rasmussen,et al. Gaussian process dynamic programming , 2009, Neurocomputing.
[52] Michael F. McTear,et al. Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkit , 1998, ICSLP.
[53] Jan Peters,et al. Incremental Sparsification for Real-time Online Model Learning , 2010, AISTATS.
[54] David J. C. MacKay,et al. Information-Based Objective Functions for Active Data Selection , 1992, Neural Computation.