Gaussian Processes for Fast Policy Optimisation of POMDP-based Dialogue Managers
暂无分享,去创建一个
Milica Gasic | Steve J. Young | Blaise Thomson | Filip Jurcícek | Simon Keizer | François Mairesse | Kai Yu | S. Young | François Mairesse | Kai Yu | Simon Keizer | Blaise Thomson | Filip Jurcícek | Milica Gasic
[1] Ronen I. Brafman,et al. A Heuristic Variable Grid Solution Method for POMDPs , 1997, AAAI/IAAI.
[2] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[3] Thomas G. Dietterich. Adaptive computation and machine learning , 1998 .
[4] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[5] Shie Mannor,et al. Reinforcement learning with Gaussian processes , 2005, ICML.
[6] Jason D. Williams,et al. Partially Observable Markov Decision Processes for Spoken Dialogue Management , 2006 .
[7] Carl E. Rasmussen,et al. Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.
[8] Carl E. Rasmussen,et al. Gaussian process dynamic programming , 2009, Neurocomputing.
[9] Milica Gasic,et al. The Hidden Information State model: A practical framework for POMDP-based spoken dialogue management , 2010, Comput. Speech Lang..