An Inverse Reinforcement Learning Algorithm for Partially Observable Domains with Application on Healthcare Dialogue Management
暂无分享,去创建一个
[1] Oliver Lemon,et al. Recent research advances in Reinforcement Learning in Spoken Dialogue Systems , 2009, The Knowledge Engineering Review.
[2] Nikos A. Vlassis,et al. Perseus: Randomized Point-based Value Iteration for POMDPs , 2005, J. Artif. Intell. Res..
[3] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[4] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[5] Joelle Pineau,et al. On the Feasibility of Using a Standardized Test for Evaluating a Speech-Controlled Smart Wheelchair , 2011 .
[6] Kee-Eung Kim,et al. Inverse Reinforcement Learning in Partially Observable Environments , 2009, IJCAI.
[7] Reid G. Simmons,et al. Smartphone Interruptibility Using Density-Weighted Uncertainty Sampling with Reinforcement Learning , 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.
[8] Maury M. Gouvea,et al. Autonomous Navigation in Dynamic Environments with Reinforcement Learning and Heuristic , 2010, 2010 Ninth International Conference on Machine Learning and Applications.
[9] Hui Li,et al. Point-Based Policy Iteration , 2007, AAAI.
[10] Brahim Chaib-draa,et al. Learning Observation Models for Dialogue POMDPs , 2012, Canadian Conference on AI.