United We Stand: Population Based Methods for Solving Unknown POMDPs
[1] Joelle Pineau, et al. Point-based value iteration: An anytime algorithm for POMDPs, 2003, IJCAI.
[2] Andrew W. Moore, et al. Direct Policy Search using Paired Statistical Tests, 2001, ICML.
[3] C. D. Gelatt, et al. Optimization by Simulated Annealing, 1983, Science.
[4] Pat Langley, et al. Editorial: On Machine Learning, 1986, Machine Learning.
[5] Andrew McCallum, et al. Reinforcement learning with selective perception and hidden state, 1996.
[6] Douglas Aberdeen, et al. Policy-Gradient Algorithms for Partially Observable Markov Decision Processes, 2003.
[7] Carl E. Rasmussen, et al. Factorial Hidden Markov Models, 1997.
[8] Michael L. Littman, et al. Algorithms for Sequential Decision Making, 1996.
[9] Katia P. Sycara, et al. Evolutionary Search, Stochastic Policies with Memory, and Reinforcement Learning with Hidden State, 2001, ICML.
[10] Leslie Pack Kaelbling, et al. Learning Policies with External Memory, 1999, ICML.