On Partially Observable Markov Decision Processes Using Genetic Algorithm Based Q-Learning
[1] Daming Shi, et al. Sensitivity analysis applied to the construction of radial basis function networks, 2005, Neural Networks.
[2] Joelle Pineau, et al. Point-based value iteration: An anytime algorithm for POMDPs, 2003, IJCAI.
[3] John J. Grefenstette, et al. Evolutionary Algorithms for Reinforcement Learning, 1999, J. Artif. Intell. Res.
[4] Michel de Rougemont, et al. On the Complexity of Partially Observed Markov Decision Processes, 1996, Theor. Comput. Sci.
[5] Leslie Pack Kaelbling, et al. Planning and Acting in Partially Observable Stochastic Domains, 1998, Artif. Intell.
[6] Andy J. Keane, et al. Meta-Lamarckian learning in memetic algorithms, 2004, IEEE Transactions on Evolutionary Computation.
[7] C. Watkins. Learning from delayed rewards, 1989.
[8] Su Xiao. A Genetic Algorithm Based on Evolutionarily Stable Strategy, 2003.
[9] Leslie Pack Kaelbling, et al. Learning Policies for Partially Observable Environments: Scaling Up, 1997, ICML.