Least-Squares Methods in Reinforcement Learning for Control
暂无分享,去创建一个
Michail G. Lagoudakis | Michael L. Littman | Ronald Parr | Ronald E. Parr | M. Littman | M. Lagoudakis
[1] John R. Rice,et al. The Algorithm Selection Problem , 1976, Adv. Comput..
[2] C. Watkins. Learning from delayed rewards , 1989 .
[3] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.
[4] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[5] Kazuo Tanaka,et al. An approach to fuzzy control of nonlinear systems: stability and design issues , 1996, IEEE Trans. Fuzzy Syst..
[6] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[7] Andrew G. Barto,et al. Linear Least-Squares Algorithms for Temporal Difference Learning , 2005, Machine Learning.
[8] Preben Alstrøm,et al. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping , 1998, ICML.
[9] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[10] Andrew W. Moore,et al. Distributed Value Functions , 1999, ICML.
[11] Daphne Koller,et al. Policy Iteration for Factored MDPs , 2000, UAI.
[12] Michail G. Lagoudakis,et al. Algorithm Selection using Reinforcement Learning , 2000, ICML.
[13] Michail G. Lagoudakis,et al. Selecting the Right Algorithm , 2001 .
[14] Michail G. Lagoudakis,et al. Model-Free Least-Squares Policy Iteration , 2001, NIPS.
[15] Carlos Guestrin,et al. Multiagent Planning with Factored MDPs , 2001, NIPS.
[16] Michail G. Lagoudakis,et al. Learning to Select Branching Rules in the DPLL Procedure for Satisfiability , 2001, Electron. Notes Discret. Math..
[17] Michail G. Lagoudakis,et al. Coordinated Reinforcement Learning , 2002, ICML.