Basis function construction for hierarchical reinforcement learning
暂无分享,去创建一个
[1] Andrew G. Barto,et al. Causal Graph Based Decomposition of Factored MDPs , 2006, J. Mach. Learn. Res..
[2] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[3] Matthias W. Seeger,et al. Using the Nyström Method to Speed Up Kernel Machines , 2000, NIPS.
[4] Andrew G. Barto,et al. Skill Characterization Based on Betweenness , 2008, NIPS.
[5] Sridhar Mahadevan,et al. Learning state-action basis functions for hierarchical MDPs , 2007, ICML '07.
[6] Sridhar Mahadevan,et al. Hierarchical Average Reward Reinforcement Learning , 2007, J. Mach. Learn. Res..
[7] Thomas G. Dietterich,et al. Automatic discovery and transfer of MAXQ hierarchies , 2008, ICML '08.
[8] F. Chung. Laplacians and the Cheeger Inequality for Directed Graphs , 2005 .
[9] Thomas G. Dietterich. Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition , 1999, J. Artif. Intell. Res..
[10] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[11] Thomas G. Dietterich. The MAXQ Method for Hierarchical Reinforcement Learning , 1998, ICML.
[12] Sridhar Mahadevan,et al. Constructing basis functions from directed graphs for value function approximation , 2007, ICML '07.
[13] Sridhar Mahadevan,et al. Proto-value functions: developmental reinforcement learning , 2005, ICML.
[14] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[15] Lihong Li,et al. Analyzing feature generation for value-function approximation , 2007, ICML '07.
[16] Fan Chung,et al. Spectral Graph Theory , 1996 .
[17] Shie Mannor,et al. Automatic basis function construction for approximate dynamic programming and reinforcement learning , 2006, ICML.
[18] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[19] Marek Petrik,et al. An Analysis of Laplacian Methods for Value Function Approximation in MDPs , 2007, IJCAI.