Learning the structure of Factored Markov Decision Processes in reinforcement learning problems
暂无分享,去创建一个
Olivier Sigaud | Thomas Degris | Pierre-Henri Wuillemin | T. Degris | Olivier Sigaud | Pierre-Henri Wuillemin
[1] Keiji Kanazawa,et al. A model for reasoning about persistence and causation , 1989 .
[2] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.
[3] Craig Boutilier,et al. Exploiting Structure in Policy Construction , 1995, IJCAI.
[4] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[5] Nir Friedman,et al. Learning Bayesian Networks with Local Structure , 1996, UAI.
[6] David Maxwell Chickering,et al. A Bayesian Approach to Learning Bayesian Networks with Local Structure , 1997, UAI.
[7] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[8] Jesse Hoey,et al. SPUDD: Stochastic Planning using Decision Diagrams , 1999, UAI.
[9] Craig Boutilier,et al. Stochastic dynamic programming with factored representations , 2000, Artif. Intell..
[10] Jesse Hoey,et al. APRICODD: Approximate Policy Construction Using Decision Diagrams , 2000, NIPS.
[11] Dale Schuurmans,et al. Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs , 2002, ICML.
[12] Shobha Venkataraman,et al. Efficient Solution Algorithms for Factored MDPs , 2003, J. Artif. Intell. Res..
[13] Paul E. Utgoff,et al. Incremental Induction of Decision Trees , 1989, Machine Learning.
[14] Paul E. Utgoff,et al. Decision Tree Induction Based on Efficient Tree Restructuring , 1997, Machine Learning.
[15] Geoffrey E. Hinton,et al. Reinforcement Learning with Factored States and Actions , 2004, J. Mach. Learn. Res..
[16] J. Ross Quinlan,et al. Induction of Decision Trees , 1986, Machine Learning.
[17] M. Wells,et al. Learning with delayed rewards in Octopus , 1968, Zeitschrift für vergleichende Physiologie.
[18] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.