Tree Based Discretization for Continuous State Space Reinforcement Learning