On the Characteristics of Sequential Decision Problems and Their Impact on Evolutionary Computation and Reinforcement Learning
暂无分享,去创建一个
[1] Risto Miikkulainen,et al. Efficient Reinforcement Learning through Symbiotic Evolution , 1996, Machine Learning.
[2] David E. Moriarty,et al. Symbiotic Evolution of Neural Networks in Sequential Decision Tasks , 1997 .
[3] Jan Drugowitsch. Design and Analysis of Learning Classifier Systems: A Probabilistic Approach (Studies in Computational Intelligence) , 2008 .
[4] André da Motta Salles Barreto,et al. Restricted gradient-descent algorithm for value-function approximation in reinforcement learning , 2008, Artif. Intell..
[5] L. Darrell Whitley,et al. The GENITOR Algorithm and Selection Pressure: Why Rank-Based Allocation of Reproductive Trials is Best , 1989, ICGA.
[6] Richard S. Sutton,et al. Learning to predict by the methods of temporal differences , 1988, Machine Learning.
[7] Robert Tibshirani,et al. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.
[8] Risto Miikkulainen,et al. Robust non-linear control through neuroevolution , 2003 .
[9] Risto Miikkulainen,et al. Efficient evolution of neural networks through complexification , 2004 .
[10] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[11] L. Darrell Whitley,et al. Genetic Reinforcement Learning for Neurocontrol Problems , 2004, Machine Learning.
[12] Jan Drugowitsch. Design and Analysis of Learning Classifier Systems - A Probabilistic Approach , 2008, Studies in Computational Intelligence.
[13] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[14] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[15] Andrew W. Moore,et al. Generalization in Reinforcement Learning: Safely Approximating the Value Function , 1994, NIPS.
[16] David E. Goldberg,et al. Genetic Algorithms in Search Optimization and Machine Learning , 1988 .
[17] John J. Grefenstette,et al. Evolutionary Algorithms for Reinforcement Learning , 1999, J. Artif. Intell. Res..
[18] John N. Tsitsiklis,et al. Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.
[19] Preben Alstrøm,et al. Learning to Drive a Bicycle Using Reinforcement Learning and Shaping , 1998, ICML.
[20] Risto Miikkulainen,et al. Accelerated Neural Evolution through Cooperatively Coevolved Synapses , 2008, J. Mach. Learn. Res..
[21] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[22] John N. Tsitsiklis,et al. Feature-based methods for large scale dynamic programming , 2004, Machine Learning.
[23] D. J. White,et al. Real Applications of Markov Decision Processes , 1985 .
[24] Larry Bull,et al. Foundations of Learning Classifier Systems , 2005 .