RLPy: a value-function-based reinforcement learning framework for education and research
Alborz Geramifard | Jonathan P. How | Christoph Dann | Robert H. Klein | William Dabney
[1] G. van Rossum, et al. Interactively testing remote servers using the Python programming language, 1991.
[2] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.
[3] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[4] Shie Mannor, et al. Automatic basis function construction for approximate dynamic programming and reinforcement learning, 2006, ICML.
[5] Frank Kirchner, et al. Performance evaluation of EANT in the RoboCup keepaway benchmark, 2007, ICMLA 2007.
[6] Lihong Li, et al. Analyzing feature generation for value-function approximation, 2007, ICML '07.
[7] Peter Auer, et al. Near-optimal Regret Bounds for Reinforcement Learning, 2008, J. Mach. Learn. Res.
[8] Michael L. Littman, et al. Multi-resolution Exploration in Continuous Spaces, 2008, NIPS.
[9] Brian Tanner, et al. RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments, 2009, J. Mach. Learn. Res.
[10] Bart De Schutter, et al. Approximate Dynamic Programming and Reinforcement Learning, 2010, Interactive Collaborative Information Systems.
[11] Alborz Geramifard, et al. Online Discovery of Feature Dependencies, 2011, ICML.
[12] Lihong Li, et al. Sample Complexity Bounds of Exploration, 2012, Reinforcement Learning.
[13] Will Dabney, et al. RLPy: A Reinforcement Learning Framework for Education and Research, 2013.
[14] David D. Cox, et al. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures, 2013, ICML.
[15] Hervé Frezza-Buet, et al. A C++ template-based reinforcement learning library: fitting the code to the mathematics, 2013, J. Mach. Learn. Res.
[16] Shie Mannor, et al. Scaling Up Approximate Value Iteration with Options: Better Policies with Fewer Iterations, 2014, ICML.