A Hybrid Transfer Algorithm for Reinforcement Learning Based on Spectral Method
暂无分享,去创建一个
Xue-Song Wang | Yu-Hu Cheng | Huan-Ting Feng | Yu-hu Cheng | X. Wang | Ming Li | Meiqiang Zhu | Huanting Feng | Mei-Qiang Zhu | Ming Li
[1] Chen Xing-guo,et al. Transfer of Reinforcement Learning:The State of the Art , 2008 .
[2] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..
[3] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[4] Sridhar Mahadevan,et al. Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes , 2007, J. Mach. Learn. Res..
[5] Wang Xuesong,et al. Q-learning System Based on Cooperative Least Squares Support Vector Machine , 2009 .
[6] Chen Shi,et al. Research on Reinforcement Learning Technology: A Review , 2004 .
[7] Von-Wun Soo,et al. AUTOMATIC COMPLEXITY REDUCTION IN REINFORCEMENT LEARNING , 2010, Comput. Intell..
[8] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[9] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[10] S. Mahadevan,et al. Proto-transfer Learning in Markov Decision Processes Using Spectral Methods , 2006 .
[11] Alicia P. Wolfe,et al. Identifying useful subgoals in reinforcement learning by local graph partitioning , 2005, ICML.
[12] J. Yi,et al. An OVerview on the Adaptive Dynamic Programming Based Urban City Traffic Signal Optimal Control: An OVerview on the Adaptive Dynamic Programming Based Urban City Traffic Signal Optimal Control , 2009 .
[13] Sridhar Mahadevan,et al. Recent Advances in Hierarchical Reinforcement Learning , 2003, Discret. Event Dyn. Syst..
[14] Luo Siwei and Zhao Lianwei. Manifold Learning Algorithms Based on Spectral Graph Theory , 2006 .
[15] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[16] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.
[17] Xin Xu,et al. Kernel-Based Least Squares Policy Iteration for Reinforcement Learning , 2007, IEEE Transactions on Neural Networks.