Autonomous Cross-Domain Knowledge Transfer in Lifelong Policy Gradient Reinforcement Learning
Eric Eaton | Haitham Bou-Ammar | Paul Ruvolo | José-Marcio Luna
[1] Peter Stone,et al. Autonomous transfer for reinforcement learning , 2008, AAMAS.
[2] Alessandro Lazaric,et al. Bayesian Multi-Task Reinforcement Learning , 2010, ICML.
[3] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
[4] Peter Stone,et al. Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res.
[5] Alan Fern,et al. Multi-task reinforcement learning: a hierarchical Bayesian approach , 2007, ICML '07.
[6] Haitham Bou-Ammar,et al. Reinforcement learning transfer via sparse coding , 2012, AAMAS.
[7] Finale Doshi-Velez,et al. Hidden Parameter Markov Decision Processes: An Emerging Paradigm for Modeling Families of Related Tasks , 2014, AAAI Fall Symposia.
[8] Peter Englert,et al. Multi-task policy search for robotics , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[9] Hal Daumé,et al. Learning Task Grouping and Overlap in Multi-task Learning , 2012, ICML.
[10] Stefan Schaal,et al. Natural Actor-Critic , 2003, Neurocomputing.
[11] Shimon Whiteson,et al. Transfer via inter-task mappings in policy search reinforcement learning , 2007, AAMAS '07.
[13] Haitham Bou-Ammar,et al. Automatically Mapped Transfer between Reinforcement Learning Tasks via Three-Way Restricted Boltzmann Machines , 2013, ECML/PKDD.
[14] M A H Dempster,et al. An automated FX trading system using adaptive reinforcement learning , 2006, Expert Syst. Appl.
[15] Eric Eaton,et al. ELLA: An Efficient Lifelong Learning Algorithm , 2013, ICML.
[16] Jürgen Schmidhuber,et al. State-Dependent Exploration for Policy Gradient Methods , 2008, ECML/PKDD.
[17] Dimitri P. Bertsekas,et al. Dynamic Programming and Optimal Control, Two Volume Set , 1995 .
[18] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[19] Lawrence Carin,et al. Cross-Domain Multitask Learning with Latent Probit Models , 2012, ICML.
[20] Kristen Grauman,et al. Learning with Whom to Share in Multi-task Feature Learning , 2011, ICML.
[21] Eric Eaton,et al. Online Multi-Task Learning for Policy Gradient Methods , 2014, ICML.
[23] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[24] Shimon Whiteson,et al. Learning potential functions and their representations for multi-task reinforcement learning , 2013, Autonomous Agents and Multi-Agent Systems.
[25] Massimiliano Pontil,et al. Sparse coding for multitask and transfer learning , 2012, ICML.
[26] Sebastian Thrun,et al. Discovering Structure in Multiple Learning Tasks: The TC Algorithm , 1996, ICML.
[27] Thomas G. Dietterich. What is machine learning? , 2020, Archives of Disease in Childhood.
[28] Jan Peters,et al. Policy Search for Motor Primitives in Robotics , 2022 .
[29] Hui Li,et al. Multi-task Reinforcement Learning in Partially Observable Stochastic Environments , 2009, J. Mach. Learn. Res.
[30] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[31] Leslie Pack Kaelbling,et al. Effective reinforcement learning for mobile robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).