Hidden Parameter Markov Decision Processes: An Emerging Paradigm for Modeling Families of Related Tasks
暂无分享,去创建一个
[1] Bruno Castro da Silva,et al. Learning Parameterized Skills , 2012, ICML.
[2] David Hsu,et al. Integrated Perception and Planning in the Continuous Space: A POMDP Approach , 2013, Robotics: Science and Systems.
[3] Shimon Whiteson,et al. Protecting against evaluation overfitting in empirical reinforcement learning , 2011, 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).
[4] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[5] David Wingate,et al. A Physics-Based Model Prior for Object-Oriented MDPs , 2014, ICML.
[6] Alessandro Lazaric,et al. Sequential Transfer in Multi-armed Bandit with Finite Set of Models , 2013, NIPS.
[7] Pushmeet Kohli,et al. Adapting Interaction Environments to Diverse Users through Online Action Set Selection , 2014, AAAI 2014.
[8] Sriraam Natarajan,et al. A Decision-Theoretic Model of Assistance , 2007, IJCAI.
[9] Eric Eaton,et al. Online Multi-Task Learning for Policy Gradient Methods , 2014, ICML.
[10] Lihong Li,et al. Sample Complexity of Multi-task Reinforcement Learning , 2013, UAI.
[11] Alan Fern,et al. Transfer Learning in Sequential Decision Problems: A Hierarchical Bayesian Approach , 2012, ICML Unsupervised and Transfer Learning.
[12] Steven M. LaValle,et al. Planning algorithms , 2006 .
[13] Shimon Whiteson,et al. The Reinforcement Learning Competitions , 2010 .
[14] David Hsu,et al. Planning how to learn , 2013, 2013 IEEE International Conference on Robotics and Automation.
[15] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[16] Benjamin Saul Rosman,et al. Learning domain abstractions for long lived robots , 2014 .
[17] Alan Fern,et al. A Computational Decision Theory for Interactive Assistants , 2010, Interactive Decision Theory and Game Theory.
[18] Shimon Whiteson,et al. Neuroevolutionary reinforcement learning for generalized helicopter control , 2009, GECCO.
[19] Finale Doshi-Velez,et al. Hidden Parameter Markov Decision Processes: A Semiparametric Regression Approach for Discovering Latent Task Parametrizations , 2013, IJCAI.