Learning all optimal policies with multiple criteria
暂无分享,去创建一个
[1] Stuart J. Russell,et al. Q-Decomposition for Reinforcement Learning Agents , 2003, ICML.
[2] Shie Mannor,et al. A Geometric Approach to Multi-Criterion Reinforcement Learning , 2004, J. Mach. Learn. Res..
[3] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[4] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[5] Pieter Abbeel,et al. Apprenticeship learning via inverse reinforcement learning , 2004, ICML.
[6] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[7] Csaba Szepesvári,et al. Multi-criteria Reinforcement Learning , 1998, ICML.
[8] Eugene A. Feinberg,et al. Constrained Markov Decision Models with Weighted Discounted Rewards , 1995, Math. Oper. Res..
[9] Sriraam Natarajan,et al. Dynamic preferences in multi-criteria reinforcement learning , 2005, ICML.
[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[11] Sean R Eddy,et al. What is dynamic programming? , 2004, Nature Biotechnology.
[12] G. Ainslie. Breakdown of will , 2001 .
[13] Kenneth L. Clarkson,et al. Applications of random sampling in computational geometry, II , 1988, SCG '88.