Multi-Objective Reinforcement Learning with Continuous Pareto Frontier Approximation
暂无分享,去创建一个
Marcello Restelli | Simone Parisi | Matteo Pirotta | Matteo Pirotta | Marcello Restelli | Simone Parisi
[1] Luca Bascetta,et al. Adaptive Step-Size for Policy Gradient Methods , 2013, NIPS.
[2] Isao Ono,et al. Uniform sampling of local pareto-optimal solution curves by pareto path following and its applications in multi-objective GA , 2007, GECCO '07.
[3] Raphaël Fonteneau,et al. Simultaneous perturbation algorithms for batch off-policy search , 2014, 53rd IEEE Conference on Decision and Control.
[4] Christian R. Shelton,et al. Importance sampling for reinforcement learning with multiple objectives , 2001 .
[5] Yaochu Jin,et al. A Critical Survey of Performance Indices for Multi-Objective Optimisation , 2003 .
[6] Frank Neumann,et al. Multiplicative approximations and the hypervolume indicator , 2009, GECCO.
[7] Shimon Whiteson,et al. A Survey of Multi-Objective Sequential Decision-Making , 2013, J. Artif. Intell. Res..
[8] J. Munkres. Analysis On Manifolds , 1991 .
[9] David Barber,et al. A Unifying Perspective of Parametric Policy Search Methods for Markov Decision Processes , 2012, NIPS.
[10] Marco Laumanns,et al. Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..
[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[12] Susan A. Murphy,et al. Linear fitted-Q iteration with multiple reward functions , 2013, J. Mach. Learn. Res..
[13] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[14] E. Ziegel. Matrix Differential Calculus With Applications in Statistics and Econometrics , 1989 .
[15] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[16] J. Magnus,et al. Matrix Differential Calculus with Applications in Statistics and Econometrics , 1991 .
[17] Lothar Thiele,et al. On Set-Based Multiobjective Optimization , 2010, IEEE Transactions on Evolutionary Computation.
[18] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[19] Evan Dekker,et al. Empirical evaluation methods for multiobjective reinforcement learning algorithms , 2011, Machine Learning.
[20] Luca Bascetta,et al. Policy gradient approaches for multi-objective sequential decision making , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).
[21] WhitesonShimon,et al. A survey of multi-objective sequential decision-making , 2013 .
[22] Stefan Schaal,et al. 2008 Special Issue: Reinforcement learning of motor skills with policy gradients , 2008 .
[23] Sham M. Kakade,et al. Optimizing Average Reward Using Discounted Rewards , 2001, COLT/EuroCOLT.
[24] Dewen Hu,et al. Multiobjective Reinforcement Learning: A Comprehensive Overview , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.
[25] Christian P. Robert,et al. Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.
[26] Marcello Restelli,et al. A multiobjective reinforcement learning approach to water resources systems operation: Pareto frontier approximation in a single run , 2013 .