Scalarized multi-objective reinforcement learning: Novel design techniques
暂无分享,去创建一个
[1] Susan A. Murphy,et al. Efficient Reinforcement Learning with Multiple Reward Functions for Randomized Controlled Trial Analysis , 2010, ICML.
[2] Jean Dickinson Gibbons,et al. Nonparametric Statistical Inference , 1972, International Encyclopedia of Statistical Science.
[3] M.A. Wiering,et al. Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes , 2007, 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning.
[4] Kalyanmoy Deb,et al. A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..
[5] T. L. Saaty,et al. The computational algorithm for the parametric objective function , 1955 .
[6] Csaba Szepesvári,et al. Multi-criteria Reinforcement Learning , 1998, ICML.
[7] Subhabrata Chakraborti,et al. Nonparametric Statistical Inference , 2011, International Encyclopedia of Statistical Science.
[8] Nicola Beume,et al. Scalarization versus indicator-based selection in multi-objective CMA evolution strategies , 2008, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence).
[9] Chris Watkins,et al. Learning from delayed rewards , 1989 .
[10] J. Dennis,et al. A closer look at drawbacks of minimizing weighted sums of objectives for Pareto set generation in multicriteria optimization problems , 1997 .
[11] Srini Narayanan,et al. Learning all optimal policies with multiple criteria , 2008, ICML '08.
[12] Evan Dekker,et al. Empirical evaluation methods for multiobjective reinforcement learning algorithms , 2011, Machine Learning.
[13] Lothar Thiele,et al. Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..