Asynchronous Vector Iteration in Multi-objective Markov Decision Processes
暂无分享,去创建一个
Lawrence Mandow | José-Luis Pérez-de-la-Cruz | Ekaterina Sedova | J. Pérez-de-la-Cruz | L. Mandow | E. Sedova
[1] D. White. Multi-objective infinite-horizon discounted Markov decision processes , 1982 .
[2] Evan Dekker,et al. Empirical evaluation methods for multiobjective reinforcement learning algorithms , 2011, Machine Learning.
[3] Marco Wiering,et al. Special issue on multi-objective reinforcement learning , 2017, Neurocomputing.
[4] Srini Narayanan,et al. Learning all optimal policies with multiple criteria , 2008, ICML '08.