Development of interactive multi-objective reinforcement learning considering preference structure of a decision maker
[1] Peter Stone, et al. Reinforcement learning, 2019, Scholarpedia.
[2] Madalina M. Drugan. Multi-objective optimization perspectives on reinforcement learning algorithms using reward vectors, 2015, ESANN.
[3] Eckart Zitzler, et al. A Hypervolume-Based Optimizer for High-Dimensional Objective Spaces, 2010.
[4] Ichiro Nishizaki, et al. Fuzzy Stochastic Multiobjective Programming, 2013.
[5] Sriraam Natarajan, et al. Dynamic preferences in multi-criteria reinforcement learning, 2005, ICML.
[6] Kaisa Miettinen, et al. Nonlinear multiobjective optimization, 1998, International series in operations research and management science.