A reinforcement learning approach for dynamic multi-objective optimization

Abstract Dynamic Multi-objective Optimization Problem (DMOP) is emerging in recent years as a major real-world optimization problem receiving considerable attention. Tracking the movement of Pareto front efficiently and effectively over time has been a central issue in solving DMOPs. In this paper, a reinforcement learning-based dynamic multi-objective evolutionary algorithm, called RL-DMOEA, which seamlessly integrates reinforcement learning framework and three change response mechanisms, is proposed for solving DMOPs. The proposed algorithm relocates the individuals based on the severity degree of environmental changes, which is estimated through the corresponding changes in the objective space of their decision variables. When identifying different severity degree of environmental changes, the proposed RL-DMOEA approach can learn better evolutionary behaviors from environment information, based on which apply the appropriate response mechanisms. Specifically, these change response mechanisms including the knee-based prediction, center-based prediction and indicator-based local search, are devised to promote both convergence and diversity of the algorithm under different severity of environmental changes. To verify this idea, the proposed RL-DMOEA is evaluated on CEC 2015 test problems involving various problem characteristics. Empirical studies on chosen state-of-the-art designs validate that the proposed RL-DMOEA is effective in addressing the DMOPs.

[1]  Gary G. Yen,et al.  Dynamic Evolutionary Algorithm With Variable Relocation , 2009, IEEE Transactions on Evolutionary Computation.

[2]  Kalyanmoy Deb,et al.  Dynamic multiobjective optimization problems: test cases, approximations, and applications , 2004, IEEE Transactions on Evolutionary Computation.

[3]  Chee Peng Lim,et al.  A new Reinforcement Learning-based Memetic Particle Swarm Optimizer , 2016, Appl. Soft Comput..

[4]  Fang Liu,et al.  A sphere-dominance based preference immune-inspired algorithm for dynamic multi-objective optimization , 2010, GECCO '10.

[5]  Enrique Alba,et al.  Global memory schemes for dynamic optimization , 2016, Natural Computing.

[6]  Kay Chen Tan,et al.  A predictive gradient strategy for multiobjective evolutionary algorithms in a fast changing environment , 2010, Memetic Comput..

[7]  Kay Chen Tan,et al.  Evolutionary Dynamic Multiobjective Optimization Via Kalman Filter Prediction , 2016, IEEE Transactions on Cybernetics.

[8]  Manuela Ruiz-Montiel,et al.  A temporal difference method for multi-objective reinforcement learning , 2017, Neurocomputing.

[9]  Kalyanmoy Deb,et al.  Finding Knees in Multi-objective Optimization , 2004, PPSN.

[10]  Julio Ortega Lopera,et al.  A single front genetic algorithm for parallel multi-objective optimization in dynamic environments , 2009, Neurocomputing.

[11]  Shengxiang Yang,et al.  The effect of diversity maintenance on prediction in dynamic multi-objective optimization , 2017, Appl. Soft Comput..

[12]  Richard S. Sutton,et al.  Learning to predict by the methods of temporal differences , 2004, Machine Learning.

[13]  Jun Zhang,et al.  Evolutionary Computation Meets Machine Learning: A Survey , 2011, IEEE Computational Intelligence Magazine.

[14]  Ying Han,et al.  A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling , 2018, Inf. Sci..

[15]  Marco Laumanns,et al.  Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..

[16]  Shengxiang Yang,et al.  A Steady-State and Generational Evolutionary Algorithm for Dynamic Multiobjective Optimization , 2017, IEEE Transactions on Evolutionary Computation.

[17]  Jinhua Zheng,et al.  Achieving balance between proximity and diversity in multi-objective evolutionary algorithm , 2012, Inf. Sci..

[18]  Carlos A. Coello Coello,et al.  Handling multiple objectives with particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[19]  Carlos Cruz,et al.  Optimization in dynamic environments: a survey on problems, methods and measures , 2011, Soft Comput..

[20]  Yaochu Jin,et al.  A directed search strategy for evolutionary dynamic multiobjective optimization , 2014, Soft Computing.

[21]  Xin Yao,et al.  Dynamic Multiobjectives Optimization With a Changing Number of Objectives , 2016, IEEE Transactions on Evolutionary Computation.

[22]  Ye Tian,et al.  A Knee Point-Driven Evolutionary Algorithm for Many-Objective Optimization , 2015, IEEE Transactions on Evolutionary Computation.

[23]  Lixin Tang,et al.  A knee-guided prediction approach for dynamic multi-objective optimization , 2020, Inf. Sci..

[24]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[25]  Zhuhong Zhang,et al.  Multiobjective optimization immune algorithm in dynamic environments and its application to greenhouse control , 2008, Appl. Soft Comput..

[26]  Kay Chen Tan,et al.  A Coevolutionary Paradigm for Dynamic Multi-Objective Optimization , 2009 .

[27]  Anton Schwartz,et al.  A Reinforcement Learning Method for Maximizing Undiscounted Rewards , 1993, ICML.

[28]  Lamjed Ben Said,et al.  A dynamic multi-objective evolutionary algorithm using a change severity-based adaptive population management strategy , 2015, Soft Computing.

[29]  Licheng Jiao,et al.  A coevolutionary technique based on multi-swarm particle swarm optimization for dynamic multi-objective optimization , 2017, Eur. J. Oper. Res..

[30]  Gary G. Yen,et al.  Minimum Manhattan Distance Approach to Multiple Criteria Decision Making in Multiobjective Optimization Problems , 2016, IEEE Transactions on Evolutionary Computation.

[31]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[32]  Lin Li,et al.  Quantum immune clonal coevolutionary algorithm for dynamic multiobjective optimization , 2014, Soft Comput..

[33]  Qingfu Zhang,et al.  Multiobjective Optimization Problems With Complicated Pareto Sets, MOEA/D and NSGA-II , 2009, IEEE Transactions on Evolutionary Computation.

[34]  Qingfu Zhang,et al.  A Population Prediction Strategy for Evolutionary Dynamic Multiobjective Optimization , 2014, IEEE Transactions on Cybernetics.

[35]  Gary G. Yen,et al.  Transfer Learning-Based Dynamic Multiobjective Optimization Algorithms , 2016, IEEE Transactions on Evolutionary Computation.