Dynamic multiobjective optimization driven by inverse reinforcement learning

Abstract Due to the widespread interest in dynamic multiobjective optimization in real-world applications, more and more approaches exploiting machine learning are deployed to tackle this type of problems. Unfortunately, recent works do not make full use of the data obtained during the optimization process, which could be benefit for model training thereby mining the dynamic characteristics of the underlying problem. To address this issue, this paper proposes a dynamic multiobjective evolutionary algorithm driven by inverse reinforcement learning to solve the dynamic multiobjective optimization problems. IRL is widely used to recover the unknown reward function, making it possible to perform at an expert level. The notable features of the proposed algorithm mainly consist of data-driven evolutionary technique, which uses inverse reinforcement learning as a surrogate-assisted model for model training. This design makes full use of the surrogate management strategy based on inverse reinforcement learning to optimize the reward function within a reinforcement learning framework. At the same time, the algorithm can generate a promising policy based on limited training data during the optimization process to achieve better algorithm evolution and guide the search. The experimental results on the benchmark problems validate that the proposed algorithm is effective in dealing with dynamic multiobjective optimization.

[1]  Yaochu Jin,et al.  Surrogate-assisted evolutionary computation: Recent advances and future challenges , 2011, Swarm Evol. Comput..

[2]  Haym Hirsh,et al.  Informed operators: Speeding up genetic-algorithm-based design optimization using reduced models , 2000, GECCO.

[3]  Min Jiang,et al.  Individual-Based Transfer Learning for Dynamic Multiobjective Optimization , 2020, IEEE Transactions on Cybernetics.

[4]  Shengxiang Yang,et al.  A Steady-State and Generational Evolutionary Algorithm for Dynamic Multiobjective Optimization , 2017, IEEE Transactions on Evolutionary Computation.

[5]  Kalyanmoy Deb,et al.  Dynamic Multi-objective Optimization and Decision-Making Using Modified NSGA-II: A Case Study on Hydro-thermal Power Scheduling , 2007, EMO.

[6]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[7]  Qingfu Zhang,et al.  Expensive Multiobjective Optimization by MOEA/D With Gaussian Process Model , 2010, IEEE Transactions on Evolutionary Computation.

[8]  Carlos A. Coello Coello,et al.  Handling multiple objectives with particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[9]  Bernhard Sendhoff,et al.  A framework for evolutionary optimization with approximate fitness functions , 2002, IEEE Trans. Evol. Comput..

[10]  K. S. Anderson,et al.  Genetic crossover strategy using an approximation concept , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[11]  Dipti Srinivasan,et al.  A genetic algorithm - differential evolution based hybrid framework: Case study on unit commitment scheduling problem , 2016, Inf. Sci..

[12]  Lixin Tang,et al.  A knee-guided prediction approach for dynamic multi-objective optimization , 2020, Inf. Sci..

[13]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[14]  Zexuan Zhu,et al.  Hybrid of memory and prediction strategies for dynamic multiobjective optimization , 2019, Inf. Sci..

[15]  Kay Chen Tan,et al.  A Competitive-Cooperative Coevolutionary Paradigm for Dynamic Multiobjective Optimization , 2009, IEEE Transactions on Evolutionary Computation.

[16]  Meng-Sing Liou,et al.  Multiobjective Optimization Using Coupled Response Surface Model and Evolutionary Algorithm. , 2005 .

[17]  Dan Guo,et al.  Data-Driven Evolutionary Optimization: An Overview and Case Studies , 2019, IEEE Transactions on Evolutionary Computation.

[18]  Shengxiang Yang,et al.  Evolutionary Dynamic Multiobjective Optimization: Benchmarks and Algorithm Comparisons , 2017, IEEE Transactions on Cybernetics.

[19]  Carlos Cruz,et al.  Optimization in dynamic environments: a survey on problems, methods and measures , 2011, Soft Comput..

[20]  Robert E. Schapire,et al.  A Game-Theoretic Approach to Apprenticeship Learning , 2007, NIPS.

[21]  David Wallace,et al.  Dynamic multi-objective optimization with evolutionary algorithms: a forward-looking approach , 2006, GECCO.

[22]  Athanasios V. Vasilakos,et al.  Information sharing in bee colony for detecting multiple niches in non-stationary environments , 2013, GECCO.

[23]  Yaru Hu,et al.  A dynamic multi-objective evolutionary algorithm based on intensity of environmental change , 2020, Inf. Sci..

[24]  Terence C. Fogarty,et al.  A Genetic Algorithm with Variable Range of Local Search for Tracking Changing Environments , 1996, PPSN.

[25]  Pieter Abbeel,et al.  Apprenticeship learning via inverse reinforcement learning , 2004, ICML.

[26]  Lamjed Ben Said,et al.  A dynamic multi-objective evolutionary algorithm using a change severity-based adaptive population management strategy , 2015, Soft Computing.

[27]  Witold Pedrycz,et al.  Multidirectional Prediction Approach for Dynamic Multiobjective Optimization Problems , 2019, IEEE Transactions on Cybernetics.

[28]  Sanghamitra Bandyopadhyay,et al.  Multiobjective GAs, quantitative indices, and pattern classification , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[29]  Kay Chen Tan,et al.  Evolutionary Dynamic Multiobjective Optimization Via Kalman Filter Prediction , 2016, IEEE Transactions on Cybernetics.

[30]  Marco Laumanns,et al.  SPEA2: Improving the Strength Pareto Evolutionary Algorithm For Multiobjective Optimization , 2002 .

[31]  David Silver,et al.  Learning to search: Functional gradient techniques for imitation learning , 2009, Auton. Robots.

[32]  Kai Zhang,et al.  Multiobjective Evolution Strategy for Dynamic Multiobjective Optimization , 2020, IEEE Transactions on Evolutionary Computation.

[33]  Marc Schoenauer,et al.  Surrogate Deterministic Mutation: Preliminary Results , 2001, Artificial Evolution.

[34]  A. Keane,et al.  Evolutionary Optimization of Computationally Expensive Problems via Surrogate Modeling , 2003 .

[35]  Michèle Sebag,et al.  A mono surrogate for multiobjective optimization , 2010, GECCO '10.

[36]  Qingfu Zhang,et al.  A Population Prediction Strategy for Evolutionary Dynamic Multiobjective Optimization , 2014, IEEE Transactions on Cybernetics.

[37]  Handing Wang,et al.  A Random Forest-Assisted Evolutionary Algorithm for Data-Driven Constrained Multiobjective Combinatorial Optimization of Trauma Systems , 2020, IEEE Transactions on Cybernetics.

[38]  Ying Han,et al.  A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling , 2018, Inf. Sci..

[39]  Min Jiang,et al.  Dynamic Multi-objective Estimation of Distribution Algorithm based on Domain Adaptation and Nonparametric Estimation , 2018, Inf. Sci..

[40]  Gary G. Yen,et al.  Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor , 2005, IEEE Transactions on Automation Science and Engineering.

[41]  Gary G. Yen,et al.  A cluster-based clonal selection algorithm for optimization in dynamic environment , 2019, Swarm Evol. Comput..

[42]  Pieter Abbeel,et al.  Autonomous Helicopter Aerobatics through Apprenticeship Learning , 2010, Int. J. Robotics Res..

[43]  Xiangxiang Zeng,et al.  An Evolutionary Algorithm Based on Minkowski Distance for Many-Objective Optimization , 2019, IEEE Transactions on Cybernetics.

[44]  Tapabrata Ray,et al.  An Evolutionary Algorithm with Spatially Distributed Surrogates for Multiobjective Optimization , 2007, ACAL.

[45]  Bernhard Sendhoff,et al.  A Multiobjective Evolutionary Algorithm Using Gaussian Process-Based Inverse Modeling , 2015, IEEE Transactions on Evolutionary Computation.

[46]  Lixin Tang,et al.  A reinforcement learning approach for dynamic multi-objective optimization , 2021, Inf. Sci..

[47]  Qingfu Zhang,et al.  Multiobjective Optimization Problems With Complicated Pareto Sets, MOEA/D and NSGA-II , 2009, IEEE Transactions on Evolutionary Computation.