Hybrid algorithms based on combining reinforcement learning and metaheuristic methods to solve global optimization problems

Abstract This paper introduces three hybrid algorithms that help in solving global optimization problems using reinforcement learning along with metaheuristic methods. Using the algorithms presented, the search agents try to find a global optimum avoiding the local optima trap. Compared to the classical metaheuristic approaches, the proposed algorithms display higher success in finding new areas as well as exhibiting a more balanced performance while in the exploration and exploitation phases. The algorithms employ reinforcement agents to select an environment based on predefined actions and tasks. A reward and penalty system is used by the agents to discover the environment, done dynamically without following a predetermined model or method. The study makes use of Q-Learning method in all three metaheuristic algorithms, so-called RL I−GWO , RL Ex−GWO , and RL WOA algorithms, so as to check and control exploration and exploitation with Q-Table. The Q-Table values guide the search agents of the metaheuristic algorithms to select between the exploration and exploitation phases. A control mechanism is used to get the reward and penalty values for each action. The algorithms presented in this paper are simulated over 30 benchmark functions from CEC 2014, 2015 and the results obtained are compared with well-known metaheuristic and hybrid algorithms (GWO, RL GWO , I-GWO, Ex-GWO, and WOA). The proposed methods have also been applied to the inverse kinematics of the robot arms problem. The results of the used algorithms demonstrate that RL WOA provides better solutions for relevant problems.

[1]  Feras Al-Obeidat,et al.  Combining Machine Learning and Metaheuristics Algorithms for Classification Method PROAFTN , 2019, Enhanced Living Environments.

[2]  Andrew Lewis,et al.  The Whale Optimization Algorithm , 2016, Adv. Eng. Softw..

[3]  Xinyu Shao,et al.  An efficient surrogate-assisted particle swarm optimization algorithm for high-dimensional expensive problems , 2019, Knowl. Based Syst..

[4]  Enrique Alba,et al.  The exploration/exploitation tradeoff in dynamic cellular genetic algorithms , 2005, IEEE Transactions on Evolutionary Computation.

[5]  Hossein Nezamabadi-pour,et al.  GSA: A Gravitational Search Algorithm , 2009, Inf. Sci..

[6]  Rasul Enayatifar,et al.  GSA-LA: gravitational search algorithm based on learning automata , 2020, J. Exp. Theor. Artif. Intell..

[7]  Tome Eftimov,et al.  Understanding exploration and exploitation powers of meta-heuristic stochastic optimization algorithms through statistical analysis , 2019, GECCO.

[8]  Marco A. Contreras-Cruz,et al.  Mobile robot path planning using artificial bee colony and evolutionary programming , 2015, Appl. Soft Comput..

[9]  Shahrel Azmin Suandi,et al.  Q-learning-based simulated annealing algorithm for constrained engineering design problems , 2019, Neural Computing and Applications.

[10]  Konstantinos G. Margaritis,et al.  On benchmarking functions for genetic algorithms , 2001, Int. J. Comput. Math..

[11]  Xin Yao,et al.  Evolutionary programming made faster , 1999, IEEE Trans. Evol. Comput..

[12]  Jing Zhang,et al.  A novel reinforcement learning based grey wolf optimizer algorithm for unmanned aerial vehicles (UAVs) path planning , 2020, Appl. Soft Comput..

[13]  Amir Seyyedabbasi,et al.  MAP-ACO: An efficient protocol for multi-agent pathfinding in real-time WSN and decentralized IoT systems , 2020, Microprocess. Microsystems.

[14]  Patrick De Causmaecker,et al.  Boosting Metaheuristic Search Using Reinforcement Learning , 2013, Hybrid Metaheuristics.

[15]  Christian Blum,et al.  Metaheuristics in combinatorial optimization: Overview and conceptual comparison , 2003, CSUR.

[16]  Qamar Askari,et al.  Political Optimizer: A novel socio-inspired meta-heuristic for global optimization , 2020, Knowl. Based Syst..

[17]  Vahideh Hayyolalam,et al.  Black Widow Optimization Algorithm: A novel meta-heuristic approach for solving engineering optimization problems , 2020, Eng. Appl. Artif. Intell..

[18]  Kevin Burrage,et al.  An improved firefly algorithm for global continuous optimization problems , 2020, Expert Syst. Appl..

[19]  J. Žerovnik Heuristics for NP-hard optimization problems - simpler is better!? , 2015 .

[20]  Jun Zhang,et al.  Genetic Learning Particle Swarm Optimization , 2016, IEEE Transactions on Cybernetics.

[21]  Hamdan Daniyal,et al.  Barnacles Mating Optimizer: A new bio-inspired algorithm for solving engineering optimization problems , 2020, Eng. Appl. Artif. Intell..

[22]  Guangzhou Zeng,et al.  Study of genetic algorithm with reinforcement learning to solve the TSP , 2009, Expert Syst. Appl..

[23]  Pratyusha Rakshit,et al.  Realization of an Adaptive Memetic Algorithm Using Differential Evolution and Q-Learning: A Case Study in Multirobot Path Planning , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[24]  Crina Grosan,et al.  Experienced Gray Wolf Optimization Through Reinforcement Learning and Neural Networks , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[25]  Li Zhang,et al.  A scattering and repulsive swarm intelligence algorithm for solving global optimization problems , 2018, Knowl. Based Syst..

[26]  Zhigang Jin,et al.  Hybridizing grey wolf optimization with neural network algorithm for global numerical optimization problems , 2019, Neural Computing and Applications.

[27]  Zong Woo Geem,et al.  A New Heuristic Optimization Algorithm: Harmony Search , 2001, Simul..

[28]  D. V. L. N. Somayajulu,et al.  Adaptive deep Q-learning model for detecting social bots and influential users in online social networks , 2019, Applied Intelligence.

[29]  Andries Petrus Engelbrecht,et al.  A study of particle swarm optimization particle trajectories , 2006, Inf. Sci..

[30]  A. L. Sangal,et al.  Tunicate Swarm Algorithm: A new bio-inspired based metaheuristic paradigm for global optimization , 2020, Eng. Appl. Artif. Intell..

[31]  Yue Xu,et al.  A reinforcement learning-based communication topology in particle swarm optimization , 2019, Neural Computing and Applications.

[32]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[33]  Peyman Mahouti,et al.  Competitive evolutionary algorithms for building performance database of a microwave transistor , 2018, Int. J. Circuit Theory Appl..

[34]  Hongming Zhang,et al.  Taxonomy of Reinforcement Learning Algorithms , 2020 .

[35]  Tugrul Cavdar,et al.  A New Heuristic Approach for Inverse Kinematics of Robot Arms , 2013 .

[36]  Vahid Khatibi Bardsiri,et al.  Poor and rich optimization algorithm: A new human-based and multi populations algorithm , 2019, Eng. Appl. Artif. Intell..

[37]  Xin-She Yang,et al.  A literature survey of benchmark functions for global optimisation problems , 2013, Int. J. Math. Model. Numer. Optimisation.

[38]  Oscar Cordón,et al.  A survey on image segmentation using metaheuristic-based deformable models: state of the art and critical analysis , 2016, Appl. Soft Comput..

[39]  Dan Boneh,et al.  On genetic algorithms , 1995, COLT '95.

[40]  Afshin Kazerooni,et al.  KASRA: A Kriging-based Adaptive Space Reduction Algorithm for global optimization of computationally expensive black-box constrained problems , 2020, Appl. Soft Comput..

[41]  Chee Peng Lim,et al.  A new Reinforcement Learning-based Memetic Particle Swarm Optimizer , 2016, Appl. Soft Comput..

[42]  Amir Seyyedabbasi,et al.  I-GWO and Ex-GWO: improved algorithms of the Grey Wolf Optimizer to solve global optimization problems , 2019, Engineering with Computers.

[43]  Xiang Feng,et al.  The social team building optimization algorithm , 2019, Soft Comput..

[44]  Urvinder Singh,et al.  Modified Grey Wolf Optimizer for Global Engineering Optimization , 2016, Appl. Comput. Intell. Soft Comput..

[45]  Hao Wang,et al.  Reinforcement Learning-Based Genetic Algorithm in Optimizing Multidimensional Data Discretization Scheme , 2020 .

[46]  Taymaz Rahkar Farshi Battle royale optimization algorithm , 2020, Neural Computing and Applications.

[47]  Fariborz Jolai,et al.  Lion Optimization Algorithm (LOA): A nature-inspired metaheuristic algorithm , 2016, J. Comput. Des. Eng..

[48]  Chunguo Wu,et al.  Particle swarm optimization based on dimensional learning strategy , 2019, Swarm Evol. Comput..

[49]  Majdi M. Mafarja,et al.  Hybrid Whale Optimization Algorithm with simulated annealing for feature selection , 2017, Neurocomputing.

[50]  Farzan Soleymani,et al.  Financial portfolio optimization with online deep reinforcement learning and restricted stacked autoencoder - DeepBreath , 2020, Expert Syst. Appl..

[51]  Kusum Deep,et al.  Improved sine cosine algorithm with crossover scheme for global optimization , 2019, Knowl. Based Syst..

[52]  Madalina M. Drugan,et al.  Reinforcement learning versus evolutionary computation: A survey on hybrid algorithms , 2019, Swarm Evol. Comput..

[53]  Andrew Lewis,et al.  Grey Wolf Optimizer , 2014, Adv. Eng. Softw..