Novel hybrid algorithm for Team Orienteering Problem with Time Windows for rescue applications

Abstract Robots for rescue operations after a disaster are an interesting and challenging research problem that has the potential to save lives and reduce economic losses after a disaster. We developed TOPTWR, an extension of the popular TOPTW model, to model the issues in task allocation for teams of rescue robots. Our hybrid algorithm is based on a team of heterogeneous humanoid robots trying to optimize five objectives (task rewards, task completion time, total energy, maximum energy consumption for a single robot, and missed deadline penalties). A common approach to solve these kinds of problems are multi-objective evolutionary algorithms (MOEAs), but their major disadvantage is that they cannot deal with dynamic environments easily. This paper presents an efficient solution for TOPTWR by combining MOEAs with learning algorithms. A novel Extended Multi-Start Simulated Annealing Iterated Local Search (EMSAILS) operator using a modern state-of-the-art NSGA-III algorithm is proposed. In addition, we applied Q-Learning to learn the likely changes in the environment and how to react to them. This algorithm, HMO-TOPTWR-NSGA-III (HMO-N-L), uses an artificial neural network (ANN) as a function approximator to make the huge state and action spaces tractable. This paper includes a thorough empirical evaluation demonstrating the effectiveness of the multi-objective algorithm in both static and dynamic environments. The evaluation shows that the proposed algorithm reduces the error by up to 42% against three state-of-the-art approaches to TOPTW (HMO-N, MSA, and IPI).

[1]  Paolo Toth,et al.  The Vehicle Routing Problem , 2002, SIAM monographs on discrete mathematics and applications.

[2]  Hakim Mitiche,et al.  A taxonomy for task allocation problems with temporal and ordering constraints , 2017, Robotics Auton. Syst..

[3]  Richard F. Hartl,et al.  Metaheuristics for the bi-objective orienteering problem , 2009, Swarm Intelligence.

[4]  Peng-Yeng Yin,et al.  Multi-objective task allocation in distributed computing systems by hybrid particle swarm optimization , 2007, Appl. Math. Comput..

[5]  Enrico Pagello,et al.  Advanced approaches for multi-robot coordination in logistic scenarios , 2017, Robotics Auton. Syst..

[6]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[7]  Cliff T. Ragsdale,et al.  A Grouping Genetic Algorithm for the Multiple Traveling Salesperson Problem , 2007, Int. J. Inf. Technol. Decis. Mak..

[8]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[9]  Robert O. Ambrose,et al.  Evolution of the NASA/DARPA Robonaut control system , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[10]  Christopher G. Atkeson,et al.  Optimization‐based Full Body Control for the DARPA Robotics Challenge , 2015, J. Field Robotics.

[11]  Zibin Zheng,et al.  Multiobjective Vehicle Routing Problems With Simultaneous Delivery and Pickup and Time Windows: Formulation, Instances, and Algorithms , 2016, IEEE Transactions on Cybernetics.

[12]  Weihua Sheng,et al.  Multirobot Cooperative Learning for Predator Avoidance , 2015, IEEE Transactions on Control Systems Technology.

[13]  R. Bolaños,et al.  A multiobjective non-dominated sorting genetic algorithm (NSGA-II) for the Multiple Traveling Salesman Problem , 2015 .

[14]  Dean A. Pomerleau,et al.  Neural Network Perception for Mobile Robot Guidance , 1993 .

[15]  Giovanni Righini,et al.  Decremental state space relaxation strategies and initialization heuristics for solving the Orienteering Problem with Time Windows with dynamic programming , 2009, Comput. Oper. Res..

[16]  Mhand Hifi,et al.  A hybrid multi-objective evolutionary algorithm for the team orienteering problem , 2017, 2017 4th International Conference on Control, Decision and Information Technologies (CoDIT).

[17]  Tunchan Cura,et al.  An artificial bee colony algorithm approach for the team orienteering problem with time windows , 2014, Comput. Ind. Eng..

[18]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[19]  Shih-Wei Lin,et al.  A simulated annealing heuristic for the team orienteering problem with time windows , 2012, Eur. J. Oper. Res..

[20]  Abdulla Al-Kaff,et al.  ROS-Based Approach for Unmanned Vehicles in Civil Applications , 2019 .

[21]  Doug A. Bowman,et al.  Human‐robot Teaming for Rescue Missions: Team ViGIR's Approach to the 2013 DARPA Robotics Challenge Trials , 2015, J. Field Robotics.

[23]  Gintaras Palubeckis Single row facility layout using multi-start simulated annealing , 2017, Comput. Ind. Eng..

[24]  Dilip Kumar Pratihar,et al.  Task allocation and collision-free path planning of centralized multi-robots system for industrial plant inspection using heuristic methods , 2016, Robotics Auton. Syst..

[25]  Elise Miller-Hooks,et al.  A TABU search heuristic for the team orienteering problem , 2005, Comput. Oper. Res..

[26]  MengChu Zhou,et al.  Colored Traveling Salesman Problem , 2015, IEEE Transactions on Cybernetics.

[27]  Gora Chand Nandi,et al.  Multiobjective optimized bipedal locomotion , 2019, Int. J. Mach. Learn. Cybern..

[28]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[29]  Mohsen Fathian,et al.  Omnidirectional walking using central pattern generator , 2016, Int. J. Mach. Learn. Cybern..

[30]  Shih-Wei Lin,et al.  Team orienteering problem with time windows and time-dependent scores , 2019, Comput. Ind. Eng..

[31]  G. Laporte,et al.  A tabu search heuristic for periodic and multi-depot vehicle routing problems , 1997, Networks.

[32]  Chukwudi Anyakoha,et al.  A review of particle swarm optimization. Part II: hybridisation, combinatorial, multicriteria and constrained optimization, and indicative applications , 2008, Natural Computing.

[33]  Hsiao-Fan Wang,et al.  A genetic algorithm for the simultaneous delivery and pickup problems with time window , 2012, Comput. Ind. Eng..

[34]  Junhyuk Park,et al.  Exact Algorithm for the Capacitated Team Orienteering Problem with Time Windows , 2017 .

[35]  Dirk Van Oudheusden,et al.  The orienteering problem: A survey , 2011, Eur. J. Oper. Res..

[36]  Shuhui Li,et al.  Training Recurrent Neural Networks With the Levenberg–Marquardt Algorithm for Optimal Control of a Grid-Connected Converter , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[37]  Ali Asghar Alesheikh,et al.  Agent-based task allocation under uncertainties in disaster environments: An approach to interval uncertainty , 2017 .

[38]  Nacima Labadie,et al.  The Team Orienteering Problem with Time Windows: An LP-based Granular Variable Neighborhood Search , 2012, Eur. J. Oper. Res..

[39]  Alok Singh,et al.  A new grouping genetic algorithm approach to the multiple traveling salesperson problem , 2008, Soft Comput..

[40]  Hoong Chuin Lau,et al.  Well-tuned algorithms for the Team Orienteering Problem with Time Windows , 2017, J. Oper. Res. Soc..

[41]  Paul W. H. Chung,et al.  Reliable, Distributed Scheduling and Rescheduling for Time-Critical, Multiagent Systems , 2018, IEEE Transactions on Automation Science and Engineering.

[42]  Ching-Jung Ting,et al.  Two-level particle swarm optimization for the multi-modal team orienteering problem with time windows , 2017, Appl. Soft Comput..

[43]  Shu Zhang,et al.  Multi-period orienteering with uncertain adoption likelihood and waiting at customers , 2020, Eur. J. Oper. Res..

[44]  Marius M. Solomon,et al.  Algorithms for the Vehicle Routing and Scheduling Problems with Time Window Constraints , 1987, Oper. Res..

[45]  Qingfu Zhang,et al.  MOEA/D: A Multiobjective Evolutionary Algorithm Based on Decomposition , 2007, IEEE Transactions on Evolutionary Computation.

[46]  Xing Su,et al.  Two Innovative Coalition Formation Models for Dynamic Task Allocation in Disaster Rescues , 2018 .

[47]  Shih-Wei Lin,et al.  Solving the team orienteering problem with time windows and mandatory visits by multi-start simulated annealing , 2017, Comput. Ind. Eng..

[48]  John Anderson,et al.  Dynamic heterogeneous team formation for robotic urban search and rescue , 2015, J. Comput. Syst. Sci..

[49]  Hadi S. Aghdasi,et al.  Robust multi-objective multi-humanoid robots task allocation based on novel hybrid metaheuristic algorithm , 2019, Applied Intelligence.

[50]  Zhong Ming,et al.  An improved NSGA-III algorithm for feature selection used in intrusion detection , 2017, Knowl. Based Syst..

[51]  Lothar Thiele,et al.  Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach , 1999, IEEE Trans. Evol. Comput..

[52]  M. Fatih Tasgetiren,et al.  An evolution strategy approach to the team orienteering problem with time windows , 2020, Comput. Ind. Eng..

[53]  Sahar Trigui,et al.  FL-MTSP: a fuzzy logic approach to solve the multi-objective multiple traveling salesman problem for multi-robot systems , 2017, Soft Comput..

[54]  Matthieu Zimmer,et al.  Bootstrapping $Q$ -Learning for Robotics From Neuro-Evolution Results , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[55]  Hoong Chuin Lau,et al.  ADOPT: Combining parameter tuning and Adaptive Operator Ordering for solving a class of Orienteering Problems , 2018, Comput. Ind. Eng..

[56]  Aytac Altan,et al.  Model predictive control of three-axis gimbal system mounted on UAV for real-time target tracking under external disturbances , 2020 .

[57]  Paul W. H. Chung,et al.  A Heuristic Distributed Task Allocation Method for Multivehicle Multitask Problems and Its Application to Search and Rescue Scenario , 2016, IEEE Transactions on Cybernetics.

[58]  Ljiljana Trajkovic,et al.  Intelligent Deflection Routing in Buffer-Less Networks , 2015, IEEE Transactions on Cybernetics.

[59]  Chao Wang,et al.  Simulated Annealing for a Vehicle Routing Problem with Simultaneous Pickup-Delivery and Time Windows , 2013, APMS.

[60]  Sahar Trigui,et al.  Different Approaches to Solve the MRTA Problem , 2018 .

[61]  Shuuji Kajita,et al.  Humanoid robot HRP-2Kai — Improvement of HRP-2 towards disaster response tasks , 2015, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids).

[62]  Jacky Baltes,et al.  A multi-objective evolutionary hyper-heuristic algorithm for team-orienteering problem with time windows regarding rescue applications , 2019, The Knowledge Engineering Review.

[63]  Yichuan Jiang,et al.  A Survey of Task Allocation and Load Balancing in Distributed Systems , 2016, IEEE Transactions on Parallel and Distributed Systems.

[64]  Andrej Babinec,et al.  Modelling of Mechanical and Mechatronic Systems MMaMS 2014 Path planning with modified A star algorithm for a mobile robot , 2014 .

[65]  Gerald Schaefer,et al.  Distributed Task Rescheduling With Time Constraints for the Optimization of Total Task Allocations in a Multirobot System , 2018, IEEE Transactions on Cybernetics.

[66]  Alberto Santini An adaptive large neighbourhood search algorithm for the orienteering problem , 2019, Expert Syst. Appl..

[67]  Hoong Chuin Lau,et al.  Orienteering Problem: A survey of recent variants, solution approaches and applications , 2016, Eur. J. Oper. Res..

[68]  T. Bektaş The multiple traveling salesman problem: an overview of formulations and solution procedures , 2006 .

[69]  Eckart Zitzler,et al.  Evolutionary algorithms for multiobjective optimization: methods and applications , 1999 .

[70]  Hadi S. Aghdasi,et al.  Mechatronic Design of ARC Humanoid Robot Open Platform: First Fully 3D Printed Kid-Sized Robot , 2020, Int. J. Humanoid Robotics.

[71]  Marius M. Solomon,et al.  On the worst-case performance of some heuristics for the vehicle routing and scheduling problem with time window constraints , 1986, Networks.

[72]  Martin W. P. Savelsbergh,et al.  Local search in routing problems with time windows , 1984 .

[73]  Qingfu Zhang,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 1 RM-MEDA: A Regularity Model-Based Multiobjective Estimation of , 2022 .

[74]  Jacky Baltes,et al.  Novel lightweight odometric learning method for humanoid robot localization , 2018, Mechatronics.

[75]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[76]  Mohammad Bagher Menhaj,et al.  Training feedforward networks with the Marquardt algorithm , 1994, IEEE Trans. Neural Networks.

[77]  Roberto Montemanni,et al.  An ant colony system for team orienteering problems with time windows , 2023, 2305.07305.

[78]  Cliff T. Ragsdale,et al.  A new approach to solving the multiple traveling salesperson problem using genetic algorithms , 2006, Eur. J. Oper. Res..

[79]  Eric Bonabeau,et al.  Cooperative transport by ants and robots , 2000, Robotics Auton. Syst..

[80]  Aldy Gunawan,et al.  State-of-the-Art Solution Techniques for OPTW and TOPTW , 2019 .

[81]  Michel Gendreau,et al.  The orienteering problem with stochastic travel and service times , 2011, Ann. Oper. Res..

[82]  Gheorghe Mogan,et al.  Neural networks based reinforcement learning for mobile robots obstacle avoidance , 2016, Expert Syst. Appl..

[83]  Andrew L. Johnson,et al.  An augmented large neighborhood search method for solving the team orienteering problem , 2013, Expert Syst. Appl..

[84]  R. Vohra,et al.  The Orienteering Problem , 1987 .

[85]  MengChu Zhou,et al.  Multiple-Solution Optimization Strategy for Multirobot Task Allocation , 2020, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[86]  Andreas Kroll,et al.  Memetic algorithms for optimal task allocation in multi-robot systems for inspection problems with cooperative tasks , 2015, Soft Comput..

[87]  Yu Yang,et al.  Task allocation optimization in collaborative customized product development based on double-population adaptive genetic algorithm , 2016, J. Intell. Manuf..

[88]  Jacky Baltes,et al.  A comprehensive survey on humanoid robot development , 2019, The Knowledge Engineering Review.

[89]  Marco Laumanns,et al.  Performance assessment of multiobjective optimizers: an analysis and review , 2003, IEEE Trans. Evol. Comput..

[90]  Stelios D. Bekiros,et al.  Digital currency forecasting with chaotic meta-heuristic bio-inspired signal processing techniques , 2019, Chaos, Solitons & Fractals.