Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

In this paper we present Generalized Nested Rollout Policy Adaptation (GNRPA), an extension of the Nested Rollout Policy Adaptation (NRPA) algorithm, and apply it to some instances of the Vehicle Routing Problem (VRP). We report results on the Solomon instance set, a conventional benchmark for the VRP. We show that GNRPA performs better than NRPA on all instances and that, on some instances, it also performs better than the Google OR-Tools module dedicated to the VRP.
