Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

[1]  Yinhai Wang,et al.  Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies , 2019, Transportation Research Part C: Emerging Technologies.

[2]  Wouter Kool,et al.  Attention Solves Your TSP, Approximately , 2018 .

[3]  Max Welling,et al.  Attention Solves Your TSP , 2018, ArXiv.

[4]  Lawrence V. Snyder,et al.  Reinforcement Learning for Solving the Vehicle Routing Problem , 2018, NeurIPS.

[5]  Marc Peter Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[6]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[7]  Elias Boutros Khalil,et al.  Learning Combinatorial Optimization Algorithms over Graphs , 2017, NIPS.

[8]  Sachin Ahuja,et al.  Machine learning and its applications: A review , 2017, 2017 International Conference on Big Data Analytics and Computational Intelligence (ICBDAC).

[9]  Samy Bengio,et al.  Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.

[10]  Le Song,et al.  Discriminative Embeddings of Latent Variable Models for Structured Data , 2016, ICML.

[11]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[12]  Navdeep Jaitly,et al.  Pointer Networks , 2015, NIPS.

[13]  Marc G. Bellemare,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[14]  Christian Szegedy,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[15]  Tom Van Woensel,et al.  The time-dependent vehicle routing problem with soft time windows and stochastic travel times , 2014 .

[16]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[17]  Ramasamy Panneerselvam,et al.  A Survey on the Vehicle Routing Problem and Its Variants , 2012 .

[18]  T. Urbanik,et al.  Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[19]  Xu-ping Wang,et al.  Genetic algorithm for vehicle routing problem with time windows and a limited number of vehicles , 2008, 2008 International Conference on Management Science and Engineering 15th Annual Conference Proceedings.

[20]  T. Ibaraki,et al.  An iterated local search algorithm for the vehicle routing problem with convex time penalty functions , 2008, Discret. Appl. Math..

[21]  Beatrice M. Ombuki-Berman,et al.  Multi-Objective Genetic Algorithms for Vehicle Routing Problem with Time Windows , 2006, Applied Intelligence.

[22]  Andrew Lim,et al.  A smoothed dynamic tabu search embedded GRASP for m-VRPTW , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[23]  Hoong Chuin Lau,et al.  Vehicle routing problem with time windows and a limited number of vehicles , 2003, Eur. J. Oper. Res..

[24]  Lex Weaver,et al.  The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.

[25]  Helena Ramalhinho Dias Lourenço,et al.  Iterated Local Search , 2001, Handbook of Metaheuristics.

[26]  Sushil J. Louis,et al.  Multiple vehicle routing with time windows using genetic algorithms , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[27]  H. V. Hoof,et al.  UvA-DARE ( Digital Academic Repository ) Attention , learn to solve routing problems ! , 2019 .

[28]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[29]  Ronald J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[30]  Paolo Toth,et al.  The Vehicle Routing Problem , 2002, SIAM monographs on discrete mathematics and applications.