论文信息 - Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach - 字舞流文

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Fang He | Kecheng Zhang | Meng Li | Xi Lin | Zhengchao Zhang | Xi Lin

[1] Yinhai Wang,et al. Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies , 2019, Transportation Research Part C: Emerging Technologies.

[2] Wouter Kool,et al. Attention Solves Your TSP, Approximately , 2018 .

[3] Max Welling,et al. Attention Solves Your TSP , 2018, ArXiv.

[4] Lawrence V. Snyder,et al. Reinforcement Learning for Solving the Vehicle Routing Problem , 2018, NeurIPS.

[5] Marc Peter Deisenroth,et al. Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[6] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[7] Elias Boutros Khalil,et al. Learning Combinatorial Optimization Algorithms over Graphs , 2017, NIPS.

[8] Sachin Ahuja,et al. Machine learning and its applications: A review , 2017, 2017 International Conference on Big Data Analytics and Computational Intelligence (ICBDAC).

[9] Samy Bengio,et al. Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.

[10] Le Song,et al. Discriminative Embeddings of Latent Variable Models for Structured Data , 2016, ICML.

[11] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.

[12] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.

[13] Marc G. Bellemare,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[14] Christian Szegedy,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[15] Tom Van Woensel,et al. The time-dependent vehicle routing problem with soft time windows and stochastic travel times , 2014 .

[16] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.

[17] Ramasamy Panneerselvam,et al. A Survey on the Vehicle Routing Problem and Its Variants , 2012 .

[18] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[19] Xu-ping Wang,et al. Genetic algorithm for vehicle routing problem with time windows and a limited number of vehicles , 2008, 2008 International Conference on Management Science and Engineering 15th Annual Conference Proceedings.

[20] T. Ibaraki,et al. An iterated local search algorithm for the vehicle routing problem with convex time penalty functions , 2008, Discret. Appl. Math..

[21] Beatrice M. Ombuki-Berman,et al. Multi-Objective Genetic Algorithms for Vehicle Routing Problem with Time Windows , 2006, Applied Intelligence.

[22] Andrew Lim,et al. A smoothed dynamic tabu search embedded GRASP for m-VRPTW , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[23] Hoong Chuin Lau,et al. Vehicle routing problem with time windows and a limited number of vehicles , 2003, Eur. J. Oper. Res..

[24] Lex Weaver,et al. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.

[25] Helena Ramalhinho Dias Lourenço,et al. Iterated Local Search , 2001, Handbook of Metaheuristics.

[26] Sushil J. Louis,et al. Multiple vehicle routing with time windows using genetic algorithms , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[27] H. V. Hoof,et al. UvA-DARE ( Digital Academic Repository ) Attention , learn to solve routing problems ! , 2019 .

[28] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[29] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[30] Paolo Toth,et al. The Vehicle Routing Problem , 2002, SIAM monographs on discrete mathematics and applications.