Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach
暂无分享,去创建一个
Fang He | Kecheng Zhang | Meng Li | Xi Lin | Zhengchao Zhang | Xi Lin
[1] Yinhai Wang,et al. Multistep speed prediction on traffic networks: A deep learning approach considering spatio-temporal dependencies , 2019, Transportation Research Part C: Emerging Technologies.
[2] Wouter Kool,et al. Attention Solves Your TSP, Approximately , 2018 .
[3] Max Welling,et al. Attention Solves Your TSP , 2018, ArXiv.
[4] Lawrence V. Snyder,et al. Reinforcement Learning for Solving the Vehicle Routing Problem , 2018, NeurIPS.
[5] Marc Peter Deisenroth,et al. Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.
[6] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.
[7] Elias Boutros Khalil,et al. Learning Combinatorial Optimization Algorithms over Graphs , 2017, NIPS.
[8] Sachin Ahuja,et al. Machine learning and its applications: A review , 2017, 2017 International Conference on Big Data Analytics and Computational Intelligence (ICBDAC).
[9] Samy Bengio,et al. Neural Combinatorial Optimization with Reinforcement Learning , 2016, ICLR.
[10] Le Song,et al. Discriminative Embeddings of Latent Variable Models for Structured Data , 2016, ICML.
[11] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.
[12] Navdeep Jaitly,et al. Pointer Networks , 2015, NIPS.
[13] Marc G. Bellemare,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] Christian Szegedy,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.
[15] Tom Van Woensel,et al. The time-dependent vehicle routing problem with soft time windows and stochastic travel times , 2014 .
[16] Guy Lever,et al. Deterministic Policy Gradient Algorithms , 2014, ICML.
[17] Ramasamy Panneerselvam,et al. A Survey on the Vehicle Routing Problem and Its Variants , 2012 .
[18] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .
[19] Xu-ping Wang,et al. Genetic algorithm for vehicle routing problem with time windows and a limited number of vehicles , 2008, 2008 International Conference on Management Science and Engineering 15th Annual Conference Proceedings.
[20] T. Ibaraki,et al. An iterated local search algorithm for the vehicle routing problem with convex time penalty functions , 2008, Discret. Appl. Math..
[21] Beatrice M. Ombuki-Berman,et al. Multi-Objective Genetic Algorithms for Vehicle Routing Problem with Time Windows , 2006, Applied Intelligence.
[22] Andrew Lim,et al. A smoothed dynamic tabu search embedded GRASP for m-VRPTW , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.
[23] Hoong Chuin Lau,et al. Vehicle routing problem with time windows and a limited number of vehicles , 2003, Eur. J. Oper. Res..
[24] Lex Weaver,et al. The Optimal Reward Baseline for Gradient-Based Reinforcement Learning , 2001, UAI.
[25] Helena Ramalhinho Dias Lourenço,et al. Iterated Local Search , 2001, Handbook of Metaheuristics.
[26] Sushil J. Louis,et al. Multiple vehicle routing with time windows using genetic algorithms , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).
[27] H. V. Hoof,et al. UvA-DARE ( Digital Academic Repository ) Attention , learn to solve routing problems ! , 2019 .
[28] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[29] Ronald J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[30] Paolo Toth,et al. The Vehicle Routing Problem , 2002, SIAM monographs on discrete mathematics and applications.