DHL: Deep reinforcement learning-based approach for emergency supply distribution in humanitarian logistics

[1]  Khaled Ben Letaief,et al.  Joint Coordinated Beamforming and Power Splitting Ratio Optimization in MU-MISO SWIPT-Enabled HetNets: A Multi-Agent DDQN-Based Approach , 2022, IEEE Journal on Selected Areas in Communications.

[2]  Xiaofei Wang,et al.  Networking Integrated Cloud–Edge–End in IoT: A Blockchain-Assisted Collective Q-Learning Approach , 2021, IEEE Internet of Things Journal.

[3]  Baiqing Sun,et al.  Multiperiod optimal emergency material allocation considering road network damage and risk under uncertain conditions , 2021, Operational Research.

[4]  Hui Wang,et al.  Deep reinforcement learning-based computation offloading and resource allocation in security-aware mobile edge computing , 2021, Wireless Networks.

[5]  Chunyan Miao,et al.  UAV-Assisted Wireless Energy and Data Transfer With Deep Reinforcement Learning , 2021, IEEE Transactions on Cognitive Communications and Networking.

[6]  Abegaz Mohammed,et al.  Deep Reinforcement Learning for Computation Offloading and Resource Allocation in Blockchain-Based Multi-UAV-Enabled Mobile Edge Computing , 2020, 2020 17th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP).

[7]  Lei Lei,et al.  Resource Allocation Based on Deep Reinforcement Learning in IoT Edge Computing , 2020, IEEE Journal on Selected Areas in Communications.

[8]  Xiang Zhang,et al.  Deep Learning-Based Resource Allocation for 5G Broadband TV Service , 2020, IEEE Transactions on Broadcasting.

[9]  L. Miao,et al.  Rollout algorithms for resource allocation in humanitarian logistics , 2019, IISE Trans..

[10]  Dusit Niyato,et al.  Deep Reinforcement Learning for Mobile 5G and Beyond: Fundamentals, Applications, and Challenges , 2019, IEEE Vehicular Technology Magazine.

[11]  Yu Zhang,et al.  Intelligent Cloud Resource Management with Deep Reinforcement Learning , 2018, IEEE Cloud Computing.

[12]  Fan Zhang,et al.  A Kind of Joint Routing and Resource Allocation Scheme Based on Prioritized Memories-Deep Q Network for Cognitive Radio Ad Hoc Networks , 2018, Sensors.

[13]  Rong Chen,et al.  A Deep Reinforcement Learning-Based Framework for Dynamic Resource Allocation in Multibeam Satellite Systems , 2018, IEEE Communications Letters.

[14]  Lixin Miao,et al.  Novel methods for resource allocation in humanitarian logistics considering human suffering , 2018, Comput. Ind. Eng..

[15]  Yan Song,et al.  Supply allocation: bi-level programming and differential evolution algorithm for Natural Disaster Relief , 2017, Cluster Computing.

[16]  M. Deisenroth,et al.  Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.

[17]  Qinru Qiu,et al.  A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[18]  Alfredo Moreno,et al.  Stochastic network models for logistics planning in disaster relief , 2016, Eur. J. Oper. Res..

[19]  Marijan Žura,et al.  Reinforcement learning approach for train rescheduling on a single-track railway , 2016 .

[20]  Yufei Yuan,et al.  Modeling multiple humanitarian objectives in emergency response to large-scale disasters , 2015 .

[21]  João Carlos Souza,et al.  Disaster management: hierarchical structuring criteria for selection and location of temporary shelters , 2015, Natural Hazards.

[22]  Shinya Hanaoka,et al.  An agent-based model for resource allocation during relief distribution , 2014 .

[23]  Stefan Feuerriegel,et al.  Emergency response in natural disaster management: Allocation and scheduling of rescue units , 2014, Eur. J. Oper. Res..

[24]  Marco Wiering,et al.  Reinforcement Learning , 2014, Adaptation, Learning, and Optimization.

[25]  Amit Konar,et al.  A Deterministic Improved Q-Learning for Path Planning of a Mobile Robot , 2013, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[26]  L. V. Wassenhove,et al.  On the appropriate objective function for post‐disaster humanitarian logistics models , 2013 .

[27]  Jiuh-Biing Sheu,et al.  An emergency logistics distribution approach for quick response to urgent relief demand in disasters , 2007 .

[28]  Lihong Li,et al.  PAC model-free reinforcement learning , 2006, ICML.

[29]  Peter Stone,et al.  Policy gradient reinforcement learning for fast quadrupedal locomotion , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[30]  Fritz Gehbauer,et al.  Optimized resource allocation for emergency response after earthquake disasters , 2000 .

[31]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[32]  Ben J. A. Kröse,et al.  Learning from delayed rewards , 1995, Robotics Auton. Syst..

[33]  Gerald Tesauro,et al.  Temporal difference learning and TD-Gammon , 1995, CACM.

[34]  Jingyan Jiang,et al.  Reinforcement learning approach for resource allocation in humanitarian logistics , 2021, Expert Syst. Appl..

[35]  A. Leiras,et al.  The Deprivation Cost in Humanitarian Logistics: A Systematic Review , 2021, Industrial Engineering and Operations Management.

[36]  Xuemin Shen,et al.  Optimizing Federated Learning in Distributed Industrial IoT: A Multi-Agent Approach , 2021, IEEE Journal on Selected Areas in Communications.

[37]  Nan Zhao,et al.  Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach , 2018, IEEE Transactions on Vehicular Technology.

[38]  Peter Vrancx,et al.  Reinforcement Learning: State-of-the-Art , 2012 .

[39]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.