Deep Reinforcement Learning Based Dynamic Resource Allocation in 5G Ultra-Dense Networks

The rapid development of Internet of things (IoT) technology has promoted the densification of network infrastructure. Ultra-dense networks (UDN) will become a key technology in 5G networks. We investigated the dynamic resource allocation problem over 5G UDN. We considered the energy efficiency (EE) and spectral efficiency (SE) of the network. Therefore, the resource allocation problem at different moments was expressed as a joint optimization problem. Considering the dynamic nature of the environment, the EE and SE were dynamically weighted. In order to guarantee the long-term performance of UDN system, the joint optimization problem was described as a markov decision process (MDP). In view of the fact that the densification of the network makes the space explosion of MDP and makes it difficult to solve by traditional methods, the dueling deep Q network (Dueling DQN) method was proposed. Simulation results showed that compared with traditional Q-learning and DQN, this algorithm has obvious performance improvement.

[1]  Xin Chen,et al.  MDP-Based Network Selection with Reward Optimization in HetNets , 2018 .

[2]  Victor C. M. Leung,et al.  Resource Allocation for Ultra-Dense Networks: A Survey, Some Research Issues and Challenges , 2019, IEEE Communications Surveys & Tutorials.

[3]  Tie Qiu,et al.  Robustness Optimization Scheme With Multi-Population Co-Evolution for Scale-Free Wireless Sensor Networks , 2019, IEEE/ACM Transactions on Networking.

[4]  Ender Ayanoglu,et al.  Energy-spectral efficiency tradeoff for heterogeneous networks with QoS constraints , 2017, 2017 IEEE International Conference on Communications (ICC).

[5]  Amr M. Youssef,et al.  Ultra-Dense Networks: A Survey , 2016, IEEE Communications Surveys & Tutorials.

[6]  AKHIL GUPTA,et al.  A Survey of 5G Network: Architecture and Emerging Technologies , 2015, IEEE Access.

[7]  Qinghai Yang,et al.  Improved genetic algorithm based intelligent resource allocation in 5G Ultra Dense networks , 2018, 2018 IEEE Wireless Communications and Networking Conference (WCNC).

[8]  Geoffrey Ye Li,et al.  Deep Reinforcement Learning Based Resource Allocation for V2V Communications , 2018, IEEE Transactions on Vehicular Technology.

[9]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[10]  Sanjeev Jain,et al.  Green Communication in Next Generation Cellular Networks: A Survey , 2017, IEEE Access.

[11]  Tao Jiang,et al.  Ultra-Dense HetNets Meet Big Data: Green Frameworks, Techniques, and Approaches , 2017, IEEE Communications Magazine.

[12]  Jiming Chen,et al.  Narrowband Internet of Things: Implementations and Applications , 2017, IEEE Internet of Things Journal.

[13]  Bo Hu,et al.  User-centric ultra-dense networks for 5G: challenges, methodologies, and directions , 2016, IEEE Wireless Communications.

[14]  Zhimin Zeng,et al.  Load-Aware Energy Efficiency Optimization in Dense Small Cell Networks , 2017, IEEE Communications Letters.

[15]  Tiejun Lv,et al.  Deep reinforcement learning based computation offloading and resource allocation for MEC , 2018, 2018 IEEE Wireless Communications and Networking Conference (WCNC).

[16]  Zhu Han,et al.  Resource allocation for non-orthogonal multiple access in heterogeneous networks , 2017, 2017 IEEE International Conference on Communications (ICC).

[17]  Alia Asheralieva,et al.  Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks , 2017, IEEE Transactions on Wireless Communications.

[18]  Zhu Han,et al.  User Scheduling and Resource Allocation in HetNets With Hybrid Energy Supply: An Actor-Critic Reinforcement Learning Approach , 2018, IEEE Transactions on Wireless Communications.

[19]  Tiejun Lv,et al.  Deep Q-Learning Based Dynamic Resource Allocation for Self-Powered Ultra-Dense Networks , 2018, 2018 IEEE International Conference on Communications Workshops (ICC Workshops).

[20]  Tie Qiu,et al.  EABS: An Event-Aware Backpressure Scheduling Scheme for Emergency Internet of Things , 2018, IEEE Transactions on Mobile Computing.