A Deep Reinforcement Learning Based D2D Relay Selection and Power Level Allocation in mmWave Vehicular Networks

5G millimeter wave (mmWave) communication is an efficient technique for low delay and high data rate transmission in vehicular networks. Due to the high path loss in 5G mmWave band, 5G base stations need to be densely deployed, which may result in great deployment expenditures. In this letter, we jointly consider a relay selection problem in multihop 5G mmWave device to device (D2D) transmissions and a power level allocation problem of mmWave D2D links. We propose a centralized hierarchical deep reinforcement learning based method to find an optimal solution for the problem. The proposed method does not rely on the information of links, and it tries to find an optimal solution based on the information of vehicles. Simulation results show that the convergence of the proposed method, and the transmission delay performance of proposed method is better than a link-quality-prediction based method, and close to a link-quality-known method.

[1]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[2]  Koichi Ogawa,et al.  Path-Loss Prediction Models for Intervehicle Communication at 60 GHz , 2008, IEEE Transactions on Vehicular Technology.

[3]  Zhu Han,et al.  Exploiting Device-to-Device Communications to Enhance Spatial Reuse for Popular Content Downloading in Directional mmWave Small Cells , 2015, IEEE Transactions on Vehicular Technology.

[4]  Nan Zhao,et al.  Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach , 2018, IEEE Transactions on Vehicular Technology.

[5]  Mehdi Bennis,et al.  Millimeter-Wave V2V Communications: Distributed Association and Beam Alignment , 2016, IEEE Journal on Selected Areas in Communications.

[6]  Carlo Fischione,et al.  Millimeter Wave Cellular Networks: A MAC Layer Perspective , 2015, IEEE Transactions on Communications.

[7]  Tao Chen,et al.  Resource Allocation and Interference Management for Opportunistic Relaying in Integrated mmWave/sub-6 GHz 5G Networks , 2017, IEEE Communications Magazine.

[8]  Robert W. Heath,et al.  MmWave Beam Prediction with Situational Awareness: A Machine Learning Approach , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[9]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[10]  Theodore S. Rappaport,et al.  Millimeter-Wave Cellular Wireless Networks: Potentials and Challenges , 2014, Proceedings of the IEEE.

[11]  Matti Latva-aho,et al.  On the Joint Impact of Beamwidth and Orientation Error on Throughput in Directional Wireless Poisson Networks , 2013, IEEE Transactions on Wireless Communications.

[12]  Robert W. Heath,et al.  Millimeter-Wave Vehicular Communication to Support Massive Automotive Sensing , 2016, IEEE Communications Magazine.