Intelligent Resource Slicing for eMBB and URLLC Coexistence in 5G and Beyond: A Deep Reinforcement Learning Based Approach

In this paper, we study the resource slicing problem in a dynamic multiplexing scenario of two distinct 5G services, namely Ultra-Reliable Low Latency Communications (URLLC) and enhanced Mobile BroadBand (eMBB). While eMBB services focus on high data rates, URLLC is very strict in terms of latency and reliability. In view of this, the resource slicing problem is formulated as an optimization problem that aims at maximizing the eMBB data rate subject to a URLLC reliability constraint, while considering the variance of the eMBB data rate to reduce the impact of immediately scheduled URLLC traffic on the eMBB reliability. To solve the formulated problem, an optimization-aided Deep Reinforcement Learning (DRL) based framework is proposed, including: 1) eMBB resource allocation phase, and 2) URLLC scheduling phase. In the first phase, the optimization problem is decomposed into three subproblems and then each subproblem is transformed into a convex form to obtain an approximate resource allocation solution. In the second phase, a DRL-based algorithm is proposed to intelligently distribute the incoming URLLC traffic among eMBB users. Simulation results show that our proposed approach can satisfy the stringent URLLC reliability while keeping the eMBB reliability higher than 90%.

[1]  Xianzhong Xie,et al.  Learning-Based Energy-Efficient Resource Management by Heterogeneous RF/VLC for Ultra-Reliable Low-Latency Industrial IoT Networks , 2020, IEEE Transactions on Industrial Informatics.

[2]  Zhu Han,et al.  Coexistence Mechanism Between eMBB and uRLLC in 5G Wireless Networks , 2020, IEEE Transactions on Communications.

[3]  Yan Huang,et al.  A Deep-Reinforcement-Learning-Based Approach to Dynamic eMBB/URLLC Multiplexing in 5G NR , 2020, IEEE Internet of Things Journal.

[4]  Takayuki Nishio,et al.  Extreme URLLC: Vision, Challenges, and Key Enablers , 2020, ArXiv.

[5]  Chunjing Hu,et al.  Optimization of URLLC and eMBB Multiplexing via Deep Reinforcement Learning , 2019, 2019 IEEE/CIC International Conference on Communications Workshops in China (ICCC Workshops).

[6]  Walid Saad,et al.  Model-Free Ultra Reliable Low Latency Communication (URLLC): A Deep Reinforcement Learning Framework , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[7]  Choong Seon Hong,et al.  A matching based coexistence mechanism between eMBB and uRLLC in 5G wireless networks , 2019, SAC.

[8]  Tony Q. S. Quek,et al.  Service Multiplexing and Revenue Maximization in Sliced C-RAN Incorporated With URLLC and Multicast eMBB , 2019, IEEE Journal on Selected Areas in Communications.

[9]  Mehdi Bennis,et al.  eMBB-URLLC Resource Slicing: A Risk-Sensitive Approach , 2019, IEEE Communications Letters.

[10]  Choong Seon Hong,et al.  A Downlink Resource Scheduling Strategy for URLLC Traffic , 2019, 2019 IEEE International Conference on Big Data and Smart Computing (BigComp).

[11]  Choong Seon Hong,et al.  Resource Allocation for Ultra-Reliable and Enhanced Mobile Broadband IoT Applications in Fog Network , 2019, IEEE Transactions on Communications.

[12]  Branka Vucetic,et al.  Optimizing Resource Allocation in the Short Blocklength Regime for Ultra-Reliable and Low-Latency Communications , 2019, IEEE Transactions on Wireless Communications.

[13]  Cheng-Xiang Wang,et al.  Artificial Intelligence to Manage Network Traffic of 5G Wireless Networks , 2018, IEEE Network.

[14]  Sun-Young Rieh,et al.  Seoul, South Korea , 2018, Global Planning Innovations for Urban Sustainability.

[15]  Petar Popovski,et al.  Wireless Access in Ultra-Reliable Low-Latency Communication (URLLC) , 2018, IEEE Transactions on Communications.

[16]  Mehdi Bennis,et al.  URLLC-eMBB Slicing to Support VR Multimodal Perceptions over Wireless Cellular Systems , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[17]  Fei Hu,et al.  Intelligent Spectrum Management Based on Transfer Actor-Critic Learning for Rateless Transmissions in Cognitive Radio Networks , 2018, IEEE Transactions on Mobile Computing.

[18]  Petar Popovski,et al.  Coexistence of URLLC and eMBB Services in the C-RAN Uplink: An Information-Theoretic Study , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[19]  Mehdi Bennis,et al.  Ultra-Reliable and Low-Latency Vehicular Transmission: An Extreme Value Theory Approach , 2018, IEEE Communications Letters.

[20]  Petar Popovski,et al.  5G Wireless Network Slicing for eMBB, URLLC, and mMTC: A Communication-Theoretic View , 2018, IEEE Access.

[21]  Xianbin Wang,et al.  A Latency and Reliability Guaranteed Resource Allocation Scheme for LTE V2V Communication Systems , 2018, IEEE Transactions on Wireless Communications.

[22]  H. Vincent Poor,et al.  Ultrareliable and Low-Latency Wireless Communication: Tail, Risk, and Scale , 2018, Proceedings of the IEEE.

[23]  Gustavo de Veciana,et al.  Joint Scheduling of URLLC and eMBB Traffic in 5G Wireless Networks , 2017, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[24]  Klaus I. Pedersen,et al.  Punctured Scheduling for Critical Low Latency Data on a Shared Channel with Mobile Broadband , 2017, 2017 IEEE 86th Vehicular Technology Conference (VTC-Fall).

[25]  Byonghyo Shim,et al.  Ultra-Reliable and Low-Latency Communications in 5G Downlink: Physical Layer Aspects , 2017, IEEE Wireless Communications.

[26]  Zhi-Quan Luo,et al.  A Unified Algorithmic Framework for Block-Structured Optimization Involving Big Data: With applications in machine learning and signal processing , 2015, IEEE Signal Processing Magazine.

[27]  Klaus Obermayer,et al.  Risk-Sensitive Reinforcement Learning , 2013, Neural Computation.

[28]  Wotao Yin,et al.  A Block Coordinate Descent Method for Regularized Multiconvex Optimization with Applications to Nonnegative Tensor Factorization and Completion , 2013, SIAM J. Imaging Sci..

[29]  Xianfu Chen,et al.  TACT: A Transfer Actor-Critic Learning Framework for Energy Saving in Cellular Radio Access Networks , 2012, IEEE Transactions on Wireless Communications.

[30]  S. Verdú,et al.  Channel Coding Rate in the Finite Blocklength Regime , 2010, IEEE Transactions on Information Theory.

[31]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[32]  W. Sharpe,et al.  Mean-Variance Analysis in Portfolio Choice and Capital Markets , 1987 .

[33]  W. Hager,et al.  and s , 2019, Shallow Water Hydraulics.

[34]  Xianzhong Xie,et al.  Intelligent Resource Management Based on Reinforcement Learning for Ultra-Reliable and Low-Latency IoV Communication Networks , 2019, IEEE Transactions on Vehicular Technology.

[35]  Inbal Talgam-Cohen,et al.  Oblivious Rounding and the Integrality Gap , 2016, APPROX-RANDOM.

[36]  Stephen W. Carden,et al.  An Introduction to Reinforcement Learning , 2013 .