Reinforcement Learning for Automated Energy Efficient Mobile Network Performance Tuning

Modern mobile networks are increasingly complex from a resource management perspective, with diverse combinations of software, infrastructure elements and services that need to be configured and tuned for correct and efficient operation. It is well accepted in the communications community that appropriately dimensioned, efficient and reliable configurations of systems like 5G or indeed its predecessor 4G is a massive technical challenge. One promising avenue is the application of machine learning methods to apply a data-driven and continuous learning approach to automated system performance tuning. We demonstrate the effectiveness of policy-gradient reinforcement learning as a way to learn and apply complex interleaving patterns of radio resource block usage in 4G and 5G, in order to automate the reduction of cell edge interference. We show that our method can increase overall spectral efficiency up to 25% and increase the overall system energy efficiency up to 50% in very challenging scenarios by learning how to do more with less system resources. We also introduce a flexible phased and continuous learning approach that can be used to train a bootstrap model in a simulated environment after which the model is transferred to a live system for continuous contextual learning.

[1]  Muhammad Ali Imran,et al.  Challenges in 5G: how to empower SON with big data for enabling 5G , 2014, IEEE Network.

[2]  Daniel F. Perez-Ramirez,et al.  Learning Combinatorial Optimization on Graphs: A Survey With Applications to Networking , 2020, IEEE Access.

[3]  Stefan Parkvall,et al.  NR - The New 5G Radio-Access Technology , 2017, 2018 IEEE 87th Vehicular Technology Conference (VTC Spring).

[4]  Mandayam A. L. Thathachar,et al.  Local and Global Optimization Algorithms for Generalized Learning Automata , 1995, Neural Computation.

[5]  Ekram Hossain,et al.  Fractional frequency reuse for interference management in LTE-advanced hetnets , 2013, IEEE Wireless Communications.

[6]  Jan Peters,et al.  Reinforcement learning in robotics: A survey , 2013, Int. J. Robotics Res..

[7]  Andreas Ermedahl,et al.  Data driven selection of DRX for energy efficient 5G RAN , 2017, 2017 13th International Conference on Network and Service Management (CNSM).

[8]  Giuseppe Piro,et al.  Downlink Packet Scheduling in LTE Cellular Networks: Key Design Issues and a Survey , 2013, IEEE Communications Surveys & Tutorials.

[9]  Yuvraj Singh,et al.  Comparison of Okumura, Hata and COST-231 Models on the Basis of Path Loss and Signal Strength , 2012 .

[10]  Jonathan Loo,et al.  Recent Advances in Radio Resource Management for Heterogeneous LTE/LTE-A Networks , 2014, IEEE Communications Surveys & Tutorials.

[11]  Ismail Güvenç,et al.  Learning Based Frequency- and Time-Domain Inter-Cell Interference Coordination in HetNets , 2014, IEEE Transactions on Vehicular Technology.

[12]  Ahmed Wasif Reza,et al.  Frequency Reuse for 4G Technologies: A Survey , 2015 .

[13]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[14]  Z. Popovic,et al.  Statistics for Ratios of Rayleigh, Rician, Nakagami-, and Weibull Distributed Random Variables , 2013 .

[15]  Sergio Verdú,et al.  Spectral efficiency in the wideband regime , 2002, IEEE Trans. Inf. Theory.

[16]  Zwi Altman,et al.  A cooperative Reinforcement Learning approach for Inter-Cell Interference Coordination in OFDMA cellular networks , 2010, 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks.

[17]  Juan Montojo,et al.  Carrier Aggregation , 2011, LTE - The UMTS Long Term Evolution.

[18]  Athanasios V. Vasilakos,et al.  Time-domain ICIC and optimized designs for 5G and beyond: a survey , 2018, Science China Information Sciences.

[19]  Zesong Fei,et al.  An Almost Blank Subframe Allocation Algorithm for 5G New Radio in Unlicensed Bands , 2020, 2020 IEEE/CIC International Conference on Communications in China (ICCC).

[20]  Satoshi Nagata,et al.  Coordinated multipoint transmission and reception in LTE-advanced: deployment scenarios and operational challenges , 2012, IEEE Communications Magazine.

[21]  Oriol Sallent,et al.  A Novel Framework for Dynamic Spectrum Management in MultiCell OFDMA Networks Based on Reinforcement Learning , 2009, 2009 IEEE Wireless Communications and Networking Conference.

[22]  2021 IEEE Wireless Communications and Networking Conference (WCNC) , 2021 .

[23]  Oriol Sallent,et al.  An Application of Reinforcement Learning for Efficient Spectrum Usage in Next-Generation Mobile Cellular Networks , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[24]  Rebecca Steinert,et al.  Distributed dynamic load balancing with applications in radio access networks , 2018, Int. J. Netw. Manag..

[25]  Lorenza Giupponi,et al.  From 4G to 5G: Self-organized Network Management meets Machine Learning , 2017, Comput. Commun..

[26]  Daniel Gillblad,et al.  Autonomous Load Balancing of Heterogeneous Networks , 2015, 2015 IEEE 81st Vehicular Technology Conference (VTC Spring).

[27]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[28]  Klaus Pohl,et al.  Online Reinforcement Learning for Self-adaptive Information Systems , 2020, CAiSE.

[29]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[30]  Supratim Deb,et al.  Algorithms for Enhanced Inter-Cell Interference Coordination (eICIC) in LTE HetNets , 2013, IEEE/ACM Transactions on Networking.

[31]  Matías Toril,et al.  Estimating Spectral Efficiency Curves from Connection Traces in a Live LTE Network , 2017, Mob. Inf. Syst..

[32]  Tony Q. S. Quek,et al.  Enhanced intercell interference coordination challenges in heterogeneous networks , 2011, IEEE Wireless Communications.

[33]  Erik Aumayr,et al.  Remote Electrical Tilt Optimization via Safe Reinforcement Learning , 2020, 2021 IEEE Wireless Communications and Networking Conference (WCNC).

[34]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[35]  Jeffrey G. Andrews,et al.  Online Antenna Tuning in Heterogeneous Cellular Networks With Deep Reinforcement Learning , 2019, IEEE Transactions on Cognitive Communications and Networking.

[36]  Demis Hassabis,et al.  Mastering the game of Go without human knowledge , 2017, Nature.

[37]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[38]  Craig Boutilier,et al.  Data center cooling using model-predictive control , 2018, NeurIPS.

[39]  Albert Cabellos-Aparicio,et al.  Deep Reinforcement Learning meets Graph Neural Networks: exploring a routing optimization use case. , 2020 .

[40]  Henning Sanneck,et al.  LTE Self-Organising Networks (SON): Network Management Automation for Operational Efficiency , 2012 .