Dynamic Control of Functional Splits for Energy Harvesting Virtual Small Cells: a Distributed Reinforcement Learning Approach

In this paper, we propose a network scenario where the baseband processes of the virtual small cells powered solely by energy harvesters and batteries can be opportunistically executed in a grid-connected edge computing server, co-located at the macro base station site. We state the corresponding energy minimization problem and propose multi-agent Reinforcement Learning (RL) to solve it. Distributed Fuzzy Q-Learning and Q-Learning on-line algorithms are tailored for our purposes. Coordination among the multiple agents is achieved by broadcasting system level information to the independent learners. The evaluation of the network performance confirms that coordination via broadcasting may achieve higher system level gains than un-coordinated solutions and cumulative rewards closer to the off-line bounds. Finally, our analysis permits to evaluate the benefits of continuous state/action representation for the learning algorithms in terms of faster convergence, higher cumulative reward and more adaptation to changing environments.

[1]  Abdallah Shami,et al.  NFV: state of the art, challenges, and implementation in next generation mobile networks (vEPC) , 2014, IEEE Network.

[2]  Oriol Sallent,et al.  Knowledge-based 5G Radio Access Network planning and optimization , 2016, 2016 International Symposium on Wireless Communication Systems (ISWCS).

[3]  Shaolei Ren,et al.  Online Learning for Offloading and Autoscaling in Renewable-Powered Mobile Edge Computing , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[4]  Paolo Dini,et al.  Optimal direct load control of renewable powered small cells: A shortest path approach , 2018, Internet Technol. Lett..

[5]  Rainer Palm,et al.  Observers for Takagi-Sugeno fuzzy systems , 2002, IEEE Trans. Syst. Man Cybern. Part B.

[6]  Jianqiu Li,et al.  A review on the key issues for lithium-ion battery management in electric vehicles , 2013 .

[7]  Zhisheng Niu,et al.  Sleep control for base stations powered by heterogeneous energy sources , 2013, 2013 International Conference on ICT Convergence (ICTC).

[8]  Marco Miozzo,et al.  Layered Learning Radio Resource Management for Energy Harvesting Small Base Stations , 2018, 2018 IEEE 87th Vehicular Technology Conference (VTC Spring).

[9]  Raphael Caire,et al.  Battery-Aware Optimization of Green Small Cells: Sizing and Energy Management , 2018, IEEE Transactions on Green Communications and Networking.

[10]  Marco Miozzo,et al.  Dynamic Functional Split Selection in Energy Harvesting Virtual Small Cells Using Temporal Difference Learning , 2018, 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[11]  P. Y. Glorennec,et al.  Fuzzy Q-learning and dynamical fuzzy Q-learning , 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.

[12]  Marco Miozzo,et al.  Optimal Placement of Baseband Functions for Energy Harvesting Virtual Small Cells , 2018, 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall).

[13]  Michela Meo,et al.  The Impact of Quantization on the Design of Solar Power Systems for Cellular Base Stations , 2018, IEEE Transactions on Green Communications and Networking.

[14]  Holger Claussen,et al.  Towards 1 Gbps/UE in Cellular Systems: Understanding Ultra-Dense Small Cell Deployments , 2015, IEEE Communications Surveys & Tutorials.

[15]  Byrav Ramamurthy,et al.  Network Innovation using OpenFlow: A Survey , 2014, IEEE Communications Surveys & Tutorials.

[16]  Jemin Lee,et al.  Mobile Edge Computing-Enabled Heterogeneous Networks , 2018, IEEE Transactions on Wireless Communications.

[17]  Giuseppe Piro,et al.  HetNets Powered by Renewable Energy Sources: Sustainable Next-Generation Cellular Networks , 2013, IEEE Internet Computing.

[18]  Marco Miozzo,et al.  SolarStat: Modeling photovoltaic sources through stochastic Markov processes , 2013, 2014 IEEE International Energy Conference (ENERGYCON).

[19]  Boriana L. Milenova,et al.  Fuzzy and neural approaches in engineering , 1997 .

[20]  Paolo Dini,et al.  Mobile Traffic Prediction from Raw Data Using LSTM Networks , 2018, 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[21]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[23]  Muhammad Ali Imran,et al.  Flexible power modeling of LTE base stations , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[24]  Lyes Khoukhi,et al.  Fully Distributed Approach for Energy Saving in Heterogeneous Networks , 2016, 2016 8th IFIP International Conference on New Technologies, Mobility and Security (NTMS).

[25]  Abolfazl Mehbodniya,et al.  Online Ski Rental for ON/OFF Scheduling of Energy Harvesting Base Stations , 2016, IEEE Transactions on Wireless Communications.

[26]  Marco Miozzo,et al.  Distributed Q-learning for energy harvesting Heterogeneous Networks , 2015, 2015 IEEE International Conference on Communication Workshop (ICCW).

[27]  Zhisheng Niu,et al.  Base Station Sleeping and Resource Allocation in Renewable Energy Powered Cellular Networks , 2013, IEEE Transactions on Communications.

[28]  Depeng Jin,et al.  Understanding Mobile Traffic Patterns of Large Scale Cellular Towers in Urban Environment , 2015, Internet Measurement Conference.

[29]  Anders S. G. Andrae,et al.  On Global Electricity Usage of Communication Technology: Trends to 2030 , 2015 .