Combinatorial multi-armed bandit algorithms for real-time energy trading in green C-RAN

Without a proper observation of the energy demand of the receiving terminals, the retailer may be obliged to purchase additional energy from the real-time market and may take the risk of losing profit. This paper proposes two combinatorial multi-armed bandit (CMAB) strategies in green cloud radio access network (C-RAN) with simultaneous wireless information and power transfer under the assumption that no initial knowledge of forthcoming energy demand and renewable energy supply are known to the central processor. The aim of the proposed strategies is to find the set of optimal sizes of the energy packages to be purchased from the day-ahead market by observing the instantaneous energy demand and learning from the behaviour of cooperative energy trading, so that the total cost of the retailer can be minimized. Two novel iterative algorithms, namely, ForCMAB energy trading and RevCMAB energy trading are introduced to search for the optimal set of energy packages in ascending and descending order of package sizes, respectively. Simulation results indicate that CMAB approach in our proposed strategies offers the significant advantage in terms of reducing overall energy cost of the retailer, as compared to other schemes without learning-based optimization.

[1]  Derrick Wing Kwan Ng,et al.  Resource allocation for coordinated multipoint networks with wireless information and power transfer , 2014, 2014 IEEE Global Communications Conference.

[2]  Wei Yu,et al.  Sparse Beamforming and User-Centric Clustering for Downlink Cloud Radio Access Network , 2014, IEEE Access.

[3]  Zhi-Quan Luo,et al.  Joint Base Station Clustering and Beamformer Design for Partial Coordinated Transmission in Heterogeneous Networks , 2012, IEEE Journal on Selected Areas in Communications.

[4]  Zhongding Lei,et al.  Coordinated Multipoint Transmission with Limited Backhaul Data Transfer , 2013, IEEE Transactions on Wireless Communications.

[5]  Gerhard Fettweis,et al.  The global footprint of mobile communications: The ecological and economic perspective , 2011, IEEE Communications Magazine.

[6]  Tuan Anh Le,et al.  A decentralized downlink beamforming algorithm for multicell processing , 2012, 2012 IEEE Global Communications Conference (GLOBECOM).

[7]  Mohammad Reza Nakhai,et al.  Real-time power balancing in green CoMP network with wireless information and energy transfer , 2015, 2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[8]  Mohammad Reza Nakhai,et al.  Real-time energy trading with grid in green cloud-RAN , 2015, 2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[9]  Derrick Wing Kwan Ng,et al.  Energy-efficient resource allocation in multi-cell OFDMA systems with limited backhaul capacity , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[10]  Wei Chen,et al.  Combinatorial multi-armed bandit: general framework, results and applications , 2013, ICML 2013.

[11]  Derrick Wing Kwan Ng,et al.  Energy-Efficient Resource Allocation in Multi-Cell OFDMA Systems with Limited Backhaul Capacity , 2012, IEEE Trans. Wirel. Commun..

[12]  Jie Xu,et al.  Cooperative Energy Trading in CoMP Systems Powered by Smart Grids , 2016, IEEE Transactions on Vehicular Technology.

[13]  Tuan Anh Le,et al.  Power-efficient downlink transmission in multicell networks with limited wireless backhaul , 2011, IEEE Wireless Communications.

[14]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[15]  Deniz Gündüz,et al.  Learning-based optimization of cache content in a small cell base station , 2014, 2014 IEEE International Conference on Communications (ICC).