Reinforcement Learning-Based Intelligent Resource Allocation for Integrated VLCP Systems

In this letter, an intelligent resource allocation framework based on model-free reinforcement learning (RL) is first presented for multi-user integrated visible light communication and positioning (VLCP) systems, in order to maximize the sum rate of users while guaranteeing the users’ minimum data rates and positioning accuracy constraints. The learning framework can learn the optimal policy under unknown environment’s dynamics and the continuous-valued space, and a reward function is proposed to take into account the strict communication and positioning constraints. Moreover, a modified experience replay actor–critic (MERAC) RL approach is proposed to improve the learning efficiency and convergence speed, which efficiently collects the reliable experience and utilizes the most useful knowledge from the memory. Numerical results show that the MERAC approach can effectively learn to satisfy the strict constraints and achieve the fast convergence speed.

[1]  Rajendran Parthiban,et al.  LED Based Indoor Visible Light Communications: State of the Art , 2015, IEEE Communications Surveys & Tutorials.

[2]  Liang Yin,et al.  Performance Evaluation of Non-Orthogonal Multiple Access in Visible Light Communication , 2016, IEEE Transactions on Communications.

[3]  Sinan Gezici,et al.  Localization via Visible Light Systems , 2018, Proceedings of the IEEE.

[4]  Richard S. Sutton,et al.  Introduction to Reinforcement Learning , 1998 .

[5]  Mihaela van der Schaar,et al.  Fast Reinforcement Learning for Energy-Efficient Wireless Communication , 2010, IEEE Transactions on Signal Processing.

[6]  Pengfei Du,et al.  On the Performance of MIMO-NOMA-Based Visible Light Communication Systems , 2018, IEEE Photonics Technology Letters.

[7]  Sinan Gezici,et al.  Optimal and Robust Power Allocation for Visible Light Positioning Systems Under Illumination Constraints , 2018, IEEE Transactions on Communications.

[8]  Xianfu Chen,et al.  TACT: A Transfer Actor-Critic Learning Framework for Energy Saving in Cellular Radio Access Networks , 2012, IEEE Transactions on Wireless Communications.

[9]  Shlomi Arnon,et al.  Multiple Access Resource Allocation in Visible Light Communication Systems , 2014, Journal of Lightwave Technology.

[10]  Zhiyong Du,et al.  Context-Aware Indoor VLC/RF Heterogeneous Network Selection: Reinforcement Learning With Knowledge Transfer , 2018, IEEE Access.

[11]  Mohsen Kavehrad,et al.  Indoor positioning with OFDM Visible Light Communications , 2016, 2016 13th IEEE Annual Consumer Communications & Networking Conference (CCNC).

[12]  Harald Haas,et al.  Joint User Association and Power Allocation for Cell-Free Visible Light Communication Networks , 2018, IEEE Journal on Selected Areas in Communications.

[13]  Changyuan Yu,et al.  Accuracy analysis and improvement of visible light positioning based on VLC system using orthogonal frequency division multiple access , 2017 .

[14]  S. Zhang,et al.  Demonstration of a Quasi-Gapless Integrated Visible Light Communication and Positioning System , 2018, IEEE Photonics Technology Letters.