Meta-Reinforcement Learning for Reliable Communication in THz/VLC Wireless VR Networks

In this paper, the problem of enhancing the quality of virtual reality (VR) services is studied for an indoor terahertz (THz)/visible light communication (VLC) wireless network. In the studied model, small base stations (SBSs) transmit high-quality VR images to VR users over THz bands, and light-emitting diodes (LEDs) provide accurate indoor positioning services for these users using VLC. VR users move in real time, their movement patterns change over time according to their applications, and both THz and VLC links can be blocked by the users' bodies. To control the energy consumption of the studied THz/VLC wireless VR network, VLC access points (VAPs) must be selectively turned on so as to ensure accurate and extensive positioning for VR users. Based on the user positions, each SBS must generate the corresponding VR images and establish THz links that are free of body blockage to transmit the VR content. The problem is formulated as an optimization problem whose goal is to maximize the average number of successfully served VR users by selecting the VAPs to be turned on and controlling the user association with SBSs. To solve this problem, a policy gradient-based reinforcement learning (RL) algorithm that adopts a meta-learning approach is proposed. The proposed meta policy gradient (MPG) algorithm enables the trained policy to quickly adapt to new user movement patterns. For VR scenarios with a large number of users, a low-complexity dual method-based MPG (D-MPG) algorithm is proposed to solve the same maximization problem. Simulation results demonstrate that, compared to a baseline trust region policy optimization (TRPO) algorithm, the proposed MPG and D-MPG algorithms yield up to 26.8% and 21.9% improvement in the average number of successfully served users as well as 81.2% and 87.5% gains in the convergence speed, respectively.
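The idea of a meta-learned policy that adapts quickly to a new movement pattern can be illustrated with a toy sketch. The code below is not the paper's MPG or D-MPG algorithm: it is a first-order, MAML-style policy gradient on a hypothetical multi-armed bandit, where each "task" stands in for one user movement pattern, each action stands in for one candidate association choice, and each reward is an assumed probability of successful service. All function names, reward values, and step sizes are illustrative assumptions, and exact expected gradients are used instead of sampled trajectories so that the example is deterministic.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_reward(theta, rewards):
    """Expected reward of the softmax policy with logits theta."""
    pi = softmax(theta)
    return sum(p * r for p, r in zip(pi, rewards))


def policy_gradient(theta, rewards):
    """Exact gradient of the expected reward w.r.t. the softmax logits."""
    pi = softmax(theta)
    baseline = sum(p * r for p, r in zip(pi, rewards))
    return [p * (r - baseline) for p, r in zip(pi, rewards)]


def adapt(theta, rewards, alpha, steps=1):
    """Inner loop: a few policy gradient ascent steps on a single task."""
    for _ in range(steps):
        grad = policy_gradient(theta, rewards)
        theta = [t + alpha * g for t, g in zip(theta, grad)]
    return theta


def meta_train(tasks, n_actions, alpha=0.4, beta=0.5, iterations=200):
    """Outer loop (first-order approximation): move the shared initialization
    along the average gradient evaluated at each task's adapted parameters."""
    theta = [0.0] * n_actions
    for _ in range(iterations):
        meta_grad = [0.0] * n_actions
        for rewards in tasks:
            adapted = adapt(theta, rewards, alpha)
            grad = policy_gradient(adapted, rewards)
            meta_grad = [m + g / len(tasks) for m, g in zip(meta_grad, grad)]
        theta = [t + beta * m for t, m in zip(theta, meta_grad)]
    return theta


# Hypothetical success probabilities: one row per movement pattern,
# one column per action. The last action is never useful in any pattern.
tasks = [
    [0.9, 0.1, 0.1, 0.0],
    [0.1, 0.9, 0.1, 0.0],
    [0.1, 0.1, 0.9, 0.0],
]

meta_theta = meta_train(tasks, n_actions=4)
pattern = tasks[0]
from_meta = expected_reward(adapt(meta_theta, pattern, alpha=0.4), pattern)
from_scratch = expected_reward(adapt([0.0] * 4, pattern, alpha=0.4), pattern)
print(f"one-step adaptation from meta-init: {from_meta:.3f}")
print(f"one-step adaptation from scratch:   {from_scratch:.3f}")
```

In this toy setting the meta-learned initialization has already discounted the action that fails under every pattern, so a single adaptation step reaches a higher expected reward than the same step taken from an uninformed initialization. A full second-order meta-gradient, sampled trajectories, and the THz/VLC system model are all omitted here.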
