Deep Reinforcement Learning for Trustworthy and Time-Varying Connection Scheduling in a Coupled UAV-Based Femtocaching Architecture

The paper is motivated by the urgent need, imposed by the COVID-19 pandemic, for trustworthy access to secure communication systems with the highest achievable availability and minimum latency. In this regard, we focus on an ultra-dense wireless network consisting of Femto Access Points (FAPs) and Unmanned Aerial Vehicles (UAVs), known as caching nodes, where there are more than one possible caching node to handle user’s request. To efficiently cope with the dynamic topology of wireless networks and time-varying behavior of ground users, our focus is to develop an efficient connection scheduling framework, where ground users are autonomously trained to determine the optimal caching node, i.e., UAV or FAP. Our aim is to minimize users’ access delay by maintaining a trade-off between the energy consumption of UAVs and the occurrence of handovers. To achieve these objectives, we formulate a multi-objective optimization problem and propose the Convolutional Neural Network (CNN) and Q-Network-based Connection Scheduling (CQN-CS) framework. More specifically, to solve the constructed multi-objective connection scheduling problem, a deep Q-Network model is developed as an efficient Reinforcement Learning (RL) approach to train ground users to handle their requests in an optimal and trustworthy fashion within the coupled UAV-based femtocaching network. The effectiveness of the proposed CQN-CS framework is evaluated in terms of the cache-hit ratio, user’s access delay, energy consumption of UAVs, handover, lifetime of the network, and cumulative rewards. Based on the simulation results, the proposed CQN-CS framework illustrates significant performance improvements in companion to Q-learning and Deep Q-Network (DQN) schemes across all the aforementioned aspects.

[1]  Bin Jiang,et al.  Multimedia Data Throughput Maximization in Internet-of-Things System Based on Optimization of Cache-Enabled UAV , 2019, IEEE Internet of Things Journal.

[2]  Victor Talpaert,et al.  Deep Reinforcement Learning for Autonomous Driving: A Survey , 2020, IEEE Transactions on Intelligent Transportation Systems.

[3]  Joost van Oijen,et al.  Machine learning techniques for autonomous agents in military simulations — Multum in parvo , 2017, 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[4]  Catherine M. Burns,et al.  Intelligent Adaptive Systems: An Interaction-Centered Design Perspective , 2014 .

[5]  Mohsen Guizani,et al.  A Comprehensive Review of the COVID-19 Pandemic and the Role of IoT, Drones, AI, Blockchain, and 5G in Managing its Impact , 2020, IEEE Access.

[6]  Konstantinos N. Plataniotis,et al.  Orientation-Matched Multiple Modeling for RSSI-based Indoor Localization via BLE Sensors , 2021, 2020 28th European Signal Processing Conference (EUSIPCO).

[7]  Mostafa Zaman Chowdhury,et al.  Energy-Efficient UAV-to-User Scheduling to Maximize Throughput in Wireless Networks , 2020, IEEE Access.

[8]  Walid Saad,et al.  Regret Based Learning for UAV Assisted LTE-U/WiFi Public Safety Networks , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[9]  Ala' F. Khalifeh,et al.  A survey on femtocell handover management in dense heterogeneous 5G networks , 2020, Telecommunication Systems.

[10]  Derrick Wing Kwan Ng,et al.  Energy-Efficient Resource Allocation for Secure UAV Communication Systems , 2022 .

[11]  Ann Nowé,et al.  Scalarized multi-objective reinforcement learning: Novel design techniques , 2013, 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL).

[12]  Ming Hou,et al.  Intelligent Adaptive Interfaces for the Control of Multiple UAVs , 2007 .

[13]  Hao He,et al.  Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations , 2019, ArXiv.

[14]  Sherali Zeadally,et al.  A Survey of Device-to-Device Communications: Research Issues and Challenges , 2018, IEEE Communications Surveys & Tutorials.

[15]  Konstantinos N. Plataniotis,et al.  Bluetooth Low Energy-based Angle of Arrival Estimation via Switch Antenna Array for Indoor Localization , 2020, 2020 IEEE 23rd International Conference on Information Fusion (FUSION).

[16]  Tarik Taleb,et al.  Energy and Delay Aware Task Assignment Mechanism for UAV-Based IoT Platform , 2019, IEEE Internet of Things Journal.

[17]  Hui Zhao,et al.  Energy-Aware Dynamic Resource Allocation in UAV Assisted Mobile Edge Computing Over Social Internet of Vehicles , 2018, IEEE Access.

[18]  Jie Xu,et al.  Energy Minimization for Wireless Communication With Rotary-Wing UAV , 2018, IEEE Transactions on Wireless Communications.

[19]  Mohamed A. Aref,et al.  Robust Deep Reinforcement Learning for Interference Avoidance in Wideband Spectrum , 2019, 2019 IEEE Cognitive Communications for Aerospace Applications Workshop (CCAAW).

[20]  Kaharudin Dimyati,et al.  Comparative Study of Indoor Propagation Model Below and Above 6 GHz for 5G Wireless Networks , 2019, Electronics.

[21]  Li Qiu,et al.  Popularity-aware caching increases the capacity of wireless networks , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[22]  MengChu Zhou,et al.  Optimizing Operator–Agent Interaction in Intelligent Adaptive Interface Design: A Conceptual Framework , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[23]  Christoffer Moesgaard Albertsen,et al.  Generalizing the first-difference correlated random walk for marine animal movement data , 2018, Scientific Reports.

[24]  Konstantinos N. Plataniotis,et al.  Bluetooth Low Energy-based Angle of Arrival Estimation in Presence of Rayleigh Fading , 2020, 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[25]  Nan Cheng,et al.  Joint Design of Access Point Selection and Path Planning for UAV-Assisted Cellular Networks , 2020, IEEE Internet of Things Journal.

[26]  Alexandros G. Dimakis,et al.  FemtoCaching: Wireless Content Delivery Through Distributed Caching Helpers , 2013, IEEE Transactions on Information Theory.

[27]  Qingqing Wu,et al.  Accessing From the Sky: A Tutorial on UAV Communications for 5G and Beyond , 2019, Proceedings of the IEEE.

[28]  Changchuan Yin,et al.  Optimized Trajectory Design in UAV Based Cellular Networks for 3D Users: A Double Q-Learning Approach , 2019, J. Commun. Inf. Networks.

[29]  Zhu Han,et al.  Multi-UAV Delay Optimization in Edge Caching Networks: A Mean Field Game Approach , 2019, 2019 28th Wireless and Optical Communications Conference (WOCC).

[30]  Liang Xiao,et al.  UAV-Aided Cellular Communications with Deep Reinforcement Learning Against Jamming , 2018, IEEE Wireless Communications.

[31]  Dushantha Nalin K. Jayakody,et al.  Neural-Blockchain-Based Ultrareliable Caching for Edge-Enabled UAV Networks , 2019, IEEE Transactions on Industrial Informatics.

[32]  Ismail Güvenç,et al.  Energy Efficiency of RSMA and NOMA in Cellular-Connected mmWave UAV Networks , 2019, 2019 IEEE International Conference on Communications Workshops (ICC Workshops).

[33]  H. Vincent Poor,et al.  Probabilistic Caching for Small-Cell Networks With Terrestrial and Aerial Users , 2019, IEEE Transactions on Vehicular Technology.

[34]  Jamshid Abouei,et al.  Cache Replacement Schemes Based on Adaptive Time Window for Video on Demand Services in Femtocell Networks , 2019, IEEE Transactions on Mobile Computing.

[35]  Arash Mohammadi,et al.  Mobility-Aware Femtocaching Algorithm in D2D Networks Based on Handover , 2020, IEEE Transactions on Vehicular Technology.

[36]  Aiping Huang,et al.  Modeling the Delivery of Coded Packets in D2D Mobile Caching Networks , 2019, IEEE Access.

[37]  Geoffrey Ho,et al.  Effects of display mode and input method for handheld control of micro aerial vehicles for a reconnaissance mission , 2013, IEEE Transactions on Human-Machine Systems.

[38]  Shuo Shi,et al.  Deep Reinforcement Learning-Based Content Placement and Trajectory Design in Urban Cache-Enabled UAV Networks , 2020, Wirel. Commun. Mob. Comput..

[39]  Yi Wang,et al.  Cache-Enabling UAV Communications: Network Deployment and Resource Allocation , 2020, IEEE Transactions on Wireless Communications.

[40]  Bin Li,et al.  UAV Communications for 5G and Beyond: Recent Advances and Future Trends , 2019, IEEE Internet of Things Journal.

[41]  Walid Saad,et al.  Caching in the Sky: Proactive Deployment of Cache-Enabled Unmanned Aerial Vehicles for Optimized Quality-of-Experience , 2016, IEEE Journal on Selected Areas in Communications.

[42]  F. Richard Yu,et al.  Caching UAV Assisted Secure Transmission in Hyper-Dense Networks Based on Interference Alignment , 2018, IEEE Transactions on Communications.

[43]  C. Yalçin Kaya,et al.  A New Scalarization Technique to Approximate Pareto Fronts of Problems with Disconnected Feasible Sets , 2013, Journal of Optimization Theory and Applications.

[44]  Xiaofeng Tao,et al.  Cooperative UAV Cluster-Assisted Terrestrial Cellular Networks for Ubiquitous Coverage , 2018, IEEE Journal on Selected Areas in Communications.

[45]  Walid Saad,et al.  Echo-Liquid State Deep Learning for 360° Content Transmission and Caching in Wireless VR Networks With Cellular-Connected UAVs , 2018, IEEE Transactions on Communications.

[46]  Ali Ghrayeb,et al.  Trajectory Planning and Resource Allocation of Multiple UAVs for Data Delivery in Vehicular Networks , 2019, IEEE Networking Letters.

[47]  Rui Wang,et al.  Deep Reinforcement Learning for Multiobjective Optimization , 2019, IEEE Transactions on Cybernetics.

[48]  Konstantinos N. Plataniotis,et al.  IoT-TD: IoT Dataset for Multiple Model BLE-based Indoor Localization/Tracking , 2021, 2020 28th European Signal Processing Conference (EUSIPCO).

[49]  Walid Saad,et al.  Liquid State Machine Learning for Resource and Cache Management in LTE-U Unmanned Aerial Vehicle (UAV) Networks , 2018, IEEE Transactions on Wireless Communications.

[50]  Hong Wen,et al.  Performance Analysis of UAV Relay Assisted IoT Communication Network Enhanced With Energy Harvesting , 2019, IEEE Access.

[51]  Ying-Chang Liang,et al.  Applications of Deep Reinforcement Learning in Communications and Networking: A Survey , 2018, IEEE Communications Surveys & Tutorials.

[52]  Yusheng Ji,et al.  Information Freshness-Aware Task Offloading in Air-Ground Integrated Edge Computing Systems , 2020, IEEE Journal on Selected Areas in Communications.

[53]  Jamshid Abouei,et al.  An efficient femtocell-to-femtocell handover decision algorithm in LTE femtocell networks , 2015, 2015 23rd Iranian Conference on Electrical Engineering.