Q-Learning-Based Spectrum Access for Content Delivery in Mobile Networks

By the wide development of wireless communication technology, mobile networks are becoming the promising paradigm for mobile users to access the Internet to download contents. However, transmitting contents to multiple mobile users causes excessive interference and heavy network load. Moreover, the service demands for rich contents over mobile networks have been ever-increasing, which leads serious traffic loads to the networks. To improve mobile user’s quality of experience (QoE), in this paper, a Q-learning based spectrum access scheme in mobile networks is firstly proposed for mobile users to access the optimal spectrum and maximize the transmission rate. Secondly, based on the optimal spectrum decision, we present a content delivery scheme for edge nodes, device-to-device (D2D) pairs and mobile users. After that, we model the content delivery process among edge nodes, D2D pairs and mobile users as a non-cooperative Stackelberg game to improve the efficiency of content delivery. Then, the content delivery strategy algorithm is proposed to derive the Stackelberg equilibrium. Each player can obtain the maximum utility from the game based on the optimal strategy. Finally, we conduct extensive simulations to validate the performance of the proposed scheme. Simulation results demonstrate that the proposed scheme can jointly maximize the throughput of mobile networks and improve the QoE of mobile users compared with the conventional schemes.

[1]  Danijela Cabric,et al.  Green Heterogeneous Networks via an Intelligent Sleep/Wake-Up Mechanism and D2D Communications , 2018, IEEE Transactions on Green Communications and Networking.

[2]  Yong Li,et al.  Social-Community-Aware Resource Allocation for D2D Communications Underlaying Cellular Networks , 2016, IEEE Transactions on Vehicular Technology.

[3]  Muhammad Ali Imran,et al.  Spatial and Social Paradigms for Interference and Coverage Analysis in Underlay D2D Network , 2017, IEEE Transactions on Vehicular Technology.

[4]  Zhu Han,et al.  Joint Sponsored and Edge Caching Content Service Market: A Game-Theoretic Approach , 2019, IEEE Transactions on Wireless Communications.

[5]  Matti Latva-aho,et al.  Incentivizing Selected Devices to Perform Cooperative Content Delivery: A Carrier Aggregation-Based Approach , 2016, IEEE Transactions on Wireless Communications.

[6]  K. J. Ray Liu,et al.  Scalable Video Multicasting: A Stochastic Game Approach With Optimal Pricing , 2015, IEEE Transactions on Wireless Communications.

[7]  K. J. Ray Liu,et al.  Game-Theoretic Pricing for Video Streaming in Mobile Networks , 2012, IEEE Transactions on Image Processing.

[8]  Alagan Anpalagan,et al.  Opportunistic Spectrum Access Using Partially Overlapping Channels: Graphical Game and Uncoupled Learning , 2013, IEEE Transactions on Communications.

[9]  Peter Vrancx,et al.  Convolutional Neural Networks for Automatic State-Time Feature Extraction in Reinforcement Learning Applied to Residential Load Control , 2016, IEEE Transactions on Smart Grid.

[10]  Andreas Mitschele-Thiel,et al.  Synchronized Cooperative Learning for Coordinating Cognitive Network Management Functions , 2018, IEEE Transactions on Cognitive Communications and Networking.

[11]  Haitao Zhao,et al.  A Simple Distributed Channel Allocation Algorithm for D2D Communication Pairs , 2018, IEEE Transactions on Vehicular Technology.

[12]  Simon Haykin,et al.  Cognitive Radio Networks: The Spectrum Supply Chain Paradigm , 2015, IEEE Transactions on Cognitive Communications and Networking.

[13]  Walid Saad,et al.  Echo State Networks for Self-Organizing Resource Allocation in LTE-U With Uplink–Downlink Decoupling , 2016, IEEE Transactions on Wireless Communications.

[14]  Qihui Wu,et al.  QoE-Based Distributed Multichannel Allocation in 5G Heterogeneous Cellular Networks: A Matching-Coalitional Game Solution , 2017, IEEE Access.

[15]  Lakshmish Ramaswamy,et al.  Scalable Delivery of Dynamic Content Using a Cooperative Edge Cache Grid , 2007, IEEE Transactions on Knowledge and Data Engineering.

[16]  Bechir Hamdaoui,et al.  Responsive Content-Centric Delivery in Large Urban Communication Networks: A LinkNYC Use-Case , 2018, IEEE Transactions on Wireless Communications.

[17]  Xianfu Chen,et al.  Energy-Efficiency Oriented Traffic Offloading in Wireless Networks: A Brief Survey and a Learning Approach for Heterogeneous Cellular Networks , 2015, IEEE Journal on Selected Areas in Communications.

[18]  Yusheng Ji,et al.  Multiple Description Coding and Recovery of Free Viewpoint Video for Wireless Multi-Path Streaming , 2015, IEEE Journal of Selected Topics in Signal Processing.

[19]  Daniel Camps-Mur,et al.  Device-to-device communications with Wi-Fi Direct: overview and experimentation , 2013, IEEE Wireless Communications.

[20]  Bo Hu,et al.  An Iterative Matching-Stackelberg Game Model for Channel-Power Allocation in D2D Underlaid Cellular Networks , 2018, IEEE Transactions on Wireless Communications.

[21]  David Hutchison,et al.  The Extended Cloud: Review and Analysis of Mobile Edge Computing and Fog From a Security and Resilience Perspective , 2017, IEEE Journal on Selected Areas in Communications.

[22]  Gabriel-Miro Muntean,et al.  Enhanced Power-Friendly Access Network Selection Strategy for Multimedia Delivery Over Heterogeneous Wireless Networks , 2014, IEEE Transactions on Broadcasting.

[23]  Ekram Hossain,et al.  Distributed Resource Allocation for Relay-Aided Device-to-Device Communication: A Message Passing Approach , 2014, IEEE Transactions on Wireless Communications.

[24]  Catherine Rosenberg,et al.  Joint Resource Allocation and User Association for Heterogeneous Wireless Cellular Networks , 2013, IEEE Transactions on Wireless Communications.

[25]  Cheng Wang,et al.  User Association for Load Balancing in Vehicular Networks: An Online Reinforcement Learning Approach , 2017, IEEE Transactions on Intelligent Transportation Systems.

[26]  F. Richard Yu,et al.  A Joint Cross-Layer and Colayer Interference Management Scheme in Hyperdense Heterogeneous Networks Using Mean-Field Game Theory , 2016, IEEE Transactions on Vehicular Technology.

[27]  Rose Qingyang Hu,et al.  Dynamic Distributed Resource Sharing for Mobile D2D Communications , 2015, IEEE Transactions on Wireless Communications.

[28]  Khaled A. Harras,et al.  Disseminating Multilayer Multimedia Content Over Challenged Networks , 2018, IEEE Transactions on Multimedia.

[29]  Jianxin Chen,et al.  Greening the Smart Cities: Energy-Efficient Massive Content Delivery via D2D Communications , 2018, IEEE Transactions on Industrial Informatics.

[30]  Yoshikazu Miyanaga,et al.  An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks , 2016, IEEE Transactions on Communications.

[31]  Nei Kato,et al.  A Cooperative User-System Approach for Optimizing Performance in Content Distribution/Delivery Networks , 2012, IEEE Journal on Selected Areas in Communications.

[32]  Yueming Cai,et al.  Physical–Social-Aware D2D Content Sharing Networks: A Provider–Demander Matching Game , 2018, IEEE Transactions on Vehicular Technology.

[33]  Amir Leshem,et al.  Game Theoretic Dynamic Channel Allocation for Frequency-Selective Interference Channels , 2017, IEEE Transactions on Information Theory.

[34]  Yongming Huang,et al.  Game Theoretic Max-logit Learning Approaches for Joint Base Station Selection and Resource Allocation in Heterogeneous Networks , 2015, IEEE Journal on Selected Areas in Communications.

[35]  Derrick Wing Kwan Ng,et al.  Multi-Quality Multicast Beamforming With Scalable Video Coding , 2018, IEEE Transactions on Communications.

[36]  Franco Davoli,et al.  A Multi-Clustering Approach to Scale Distributed Tenant Networks for Mobile Edge Computing , 2019, IEEE Journal on Selected Areas in Communications.

[37]  Ning Zhang,et al.  Content Popularity Prediction Towards Location-Aware Mobile Edge Caching , 2018, IEEE Transactions on Multimedia.

[38]  Alireza Sadeghi,et al.  Optimal and Scalable Caching for 5G Using Reinforcement Learning of Space-Time Popularities , 2017, IEEE Journal of Selected Topics in Signal Processing.

[39]  Naomi Ehrich Leonard,et al.  Parameter Estimation in Softmax Decision-Making Models With Linear Objective Functions , 2015, IEEE Transactions on Automation Science and Engineering.

[40]  Mihaela van der Schaar,et al.  Self-Organizing Networks of Information Gathering Cognitive Agents , 2015, IEEE Transactions on Cognitive Communications and Networking.

[41]  Gang Feng,et al.  Optimal Cooperative Content Caching and Delivery Policy for Heterogeneous Cellular Networks , 2017, IEEE Transactions on Mobile Computing.

[42]  Christophe Diot,et al.  Impact of Human Mobility on Opportunistic Forwarding Algorithms , 2007, IEEE Transactions on Mobile Computing.

[43]  Beatriz Lorenzo,et al.  Optimal Routing and Traffic Scheduling for Multihop Cellular Networks Using Genetic Algorithm , 2013, IEEE Transactions on Mobile Computing.