Joint Offloading and Resource Allocation in Mobile Edge Computing Systems: An Actor-Critic Approach

Offloading computationally intensive tasks from user equipments (UEs) to mobile edge computing (MEC) servers is a promising technique to boost up the computational capacity of UEs. However, MEC will incur extra energy consumption and time delays, which motivates the deployment of energy harvesting (EH) small cell networks with MEC in mobile networks. Due to the complexity of such networks, it is challenging to effectively allocate resources for UEs. In this paper, we investigate the offloading decision, wireless and computational resources allocation problem in energy harvesting (EH) small cell networks with MEC. Different from existing literatures, our research focuses on improving mobile operators' revenue by maximizing the amount of the offloaded tasks while decreasing the energy expenditure and time-delays. Besides, queues are created at the MEC server side to store the un-executed tasks in a time slot, which is used as a punishment in our utility function to avoid serious delay. Considering the varying lengths of queues, the states of EH-batteries of small base stations (SBSs) and down-link channels, the above problem is modeled as a Markov decision process (MDP). Since the states and actions in the MDP are infinite, an online and on-policy actor-critic with eligibility traces algorithm is proposed to resolve the problem. Simulation results show the proposed algorithm has superior performances compared with the policy-gradient algorithm and Q- learning.

[1]  Vijay R. Konda,et al.  OnActor-Critic Algorithms , 2003, SIAM J. Control. Optim..

[2]  Qianbin Chen,et al.  Computation Offloading and Resource Allocation in Wireless Cellular Networks With Mobile Edge Computing , 2017, IEEE Transactions on Wireless Communications.

[3]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[4]  F. Richard Yu,et al.  A Joint Cross-Layer and Colayer Interference Management Scheme in Hyperdense Heterogeneous Networks Using Mean-Field Game Theory , 2016, IEEE Transactions on Vehicular Technology.

[5]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[6]  Victor C. M. Leung,et al.  Green Full-Duplex Self-Backhaul and Energy Harvesting Small Cell Networks With Massive MIMO , 2016, IEEE Journal on Selected Areas in Communications.

[7]  Zdenek Becvar,et al.  Mobile Edge Computing: A Survey on Architecture and Computation Offloading , 2017, IEEE Communications Surveys & Tutorials.

[8]  F. Richard Yu,et al.  Enhancing QoE-Aware Wireless Edge Caching With Software-Defined Wireless Networks , 2017, IEEE Transactions on Wireless Communications.

[9]  Osvaldo Simeone,et al.  Energy-Efficient Resource Allocation for Mobile Edge Computing-Based Augmented Reality Applications , 2016, IEEE Wireless Communications Letters.

[10]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[11]  Jun Zhang,et al.  Stochastic Joint Radio and Computational Resource Management for Multi-User Mobile-Edge Computing Systems , 2017, IEEE Transactions on Wireless Communications.

[12]  Zhu Han,et al.  User Scheduling and Resource Allocation in HetNets With Hybrid Energy Supply: An Actor-Critic Reinforcement Learning Approach , 2018, IEEE Transactions on Wireless Communications.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  H. Vincent Poor,et al.  Cooperation and Storage Tradeoffs in Power Grids With Renewable Energy Resources , 2014, IEEE Journal on Selected Areas in Communications.

[15]  Qianbin Chen,et al.  Joint Computation Offloading and Interference Management in Wireless Cellular Networks with Mobile Edge Computing , 2017, IEEE Transactions on Vehicular Technology.

[16]  Nei Kato,et al.  Postdisaster User Location Maneuvering Method for Improving the QoE Guaranteed Service Time in Energy Harvesting Small Cell Networks , 2017, IEEE Transactions on Vehicular Technology.

[17]  Tony Q. S. Quek,et al.  Offloading in Mobile Edge Computing: Task Allocation and Computational Frequency Scaling , 2017, IEEE Transactions on Communications.

[18]  Qianbin Chen,et al.  Integration of Networking, Caching, and Computing in Wireless Systems: A Survey, Some Research Issues, and Challenges , 2018, IEEE Communications Surveys & Tutorials.