Scheduling for Mobile Edge Computing With Random User Arrivals—An Approximate MDP and Reinforcement Learning Approach

In this paper, we investigate the scheduling design of a mobile edge computing (MEC) system, where active mobile devices with computation tasks randomly appear in a cell. Every task can be computed at either the mobile device or the MEC server. We jointly optimize the task offloading decision, uplink transmission device selection and power allocation by formulating the problem as an infinite-horizon Markov decision process (MDP). Compared with most of the existing literature, this is the first attempt to address the transmission and computation optimization with random device arrivals in an infinite time horizon to our best knowledge. Due to the uncertainty in the device number and location, the conventional approximate MDP approaches addressing the curse of dimensionality cannot be applied. An alternative and suitable low-complexity solution framework is proposed in this work. We first introduce a baseline scheduling policy, whose value function can be derived analytically with the statistics of random mobile device arrivals. Then, one-step policy iteration is adopted to obtain a sub-optimal scheduling policy whose performance can be bounded analytically. The complexity of deriving the sub-optimal policy is reduced dramatically compared with conventional solutions of MDP by eliminating the complicated value iteration. To address a more general scenario where the statistics of random mobile device arrivals are unknown, a novel and efficient algorithm integrating reinforcement learning and stochastic gradient descent (SGD) is proposed to improve the system performance in an online manner. Simulation results show that the gain of the sub-optimal policy over various benchmarks is significant.

[1]  Ying Cui,et al.  Joint Pushing and Caching for Bandwidth Utilization Maximization in Wireless Networks , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[2]  Zibin Zheng,et al.  Online Deep Reinforcement Learning for Computation Offloading in Blockchain-Empowered Mobile Edge Computing , 2019, IEEE Transactions on Vehicular Technology.

[3]  Dusit Niyato,et al.  A Dynamic Offloading Algorithm for Mobile Computing , 2012, IEEE Transactions on Wireless Communications.

[4]  Rui Wang,et al.  Cellular Offloading via Downlink Cache Placement , 2018, 2018 IEEE International Conference on Communications (ICC).

[5]  Huimin Yu,et al.  Deep Reinforcement Learning for Offloading and Resource Allocation in Vehicle Edge Computing and Networks , 2019, IEEE Transactions on Vehicular Technology.

[6]  Khaled Ben Letaief,et al.  Power-Delay Tradeoff in Multi-User Mobile-Edge Computing Systems , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[7]  Vincent K. N. Lau,et al.  Distributive Stochastic Learning for Delay-Optimal OFDMA Power and Subband Allocation , 2010, IEEE Transactions on Signal Processing.

[8]  Vincent K. N. Lau,et al.  Queue-Aware Distributive Resource Control for Delay-Sensitive Two-Hop MIMO Cooperative Systems , 2010, IEEE Transactions on Signal Processing.

[9]  Khaled Ben Letaief,et al.  Delay-optimal computation task scheduling for mobile-edge computing systems , 2016, 2016 IEEE International Symposium on Information Theory (ISIT).

[10]  Haiyun Luo,et al.  Energy-Optimal Mobile Cloud Computing under Stochastic Wireless Channel , 2013, IEEE Transactions on Wireless Communications.

[11]  Khaled Ben Letaief,et al.  Dynamic Computation Offloading for Mobile-Edge Computing With Energy Harvesting Devices , 2016, IEEE Journal on Selected Areas in Communications.

[12]  B. Liang,et al.  Mobile Edge Computing , 2020, Encyclopedia of Wireless Networks.

[13]  Songtao Guo,et al.  Energy-Efficient Cooperative Resource Allocation in Wireless Powered Mobile Edge Computing , 2019, IEEE Internet of Things Journal.

[14]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[15]  M. Nikulin,et al.  Unbiased Estimators and Their Applications: Volume 1: Univariate Case , 1993 .

[16]  Wenzhong Li,et al.  Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing , 2015, IEEE/ACM Transactions on Networking.

[17]  Vincent K. N. Lau,et al.  Delay-Aware Two-Hop Cooperative Relay Communications via Approximate MDP and Stochastic Learning , 2013, IEEE Transactions on Information Theory.

[18]  Kaibin Huang,et al.  Wirelessly Powered Mobile Computation Offloading: Energy Savings Maximization , 2014, 2015 IEEE Global Communications Conference (GLOBECOM).

[19]  Vincent K. N. Lau,et al.  Delay-optimal power and subcarrier allocation for OFDMA systems via stochastic approximation , 2009, IEEE Transactions on Wireless Communications.

[20]  Shaojie Tang,et al.  Online Learning based Uplink Scheduling in HetNets with Limited Backhaul Capacity , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[21]  Guihai Chen,et al.  Energy-Efficient Dynamic Virtual Machine Management in Data Centers , 2019, IEEE/ACM Transactions on Networking.

[22]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[23]  Yan Zhang,et al.  Mobile Edge Computing: A Survey , 2018, IEEE Internet of Things Journal.

[24]  Ying Cui,et al.  Joint Optimization of File Placement and Delivery in Cache-Assisted Wireless Networks , 2018, GLOBECOM.

[25]  Bojie Lv,et al.  MDP-Based Scheduling Design for Mobile-Edge Computing Systems with Random User Arrival , 2019, 2019 IEEE Global Communications Conference (GLOBECOM).

[26]  Nan Zhao,et al.  Integrated Networking, Caching, and Computing for Connected Vehicles: A Deep Reinforcement Learning Approach , 2018, IEEE Transactions on Vehicular Technology.

[27]  Rui Wang,et al.  Joint Downlink Scheduling for File Placement and Delivery in Cache-Assisted Wireless Networks With Finite File Lifetime , 2019, IEEE Transactions on Communications.

[28]  Rose Qingyang Hu,et al.  Mobility-Aware Edge Caching and Computing in Vehicle Networks: A Deep Reinforcement Learning , 2018, IEEE Transactions on Vehicular Technology.

[29]  Kaibin Huang,et al.  Multiuser Resource Allocation for Mobile-Edge Computation Offloading , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[30]  Kaibin Huang,et al.  Wireless Networks for Mobile Edge Computing: Spatial Modeling and Latency Analysis , 2017, IEEE Transactions on Wireless Communications.

[31]  Nei Kato,et al.  Smart Resource Allocation for Mobile Edge Computing: A Deep Reinforcement Learning Approach , 2019, IEEE Transactions on Emerging Topics in Computing.

[32]  Xu Chen,et al.  Decentralized Computation Offloading Game for Mobile Cloud Computing , 2014, IEEE Transactions on Parallel and Distributed Systems.

[33]  Guihai Chen,et al.  Dynamic virtual machine management via approximate Markov decision process , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[34]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[35]  Thomas D. Burd,et al.  Processor design for portable systems , 1996, J. VLSI Signal Process..

[36]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[37]  Haisheng Tan,et al.  Cooperative Job Dispatching in Edge Computing Network with Unpredictable Uploading Delay , 2020, J. Commun. Inf. Networks.

[38]  Ying Cui,et al.  Joint Optimization of File Placement and Delivery in Cache-Assisted Wireless Networks , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[39]  Abbas Jamalipour,et al.  Wireless communications , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[40]  Sangheon Pack,et al.  Spatial and Temporal Computation Offloading Decision Algorithm in Edge Cloud-Enabled Heterogeneous Networks , 2018, IEEE Access.

[41]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .