DMRO: A Deep Meta Reinforcement Learning-Based Task Offloading Framework for Edge-Cloud Computing

With the continuous growth of mobile data and the unprecedented demand for computing power, resource-constrained edge devices cannot effectively meet the requirements of Internet of Things (IoT) applications and Deep Neural Network (DNN) computing. As a distributed computing paradigm, edge offloading that migrates complex tasks from IoT devices to edge-cloud servers can break through the resource limitation of IoT devices, reduce the computing burden and improve the efficiency of task processing. However, the problem of optimal offloading decision-making is NP-hard, traditional optimization methods are difficult to achieve results efficiently. Besides, there are still some shortcomings in existing deep learning methods, e.g., the slow learning speed and the failure of the original network parameters when the environment changes. To tackle these challenges, we propose a Deep Meta Reinforcement Learning-based offloading (DMRO) algorithm, which combines multiple parallel DNNs with Q-learning to make fine-grained offloading decisions. By aggregating the perceptive ability of deep learning, the decision-making ability of reinforcement learning, and the rapid environment learning ability of meta-learning, it is possible to quickly and flexibly obtain the optimal offloading strategy from the IoT environment. Simulation results demonstrate that the proposed algorithm achieves obvious improvement over the Deep Q-Learning algorithm and has strong portability in making real-time offloading decisions even in time-varying IoT environments.

[1]  Trevor N. Mudge,et al.  Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge , 2017, ASPLOS.

[2]  Keqin Li,et al.  Joint offloading and scheduling decisions for DAG applications in mobile edge computing , 2020, Neurocomputing.

[3]  Huaming Wu,et al.  Collaborate Edge and Cloud Computing With Distributed Deep Learning for Smart City Internet of Things , 2020, IEEE Internet of Things Journal.

[4]  Xiaofei Wang,et al.  Convergence of Edge Computing and Deep Learning: A Comprehensive Survey , 2019, IEEE Communications Surveys & Tutorials.

[5]  Victor C. M. Leung,et al.  An Efficient Computation Offloading Management Scheme in the Densely Deployed Small Cell Networks With Mobile Edge Computing , 2018, IEEE/ACM Transactions on Networking.

[6]  Zibin Zheng,et al.  Joint Computation Offloading and Coin Loaning for Blockchain-Empowered Mobile-Edge Computing , 2019, IEEE Internet of Things Journal.

[7]  Xiaoyi Lu,et al.  Early Experience in Benchmarking Edge AI Processors with Object Detection Workloads , 2019, Bench.

[8]  Victor C. M. Leung,et al.  Deep-Reinforcement-Learning-Based Optimization for Cache-Enabled Opportunistic Interference Alignment Wireless Networks , 2017, IEEE Transactions on Vehicular Technology.

[9]  Marimuthu Palaniswami,et al.  An Application Placement Technique for Concurrent IoT Applications in Edge and Fog Computing Environments , 2021, IEEE Transactions on Mobile Computing.

[10]  Lin Wang,et al.  Energy Management for Multi-User Mobile-Edge Computing Systems with Energy Harvesting Devices and QoS Constraints , 2018, 2018 27th International Conference on Computer Communication and Networks (ICCCN).

[11]  Hai Jin,et al.  Stable Local Broadcast in Multihop Wireless Networks Under SINR , 2018, IEEE/ACM Transactions on Networking.

[12]  Yang Yu,et al.  Computation Offloading for Mobile-Edge Computing with Multi-user , 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[13]  Ken Goldberg,et al.  Deep Imitation Learning for Complex Manipulation Tasks from Virtual Reality Teleoperation , 2017, ICRA.

[14]  Xu Feng,et al.  Distributed Deep Learning-based Offloading for Mobile Edge Computing Networks , 2018, Mobile Networks and Applications.

[15]  Tao Li,et al.  A Framework for Partitioning and Execution of Data Stream Applications in Mobile Cloud Computing , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[16]  Wei Ni,et al.  Optimal Schedule of Mobile Edge Computing for Internet of Things Using Partial Information , 2017, IEEE Journal on Selected Areas in Communications.

[17]  Wei Li,et al.  A dynamic tradeoff data processing framework for delay-sensitive applications in Cloud of Things systems , 2018, J. Parallel Distributed Comput..

[18]  Victor C. M. Leung,et al.  End-edge-cloud collaborative computation offloading for multiple mobile users in heterogeneous edge-server environment , 2020, Wireless Networks.

[19]  Rajkumar Buyya,et al.  Next generation cloud computing: New trends and research directions , 2017, Future Gener. Comput. Syst..

[20]  Yuanyuan Yang,et al.  A quick-response framework for multi-user computation offloading in mobile cloud computing , 2018, Future Gener. Comput. Syst..

[21]  Tarik Taleb,et al.  On Multi-Access Edge Computing: A Survey of the Emerging 5G Network Edge Cloud Architecture and Orchestration , 2017, IEEE Communications Surveys & Tutorials.

[22]  Jiguo Yu,et al.  Faster Parallel Core Maintenance Algorithms in Dynamic Graphs , 2020, IEEE Transactions on Parallel and Distributed Systems.

[23]  Rajkumar Buyya,et al.  STAR: SLA-aware Autonomic Management of Cloud Resources , 2020, IEEE Transactions on Cloud Computing.

[24]  Katinka Wolter,et al.  Analysis of the Energy-Response Time Tradeoff for Mobile Cloud Offloading Using Combined Metrics , 2015, 2015 27th International Teletraffic Congress.

[25]  Francis C. M. Lau,et al.  Implementing The Abstract MAC Layer in Dynamic Networks , 2021, IEEE Transactions on Mobile Computing.

[26]  Jane X. Wang,et al.  Reinforcement Learning, Fast and Slow , 2019, Trends in Cognitive Sciences.

[27]  Mugen Peng,et al.  Edge computing technologies for Internet of Things: a primer , 2017, Digit. Commun. Networks.

[28]  Yonggang Wen,et al.  Energy-Efficient Task Execution for Application as a General Topology in Mobile Cloud Computing , 2018, IEEE Transactions on Cloud Computing.

[29]  Quan Chen,et al.  Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services , 2019, 2019 IEEE 37th International Conference on Computer Design (ICCD).

[30]  Yun Yang,et al.  A novel directional and non-local-convergent particle swarm optimization based workflow scheduling in cloud-edge environment , 2019, Future Gener. Comput. Syst..

[31]  Bo Li,et al.  Gearing resource-poor mobile devices with powerful clouds: architectures, challenges, and applications , 2013, IEEE Wireless Communications.

[32]  Zehui Xiong,et al.  Joint optimization of service chain caching and task offloading in mobile edge computing , 2021, Appl. Soft Comput..

[33]  Bin Cao,et al.  Lyapunov Optimization-Based Trade-Off Policy for Mobile Cloud Offloading in Heterogeneous Wireless Networks , 2019, IEEE Transactions on Cloud Computing.

[34]  Shuguang Cui,et al.  Joint offloading and computing optimization in wireless powered mobile-edge computing systems , 2017, 2017 IEEE International Conference on Communications (ICC).

[35]  Yu Cao,et al.  Energy-Delay Tradeoff for Dynamic Offloading in Mobile-Edge Computing System With Energy Harvesting Devices , 2018, IEEE Transactions on Industrial Informatics.

[36]  Ying Jun Zhang,et al.  Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks , 2018, IEEE Transactions on Mobile Computing.

[37]  Olivia Das,et al.  Modeling the Effect of Parallel Execution on Multi-site Computation Offloading in Mobile Cloud Computing , 2018, EPEW.

[38]  Min Chen,et al.  Task Offloading for Mobile Edge Computing in Software Defined Ultra-Dense Network , 2018, IEEE Journal on Selected Areas in Communications.

[39]  Geoffrey Fox,et al.  Energy-efficient multisite offloading policy using Markov decision process for mobile cloud computing , 2016, Pervasive Mob. Comput..

[40]  Zeb Kurth-Nelson,et al.  Learning to reinforcement learn , 2016, CogSci.

[41]  Li Lin,et al.  Echo: An Edge-Centric Code Offloading System With Quality of Service Guarantee , 2018, IEEE Access.

[42]  Xuyun Zhang,et al.  A computation offloading method over big data for IoT-enabled cloud-edge computing , 2019, Future Gener. Comput. Syst..

[43]  David E. Bernholdt,et al.  OpenMP 4.5 Validation and Verification Suite for Device Offload , 2018, IWOMP.

[44]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[45]  Hyundong Shin,et al.  Learning for Computation Offloading in Mobile Edge Computing , 2018, IEEE Transactions on Communications.

[46]  Kaibin Huang,et al.  Energy-Efficient Resource Allocation for Mobile-Edge Computation Offloading , 2016, IEEE Transactions on Wireless Communications.

[47]  Li Ning,et al.  Distributed Spanner Construction With Physical Interference: Constant Stretch and Linear Sparseness , 2017, IEEE/ACM Transactions on Networking.

[48]  Khaled Ben Letaief,et al.  Dynamic Computation Offloading for Mobile-Edge Computing With Energy Harvesting Devices , 2016, IEEE Journal on Selected Areas in Communications.

[49]  Soumaya Cherkaoui,et al.  A Game Theory Based Efficient Computation Offloading in an UAV Network , 2019, IEEE Transactions on Vehicular Technology.

[50]  Nancy Samaan,et al.  A Novel Statistical Cost Model and an Algorithm for Efficient Application Offloading to Clouds , 2018, IEEE Transactions on Cloud Computing.

[51]  Yuan Wu,et al.  Deep Reinforcement Learning-Based Task Offloading and Resource Allocation for Mobile Edge Computing , 2018, MLICOM.

[52]  Bo Li,et al.  eTime: Energy-efficient transmission between cloud and mobile devices , 2013, 2013 Proceedings IEEE INFOCOM.

[53]  Min Sheng,et al.  Mobile-Edge Computing: Partial Computation Offloading Using Dynamic Voltage Scaling , 2016, IEEE Transactions on Communications.

[54]  Katinka Wolter,et al.  Tradeoff Analysis for Mobile Cloud Offloading Based on an Additive Energy-Performance Metric , 2015, EAI Endorsed Trans. Future Intell. Educ. Environ..

[55]  Robert C. Green,et al.  Mobile Edge Offloading Using Markov Decision Processes , 2018, EDGE.

[56]  Kenli Li,et al.  COOPER-SCHED: A Cooperative Scheduling Framework for Mobile Edge Computing with Expected Deadline Guarantee , 2020 .

[57]  Jingyu Wang,et al.  Knowledge-Driven Service Offloading Decision for Vehicular Edge Computing: A Deep Reinforcement Learning Approach , 2019, IEEE Transactions on Vehicular Technology.

[58]  Adel Nadjaran Toosi,et al.  Serverless Edge Computing: Vision and Challenges , 2021, ACSW.

[59]  Sanjay Misra,et al.  Transformative effects of IoT, Blockchain and Artificial Intelligence on cloud computing: Evolution, vision, trends and open challenges , 2019, Internet Things.

[60]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[61]  Rajkumar Buyya,et al.  Fog Computing: A Taxonomy, Survey and Future Directions , 2016, Internet of Everything.

[62]  Wei Cao,et al.  Intelligent Offloading in Multi-Access Edge Computing: A State-of-the-Art Review and Framework , 2019, IEEE Communications Magazine.

[63]  Ying Chen,et al.  Energy Efficient Dynamic Offloading in Mobile Edge Computing for Internet of Things , 2019, IEEE Transactions on Cloud Computing.

[64]  Huaming Wu,et al.  Stochastic Analysis of Delayed Mobile Offloading in Heterogeneous Networks , 2018, IEEE Transactions on Mobile Computing.

[65]  F. Richard Yu,et al.  Joint Offloading and Resource Allocation in Mobile Edge Computing Systems: An Actor-Critic Approach , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[66]  Jing Wang,et al.  A deep reinforcement learning based framework for power-efficient resource allocation in cloud RANs , 2017, 2017 IEEE International Conference on Communications (ICC).

[67]  Feng Xia,et al.  Deep Reinforcement Learning for Vehicular Edge Computing , 2019, ACM Trans. Intell. Syst. Technol..

[68]  K. B. Letaief,et al.  A Survey on Mobile Edge Computing: The Communication Perspective , 2017, IEEE Communications Surveys & Tutorials.

[69]  Bharat K. Bhargava,et al.  A Survey of Computation Offloading for Mobile Systems , 2012, Mobile Networks and Applications.

[70]  Sukhpal Singh Gill,et al.  Quantum and blockchain based Serverless edge computing: A vision, model, new trends and future directions , 2021, Internet Technol. Lett..

[71]  Marcin Andrychowicz,et al.  Learning to learn by gradient descent by gradient descent , 2016, NIPS.

[72]  Naghmeh S. Moayedian,et al.  An Offloading Strategy in Mobile Cloud Computing Considering Energy and Delay Constraints , 2018, IEEE Access.

[73]  Zdenek Becvar,et al.  Mobile Edge Computing: A Survey on Architecture and Computation Offloading , 2017, IEEE Communications Surveys & Tutorials.

[74]  Xu Chen,et al.  Chimera: An Energy-Efficient and Deadline-Aware Hybrid Edge Computing Framework for Vehicular Crowdsensing Applications , 2019, IEEE Internet of Things Journal.

[75]  Min Chen,et al.  A Markov Decision Process-based service migration procedure for follow me cloud , 2014, 2014 IEEE International Conference on Communications (ICC).

[76]  Chadi Assi,et al.  Computational Cost and Energy Efficient Task Offloading in Hierarchical Edge-Clouds , 2018, 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC).

[77]  Chin-Ling Chen,et al.  Semi-Online Computational Offloading by Dueling Deep-Q Network for User Behavior Prediction , 2020, IEEE Access.

[78]  Yuanyuan Yang,et al.  Energy-efficient computation offloading and resource allocation for delay-sensitive mobile edge computing , 2019, Sustain. Comput. Informatics Syst..

[79]  Zhao Tong,et al.  A scheduling scheme in the cloud computing environment using deep Q-learning , 2020, Inf. Sci..

[80]  Bingsheng He,et al.  Cost-Aware Partitioning for Efficient Large Graph Processing in Geo-Distributed Datacenters , 2020, IEEE Transactions on Parallel and Distributed Systems.

[81]  Mahmoud Al-Ayyoub,et al.  The future of mobile cloud computing: Integrating cloudlets and Mobile Edge Computing , 2016, 2016 23rd International Conference on Telecommunications (ICT).

[82]  Liang Huang,et al.  Meta-Learning Based Dynamic Computation Task Offloading for Mobile Edge Computing Networks , 2021, IEEE Communications Letters.

[83]  Victor C. M. Leung,et al.  Downlink Energy Efficiency of Power Allocation and Wireless Backhaul Bandwidth Allocation in Heterogeneous Small Cell Networks , 2017, IEEE Transactions on Communications.

[84]  Xu Chen,et al.  D2D Fogging: An Energy-Efficient and Incentive-Aware Task Offloading Framework via Network-assisted D2D Collaboration , 2016, IEEE Journal on Selected Areas in Communications.

[85]  Jiguo Yu,et al.  Localized and distributed link scheduling algorithms in IoT under rayleigh fading , 2019, Comput. Networks.

[86]  Partha Pratim Ray,et al.  SDN/NFV architectures for edge-cloud oriented IoT: A systematic review , 2021, Computer Communications.

[87]  Ming Tang,et al.  Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing Systems , 2020, IEEE Transactions on Mobile Computing.

[88]  Wenguang Chen,et al.  Automatic Irregularity-Aware Fine-Grained Workload Partitioning on Integrated Architectures , 2021, IEEE Transactions on Knowledge and Data Engineering.

[89]  Xu Chen,et al.  Learning Driven Computation Offloading for Asymmetrically Informed Edge Computing , 2019, IEEE Transactions on Parallel and Distributed Systems.

[90]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[91]  Neeraj Kumar,et al.  A Novel Pairing-Free Lightweight Authentication Protocol for Mobile Cloud Computing Framework , 2021, IEEE Systems Journal.

[92]  Athanasios V. Vasilakos,et al.  MAPCloud: Mobile Applications on an Elastic and Scalable 2-Tier Cloud Architecture , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[93]  Weihua Zhuang,et al.  Learning-Based Computation Offloading for IoT Devices With Energy Harvesting , 2017, IEEE Transactions on Vehicular Technology.

[94]  Junlong Zhu,et al.  A Computing Offloading Game for Mobile Devices and Edge Cloud Servers , 2018, Wirel. Commun. Mob. Comput..

[95]  Sakshi Kaushal,et al.  Energy conscious multi-site computation offloading for mobile cloud computing , 2018, Soft Comput..

[96]  Zhenming Liu,et al.  Delivering Deep Learning to Mobile Devices via Offloading , 2017, VR/AR Network@SIGCOMM.

[97]  John Thompson,et al.  Computational Load Balancing on the Edge in Absence of Cloud and Fog , 2019, IEEE Transactions on Mobile Computing.

[98]  Neeraj Kumar,et al.  An Edge-Fog Computing Framework for Cloud of Things in Vehicle to Grid Environment , 2020, 2020 IEEE 21st International Symposium on "A World of Wireless, Mobile and Multimedia Networks" (WoWMoM).

[99]  Yuanyuan Yang,et al.  Efficient and Secure Multi-User Multi-Task Computation Offloading for Mobile-Edge Computing in Mobile IoT Networks , 2020, IEEE Transactions on Network and Service Management.

[100]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[101]  Yuan Xi-Gang,et al.  An improved PSO algorithm for solving non-convex NLP/MINLP problems with equality constraints , 2007 .

[102]  Yong Li,et al.  A Survey on Edge Intelligence , 2020, ArXiv.

[103]  Andreas Mäder,et al.  Device-Centric Energy Optimization for Edge Cloud Offloading , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[104]  Vankadara Saritha,et al.  An efficient algorithm for dynamic task offloading using cloudlets in mobile cloud computing , 2019, Int. J. Commun. Syst..

[105]  Lei Guo,et al.  Deep Reinforcement Learning for Intelligent Internet of Vehicles: An Energy-Efficient Computational Offloading Scheme , 2019, IEEE Transactions on Cognitive Communications and Networking.

[106]  Sherali Zeadally,et al.  A survey and taxonomy on energy efficient resource allocation techniques for cloud computing systems , 2016, Computing.

[107]  Victor C. M. Leung,et al.  An energy- and cost-aware computation offloading method for workflow applications in mobile edge computing , 2019, EURASIP J. Wirel. Commun. Netw..

[108]  David Budden,et al.  Distributed Prioritized Experience Replay , 2018, ICLR.

[109]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[110]  Mehdi Bennis,et al.  Intelligent Edge: Leveraging Deep Imitation Learning for Mobile Edge Computation Offloading , 2020, IEEE Wireless Communications.

[111]  Xiaoyi Lu,et al.  TriEC: tripartite graph based erasure coding NIC offload , 2019, SC.

[112]  Huaming Wu,et al.  EEDTO: An Energy-Efficient Dynamic Task Offloading Algorithm for Blockchain-Enabled IoT-Edge-Cloud Orchestrated Computing , 2021, IEEE Internet of Things Journal.

[113]  Katinka Wolter,et al.  An Efficient Application Partitioning Algorithm in Mobile Environments , 2019, IEEE Transactions on Parallel and Distributed Systems.

[114]  Albert Y. Zomaya,et al.  Fast Adaptive Task Offloading in Edge Computing Based on Meta Reinforcement Learning , 2021, IEEE Transactions on Parallel and Distributed Systems.

[115]  Kenli Li,et al.  COOPER-MATCH: Job Offloading with A Cooperative Game for Guaranteeing Strict Deadlines in MEC , 2020 .

[116]  Helen D. Karatza,et al.  A hybrid approach to scheduling real-time IoT workflows in fog and cloud environments , 2018, Multimedia Tools and Applications.

[117]  Eryk Dutkiewicz,et al.  Offloading Energy Efficiency with Delay Constraint for Cooperative Mobile Edge Computing Networks , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[118]  David E. Bernholdt,et al.  Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead , 2019, Parallel Comput..

[119]  Chen-Khong Tham,et al.  A deep reinforcement learning based offloading scheme in ad-hoc mobile clouds , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS).

[120]  Wenzhong Li,et al.  Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing , 2015, IEEE/ACM Transactions on Networking.

[121]  Chonho Lee,et al.  A survey of mobile cloud computing: architecture, applications, and approaches , 2013, Wirel. Commun. Mob. Comput..

[122]  Yi Sun,et al.  Energy-Efficient Decision Making for Mobile Cloud Offloading , 2020, IEEE Transactions on Cloud Computing.

[123]  Yu Wang,et al.  Cloudlet Placement and Task Allocation in Mobile Edge Computing , 2019, IEEE Internet of Things Journal.

[124]  Adrián Castelló,et al.  Theoretical Scalability Analysis of Distributed Deep Convolutional Neural Networks , 2019, 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[125]  Sergey Levine,et al.  Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.

[126]  Zibin Zheng,et al.  Multi-Hop Cooperative Computation Offloading for Industrial IoT–Edge–Cloud Computing Environments , 2019, IEEE Transactions on Parallel and Distributed Systems.