Deep reinforcement learning for computation offloading in mobile edge computing environment

Abstract Recently, in order to distribute computing, networking resources, services, near terminals, mobile fog is gradually becoming the mobile edge computing (MEC) paradigm. In a mobile fog environment, the quality of service affected by offloading speeds and the fog processing, however the traditional fog method to solve the problem of computation resources allocation is difficult because of the complex network states distribution environment (that is, F-AP states, AP states, mobile device states and code block states). In this paper, to improve the fog resource provisioning performance of mobile devices, the learning-based mobile fog scheme with deep deterministic policy gradient (DDPG) algorithm is proposed. An offloading block pulsating discrete event system is modeled as a Markov Decision Processes (MDPs), which can realize the offloading computing without knowing the transition probabilities among different network states. Furthermore, the DDPG algorithm is used to solve the issue of state spaces explosion and learn an optimal offloading policy on distributed mobile fog computing. The simulation results show that our proposed scheme achieves 20%, 37%, 46% improvement on related performance compared with the policy gradient (PG), deterministic policy gradient (DPG) and actor–critic (AC) methods. Besides, compared with the traditional fog provisioning scheme, our scheme shows better cost performance of fog resource provisioning under different locations number and different task arrival rates.

[1]  Anfeng Liu,et al.  An Intelligent Game-Based Offloading Scheme for Maximizing Benefits of IoT-Edge-Cloud Ecosystems , 2022, IEEE Internet of Things Journal.

[2]  Yannan Li,et al.  Blockchain-Based Solutions to Security and Privacy Issues in the Internet of Things , 2018, IEEE Wireless Communications.

[3]  Anfeng Liu,et al.  Intelligent UAVs Trajectory Optimization From Space-Time for Data Collection in Social Networks , 2021, IEEE Transactions on Network Science and Engineering.

[4]  Bin Hu,et al.  When Deep Reinforcement Learning Meets 5G-Enabled Vehicular Networks: A Distributed Offloading Framework for Traffic Big Data , 2020, IEEE Transactions on Industrial Informatics.

[5]  Zhiwen Zeng,et al.  An Intelligent Collaboration Trust Interconnections System for Mobile Information Control in Ubiquitous 5G Networks , 2021, IEEE Transactions on Network Science and Engineering.

[6]  Jinsong Gui,et al.  Joint mobile vehicle–UAV scheme for secure data collection in a smart city , 2020, Annals of Telecommunications.

[7]  H. Vincent Poor,et al.  Latency and Reliability-Aware Task Offloading and Resource Allocation for Mobile Edge Computing , 2017, 2017 IEEE Globecom Workshops (GC Wkshps).

[8]  Arun Kumar Sangaiah,et al.  Mobility Based Trust Evaluation for Heterogeneous Electric Vehicles Network in Smart Cities , 2021, IEEE Transactions on Intelligent Transportation Systems.

[9]  Xiaolong Xu,et al.  Efficient computation offloading for Internet of Vehicles in edge computing-assisted 5G networks , 2019, The Journal of Supercomputing.

[10]  Shaohua Wan,et al.  Deep Learning Models for Real-time Human Activity Recognition with Smartphones , 2019, Mobile Networks and Applications.

[11]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[12]  Peter Kilpatrick,et al.  Challenges and Opportunities in Edge Computing , 2016, 2016 IEEE International Conference on Smart Cloud (SmartCloud).

[13]  Hongjun Dai,et al.  A scheduling algorithm for autonomous driving tasks on mobile edge computing servers , 2019, J. Syst. Archit..

[14]  Jason P. Jue,et al.  All One Needs to Know about Fog Computing and Related Edge Computing Paradigms , 2019 .

[15]  Pan Hui,et al.  ThinkAir: Dynamic resource allocation and parallel execution in the cloud for mobile code offloading , 2012, 2012 Proceedings IEEE INFOCOM.

[16]  Mahadev Satyanarayanan,et al.  The Role of Cloudlets in Hostile Environments , 2013, IEEE Pervasive Comput..

[17]  J. Wenny Rahayu,et al.  Mobile cloud computing: A survey , 2013, Future Gener. Comput. Syst..

[18]  Minming Li,et al.  Coordinated resource provisioning and maintenance scheduling in cloud data centers , 2013, 2013 Proceedings IEEE INFOCOM.

[19]  Pierluigi Siano,et al.  Sustainable Smart Cities Through the Lens of Complex Interdependent Infrastructures: Panorama and State-of-the-art , 2018, Studies in Systems, Decision and Control.

[20]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[21]  János Sztrik,et al.  Basic Queueing Theory , 2016 .

[22]  Shaobo Zhang,et al.  Fast Multicast With Adjusting Transmission Power and Active Slots in Software Define IoT , 2020, IEEE Access.

[23]  Philipp Leitner,et al.  Resource Provisioning for IoT Services in the Fog , 2016, 2016 IEEE 9th International Conference on Service-Oriented Computing and Applications (SOCA).

[24]  H. Andrés Neyem,et al.  Towards a practical framework for code offloading in the Internet of Things , 2019, Future Gener. Comput. Syst..

[25]  Guy Lever,et al.  Deterministic Policy Gradient Algorithms , 2014, ICML.

[26]  Anfeng Liu,et al.  A trustworthiness-based vehicular recruitment scheme for information collections in Distributed Networked Systems , 2021, Inf. Sci..

[27]  Rajkumar Buyya,et al.  Heterogeneity in Mobile Cloud Computing: Taxonomy and Open Challenges , 2014, IEEE Communications Surveys & Tutorials.

[28]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[29]  Xuemin Shen,et al.  Synergy of Big Data and 5G Wireless Networks: Opportunities, Approaches, and Challenges , 2018, IEEE Wireless Communications.

[30]  Xiaoheng Deng,et al.  QoE-driven computation offloading for Edge Computing , 2019, J. Syst. Archit..

[31]  Anfeng Liu,et al.  Objective-Variable Tour Planning for Mobile Data Collection in Partitioned Sensor Networks , 2022, IEEE Transactions on Mobile Computing.

[32]  Shaohua Wan,et al.  LMM: latency-aware micro-service mashup in mobile edge computing environment , 2020, Neural Computing and Applications.

[33]  Dusit Niyato,et al.  Deep Reinforcement Learning for Mobile 5G and Beyond: Fundamentals, Applications, and Challenges , 2019, IEEE Vehicular Technology Magazine.

[34]  Sateesh Addepalli,et al.  Fog computing and its role in the internet of things , 2012, MCC '12.

[35]  Anfeng Liu,et al.  A Deep Learning-Based Mobile Crowdsensing Scheme by Predicting Vehicle Mobility , 2021, IEEE Transactions on Intelligent Transportation Systems.

[36]  Tian Wang,et al.  Energy-aware MAC protocol for data differentiated services in sensor-cloud computing , 2020, J. Cloud Comput..

[37]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[38]  Anfeng Liu,et al.  BD-VTE: A Novel Baseline Data Based Verifiable Trust Evaluation Scheme for Smart Network Systems , 2021, IEEE Transactions on Network Science and Engineering.

[39]  Gerhard Weiß,et al.  Distributed reinforcement learning , 1995, Robotics Auton. Syst..

[40]  Mugen Peng,et al.  Deep Reinforcement Learning-Based Mode Selection and Resource Management for Green Fog Radio Access Networks , 2018, IEEE Internet of Things Journal.

[41]  Mianxiong Dong,et al.  Intelligent resource allocation management for vehicles network: An A3C learning approach , 2020, Comput. Commun..

[42]  Teruo Higashino,et al.  Edge-centric Computing: Vision and Challenges , 2015, CCRV.

[43]  Md Zakirul Alam Bhuiyan,et al.  Multiagent Deep Reinforcement Learning for Vehicular Computation Offloading in IoT , 2021, IEEE Internet of Things Journal.

[44]  Yunheung Paek,et al.  Precise execution offloading for applications with dynamic behavior in mobile cloud computing , 2016, Pervasive Mob. Comput..

[45]  Tian Wang,et al.  Artificial intelligence aware and security-enhanced traceback technique in mobile edge computing , 2020, Comput. Commun..

[46]  Anfeng Liu,et al.  An Effective Early Message Ahead Join Adaptive Data Aggregation Scheme for Sustainable IoT , 2021, IEEE Transactions on Network Science and Engineering.

[47]  Mianxiong Dong,et al.  Result return aware offloading scheme in vehicular edge networks for IoT , 2020, Comput. Commun..