Joint Task Offloading and Resource Allocation for Quality-Aware Edge-Assisted Machine Learning Task Inference

Edge computing is essential to enhance delay-sensitive and computation-intensive machine learning (ML) task inference services. Quality of inference results, which is mainly impacted by the task data and ML models, is an important indicator impacting the system performance. In this paper, we consider a quality-aware edge-assisted ML task inference scenario and propose a resource management scheme to minimize the total task processing delay while guaranteeing the stability of all the task queues and the inference accuracy requirements of all the tasks. In our scheme, the task offloading, task data adjustment, computing resource allocation, and wireless channel allocation are jointly optimized. The Lyapunov optimization technique is adopted to transform the original optimization problem into a deterministic problem for each time slot. Considering the high complexity of the optimization problem, we design an algorithm that decomposes the problem into a task offloading and channel allocation (TOCA) sub-problem, a task data adjustment sub-problem, and a computing resource allocation sub-problem, and then solves them iteratively. A low-complexity heuristic algorithm is also designed to solve the TOCA sub-problem efficiently. Extensive simulations are conducted by varying different crucial parameters. The results demonstrate the superiority of our scheme in comparison with 4 other schemes.

[1]  Fan Wu,et al.  Joint Task Offloading and Resource Allocation for Vehicular Edge Computing Based on V2I and V2V Modes , 2023, IEEE Transactions on Intelligent Transportation Systems.

[2]  Fan Wu,et al.  Joint Task Offloading and Resource Allocation for Accuracy-Aware Machine-Learning-Based IIoT Applications , 2023, IEEE Internet of Things Journal.

[3]  B. Tang,et al.  DNN Deployment, Task Offloading, and Resource Allocation for Joint Task Inference in IIoT , 2023, IEEE Transactions on Industrial Informatics.

[4]  Laurence T. Yang,et al.  Compressing Deep Model With Pruning and Tucker Decomposition for Smart Embedded Systems , 2022, IEEE Internet of Things Journal.

[5]  Xiaojiang Du,et al.  Resource Management for Edge Intelligence (EI)-Assisted IoV Using Quantum-Inspired Reinforcement Learning , 2022, IEEE Internet of Things Journal.

[6]  Zhiqing Wei,et al.  Intelligent Computation Offloading for MEC-Based Cooperative Vehicle Infrastructure System: A Deep Reinforcement Learning Approach , 2022, IEEE Transactions on Vehicular Technology.

[7]  H. Vincent Poor,et al.  Age of Information in Energy Harvesting Aided Massive Multiple Access Networks , 2021, IEEE Journal on Selected Areas in Communications.

[8]  Liang Zhao,et al.  Collaborative Service Placement, Task Scheduling, and Resource Allocation for Task Offloading with Edge-Cloud Cooperation , 2022, IEEE Transactions on Mobile Computing.

[9]  Lei Guo,et al.  Resource Management for Pervasive-Edge-Computing-Assisted Wireless VR Streaming in Industrial Internet of Things , 2021, IEEE Transactions on Industrial Informatics.

[10]  Daniel E. Lucani,et al.  Optimal Accuracy-Time Trade-off for Deep Learning Services in Edge Computing Systems , 2020, ICC 2021 - IEEE International Conference on Communications.

[11]  Bo Yang,et al.  Offloading Optimization in Edge Computing for Deep-Learning-Enabled Target Tracking by Internet of UAVs , 2020, IEEE Internet of Things Journal.

[12]  Xu Chen,et al.  Joint Multiuser DNN Partitioning and Computational Resource Allocation for Collaborative Edge Intelligence , 2020, IEEE Internet of Things Journal.

[13]  Yilin Wu,et al.  FPCB Surface Defect Detection: A Decoupled Two-Stage Object Detection Framework , 2021, IEEE Transactions on Instrumentation and Measurement.

[14]  Mario Di Francesco,et al.  Distributed Inference Acceleration with Adaptive DNN Partitioning and Offloading , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[15]  Zhu Han,et al.  Joint Optimization Strategy of Computation Offloading and Resource Allocation in Multi-Access Edge Computing Environment , 2020, IEEE Transactions on Vehicular Technology.

[16]  Feng Lyu,et al.  Joint Channel Allocation and Resource Management for Stochastic Computation Offloading in MEC , 2020, IEEE Transactions on Vehicular Technology.

[17]  Xiongwen Zhao,et al.  Learning-Based Context-Aware Resource Allocation for Edge-Computing-Empowered Industrial IoT , 2020, IEEE Internet of Things Journal.

[18]  Nada Golmie,et al.  Toward Edge-Based Deep Learning in Industrial Internet of Things , 2020, IEEE Internet of Things Journal.

[19]  Hong-Yuan Mark Liao,et al.  YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[20]  Yaonan Wang,et al.  A Surface Defect Detection Framework for Glass Bottle Bottom Using Visual Attention Model and Wavelet Transform , 2020, IEEE Transactions on Industrial Informatics.

[21]  Bo Yang,et al.  Mobile-Edge-Computing-Based Hierarchical Machine Learning Tasks Distribution for IIoT , 2020, IEEE Internet of Things Journal.

[22]  Mehdi Bennis,et al.  Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning , 2019, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[23]  Albert Y. Zomaya,et al.  Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence , 2019, IEEE Internet of Things Journal.

[24]  Xiaobo Zhou,et al.  Edge Computing in Industrial Internet of Things: Architecture, Advances and Challenges , 2020, IEEE Communications Surveys & Tutorials.

[25]  Yanlin Yue,et al.  AI-Enhanced Offloading in Edge Computing: When Machine Learning Meets Industrial IoT , 2019, IEEE Network.

[26]  Li Zhou,et al.  Energy-Latency Tradeoff for Energy-Aware Offloading in Mobile Edge Computing Networks , 2018, IEEE Internet of Things Journal.

[27]  Liang Gao,et al.  Adaptive Fog Configuration for the Industrial Internet of Things , 2018, IEEE Transactions on Industrial Informatics.

[28]  Kin K. Leung,et al.  When Edge Meets Learning: Adaptive Control for Resource-Constrained Distributed Machine Learning , 2018, IEEE INFOCOM 2018 - IEEE Conference on Computer Communications.

[29]  R. Horst,et al.  DC Programming: Overview , 1999 .

[30]  K. Tammer The application of parametric optimization and imbedding to the foundation and realization of a generalized primal decomposition approach , 1987 .