Dynamic Component Placement and Request Scheduling for IoT Big Data Streaming

Internet-of-Things (IoT) big data streaming applications, such as video surveillance and automatic driving, tend to use mobile-edge computing (MEC) infrastructure to enhance their performance and augment their functionalities. Although extensive previous studies have worked on offloading requests to MEC servers, none of them has comprehensively and thoroughly considered the important features of IoT data streaming applications (i.e., component dependency and dynamic arrival) and the infrastructure provisioning (i.e., capacity constraint and colocation interference). In this article, we consider the offloading problem for dynamically arrived IoT data streaming requests on MEC servers in real time. We model it as a delay-sensitive multiuser multiresource online offloading problem respecting component dependency and capacity constraint. The problem is NP-hard with offloading decisions coupling together. To solve it, we decouple the problem into component placement problem and request scheduling problem and propose a two-stage DPGPD algorithm with polynomial time complexity. We show the first stage dynamic programming (DP) algorithm is the optimal solution and the second-stage greedy primal–dual (GPD) algorithm is asymptotic optimal. The simulation results show that our solution is effective yet efficient compared to benchmark solutions. (DP provides the optimal placement layout with <inline-formula> <tex-math notation="LaTeX">$12 \times $ </tex-math></inline-formula> less decision time of Gurobi; and GPD provides the asymptotic optimal scheduling with <inline-formula> <tex-math notation="LaTeX">$5 \times $ </tex-math></inline-formula> less average waiting time compared to least work left (LWL) in heavy workload.) We implement a dedicated prototype and exploit several representative big data streaming applications to evaluate it. Lab-scale experiment shows that our solution can provide over <inline-formula> <tex-math notation="LaTeX">$3 \times $ </tex-math></inline-formula> less total completion time compared to local execution.

[1]  Bhaskar Krishnamachari,et al.  Hermes: Latency Optimal Task Assignment for Resource-constrained Mobile Computing , 2017, IEEE Transactions on Mobile Computing.

[2]  Alec Wolman,et al.  MAUI: making smartphones last longer with code offload , 2010, MobiSys '10.

[3]  Dipankar Raychaudhuri,et al.  Hetero-Edge: Orchestration of Real-time Vision Applications on Heterogeneous Edge Clouds , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[4]  Zhuo Chen,et al.  Edge Analytics in the Internet of Things , 2015, IEEE Pervasive Computing.

[5]  M. Herbster,et al.  Service Placement with Provable Guarantees in Heterogeneous Edge Computing Systems , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[6]  Yuan Zhang,et al.  To offload or not to offload: An efficient code partition algorithm for mobile cloud computing , 2012, 2012 IEEE 1st International Conference on Cloud Networking (CLOUDNET).

[7]  Ying Jun Zhang,et al.  Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks , 2018, IEEE Transactions on Mobile Computing.

[8]  Y. Charlie Hu,et al.  Mobility Support in Cellular Networks: A Measurement Study on Its Configurations and Implications , 2018, Internet Measurement Conference.

[9]  M. Siekkinen,et al.  Edge Computing Assisted Adaptive Mobile Video Streaming , 2019, IEEE Transactions on Mobile Computing.

[10]  Long Lu,et al.  StreamBox-TZ: Secure Stream Analytics at the Edge with TrustZone , 2018, USENIX ATC.

[11]  Kaoru Ota,et al.  Edge-Assisted Stream Scheduling Scheme for the Green-Communication-Based IoT , 2019, IEEE Internet of Things Journal.

[12]  A. Salman Avestimehr,et al.  Communication-Aware Scheduling of Serial Tasks for Dispersed Computing , 2018, 2018 IEEE International Symposium on Information Theory (ISIT).

[13]  Yunhao Liu,et al.  Beyond QoE: Diversity Adaption in Video Streaming at the Edge , 2019, 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS).

[14]  Zhihan Lv,et al.  Next-Generation Big Data Analytics: State of the Art, Challenges, and Future Research Topics , 2017, IEEE Transactions on Industrial Informatics.

[15]  Byung-Gon Chun,et al.  CloneCloud: elastic execution between mobile device and cloud , 2011, EuroSys '11.

[16]  Koen E. A. van de Sande,et al.  Selective Search for Object Recognition , 2013, International Journal of Computer Vision.

[17]  D. Vere-Jones Markov Chains , 1972, Nature.

[18]  Mahadev Satyanarayanan,et al.  A Scalable and Privacy-Aware IoT Service for Live Video Analytics , 2017, MMSys.

[19]  Xu Chen,et al.  ERP: Edge Resource Pooling for Data Stream Mobile Computing , 2019, IEEE Internet of Things Journal.

[20]  Tao Jiang,et al.  Edge Computing Framework for Cooperative Video Processing in Multimedia IoT Systems , 2018, IEEE Transactions on Multimedia.

[21]  Thomas F. La Porta,et al.  Service Placement and Request Scheduling for Data-intensive Applications in Edge Clouds , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[22]  Alexander L. Stolyar,et al.  Shadow-Routing Based Dynamic Algorithms for Virtual Machine Placement in a Network Cloud , 2013, IEEE Transactions on Cloud Computing.

[23]  Weifa Liang,et al.  Task Offloading with Network Function Requirements in a Mobile Edge-Cloud Network , 2019, IEEE Transactions on Mobile Computing.

[24]  Yi Lu,et al.  Asymptotic independence of queues under randomized load balancing , 2012, Queueing Syst. Theory Appl..

[25]  Qiang Liu,et al.  DARE: Dynamic Adaptive Mobile Augmented Reality with Edge Computing , 2018, 2018 IEEE 26th International Conference on Network Protocols (ICNP).

[26]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[27]  Edward A. Lee,et al.  AWStream: adaptive wide-area streaming analytics , 2018, SIGCOMM.

[28]  Ning Zhang,et al.  Joint Computation and Communication Resource Allocation for Energy-Efficient Mobile Edge Networks , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[29]  Mor Harchol-Balter,et al.  Load Balancing Guardrails: Keeping Your Heavy Traffic on the Road to Low Response Times , 2019, Abstracts of the 2019 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems.

[30]  Alexander L. Stolyar,et al.  Maximizing Queueing Network Utility Subject to Stability: Greedy Primal-Dual Algorithm , 2005, Queueing Syst. Theory Appl..

[31]  Ramesh Govindan,et al.  Odessa: enabling interactive perception applications on mobile devices , 2011, MobiSys '11.

[32]  Gerhard J. Woeginger,et al.  There is no Asymptotic PTAS for Two-Dimensional Vector Packing , 1997, Inf. Process. Lett..

[33]  Jun Cai,et al.  A Multi-User Mobile Computation Offloading and Transmission Scheduling Mechanism for Delay-Sensitive Applications , 2020, IEEE Transactions on Mobile Computing.

[34]  Xu Chen,et al.  Learning Driven Computation Offloading for Asymmetrically Informed Edge Computing , 2019, IEEE Transactions on Parallel and Distributed Systems.

[35]  Yu Wang,et al.  Cloudlet Placement and Task Allocation in Mobile Edge Computing , 2019, IEEE Internet of Things Journal.

[36]  Karim Habak,et al.  COSMOS: computation offloading as a service for mobile devices , 2014, MobiHoc '14.

[37]  Chita R. Das,et al.  D-factor: a quantitative model of application slow-down in multi-resource shared systems , 2012, SIGMETRICS '12.

[38]  Min Sheng,et al.  Mobile-Edge Computing: Partial Computation Offloading Using Dynamic Voltage Scaling , 2016, IEEE Transactions on Communications.

[39]  Wenzhong Li,et al.  Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Computing , 2015, IEEE/ACM Transactions on Networking.

[40]  Y. Charlie Hu,et al.  Furion: Engineering High-Quality Immersive Virtual Reality on Today's Mobile Devices , 2017, IEEE Transactions on Mobile Computing.

[41]  Alexander L. Stolyar,et al.  A large-scale service system with packing constraints: minimizing the number of occupied servers , 2013, SIGMETRICS '13.

[42]  Min Dong,et al.  Joint offloading decision and resource allocation for multi-user multi-task mobile cloud , 2016, 2016 IEEE International Conference on Communications (ICC).

[43]  R. Srikant,et al.  Stochastic models of load balancing and scheduling in cloud computing clusters , 2012, 2012 Proceedings IEEE INFOCOM.

[44]  Hai Jin,et al.  Towards load-balanced VNF assignment in geo-distributed NFV Infrastructure , 2017, 2017 IEEE/ACM 25th International Symposium on Quality of Service (IWQoS).

[45]  Jiannong Cao,et al.  Joint Computation Partitioning and Resource Allocation for Latency Sensitive Applications in Mobile Edge Clouds , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[46]  Matti Siekkinen,et al.  CloudVR: Cloud Accelerated Interactive Mobile Virtual Reality , 2018, ACM Multimedia.

[47]  Kin K. Leung,et al.  Online Placement of Multi-Component Applications in Edge Computing Environments , 2016, IEEE Access.

[48]  Ben Liang,et al.  Joint Offloading Decision and Resource Allocation with Uncertain Task Computing Requirement , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[49]  K. B. Letaief,et al.  A Survey on Mobile Edge Computing: The Communication Perspective , 2017, IEEE Communications Surveys & Tutorials.