Managing Latency in Edge-Cloud Environment

Abstract Modern Cyber-physical Systems (CPS) include applications like smart traffic, smart agriculture, smart power grid, etc. Commonly, these systems are distributed and composed of end-user applications and microservices that typically run in the cloud. The connection with the physical world, which is inherent to CPS, brings the need to operate and respond in real-time. As the cloud becomes part of the computation loop, the real-time requirements have to be also reflected by the cloud. In this paper, we present an approach that provides soft real-time guarantees on the response time of services running in cloud and edge-cloud (i.e., cloud geographically close to the end-user), where these services are developed in high-level programming languages. In particular, we elaborate a method that allows us to predict the upper bound of the response time of a service when sharing the same computer with other services. Importantly, as our approach focuses on minimizing the impact on the developer of such services, it does not require any special programming model nor limits usage of common libraries, etc.

[1]  ChanaInderveer,et al.  QoS-Aware Autonomic Resource Management in Cloud Computing , 2015 .

[2]  Lingjia Tang,et al.  Bubble-flux: precise online QoS management for increased utilization in warehouse scale computers , 2013, ISCA.

[3]  Matthias Hein,et al.  Non-negative least squares for high-dimensional linear models: consistency and sparse recovery without regularization , 2012, 1205.0953.

[4]  Maryam Amiri,et al.  Survey on prediction models of applications for resources provisioning in cloud , 2017, J. Netw. Comput. Appl..

[5]  Philip Samuel,et al.  Service‐level agreement–aware scheduling and load balancing of tasks in cloud , 2019, Softw. Pract. Exp..

[6]  Kevin Skadron,et al.  Bubble-up: Increasing utilization in modern warehouse scale computers via sensible co-locations , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[7]  Muhammad Arshad Islam,et al.  SLA-RALBA: cost-efficient and resource-aware load balancing algorithm for cloud computing , 2019, The Journal of Supercomputing.

[8]  Aman Kansal,et al.  Q-clouds: managing performance interference effects for QoS-aware clouds , 2010, EuroSys '10.

[9]  Jesús Montes,et al.  FMonE: A Flexible Monitoring Solution at the Edge , 2018, Wirel. Commun. Mob. Comput..

[10]  Jie Chen,et al.  Reweighted nonnegative least-mean-square algorithm , 2016, Signal Process..

[11]  Danna Zhou,et al.  d. , 1840, Microbial pathogenesis.

[12]  Rami Bahsoon,et al.  A Systematic Review of Service Level Management in the Cloud , 2015, ACM Comput. Surv..

[13]  Sherali Zeadally,et al.  A survey and taxonomy on energy efficient resource allocation techniques for cloud computing systems , 2016, Computing.

[14]  Albert Y. Zomaya,et al.  CtrlCloud: Performance-Aware Adaptive Control for Shared Resources in Clouds , 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[15]  Zoltán Ádám Mann,et al.  Allocation of Virtual Machines in Cloud Data Centers—A Survey of Problem Models and Optimization Algorithms , 2015, ACM Comput. Surv..

[16]  Jie Liu,et al.  Cuanta: quantifying effects of shared on-chip resource interference for consolidated virtual machines , 2011, SoCC.

[17]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[18]  Jérôme François,et al.  A Holistic Monitoring Service for Fog/Edge Infrastructures: A Foresight Study , 2017, 2017 IEEE 5th International Conference on Future Internet of Things and Cloud (FiCloud).

[19]  Prasanta K. Jana,et al.  SLA-based task scheduling algorithms for heterogeneous multi-cloud environment , 2017, The Journal of Supercomputing.

[20]  Rajiv Ranjan,et al.  Cross-Layer Multi-Cloud Real-Time Application QoS Monitoring and Benchmarking As-a-Service Framework , 2015, IEEE Transactions on Cloud Computing.

[21]  Hiranya Jayathilaka,et al.  Response time service level agreements for cloud-hosted web applications , 2015, SoCC.

[22]  Amer Diwan,et al.  The DaCapo benchmarks: java benchmarking development and analysis , 2006, OOPSLA '06.

[23]  Bowen Zhou,et al.  Pythia: Improving Datacenter Utilization via Precise Contention Prediction for Multiple Co-located Workloads , 2018, Middleware.

[24]  Mahadev Satyanarayanan,et al.  The Emergence of Edge Computing , 2017, Computer.

[25]  Xi Chen,et al.  CloudScope: Diagnosing and Managing Performance Interference in Multi-tenant Clouds , 2015, 2015 IEEE 23rd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems.

[26]  Xin Yao,et al.  A Survey and Taxonomy of Self-Aware and Self-Adaptive Cloud Autoscaling Systems , 2016, ACM Comput. Surv..

[27]  Christina Delimitrou,et al.  Paragon: QoS-aware scheduling for heterogeneous datacenters , 2013, ASPLOS '13.

[28]  Prem Prakash Jayaraman,et al.  Osmotic Monitoring of Microservices between the Edge and Cloud , 2018, 2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[29]  Christina Delimitrou,et al.  Quasar: resource-efficient and QoS-aware cluster management , 2014, ASPLOS.

[30]  Tommaso Cucinotta,et al.  Challenges in real-time virtualization and predictable cloud computing , 2014, J. Syst. Archit..

[31]  Zhongjie Wang,et al.  Re-deploying Microservices in Edge and Cloud Environment for the Optimization of User-Perceived Service Quality , 2019, ICSOC.

[32]  Mira Mezini,et al.  Da capo con scala: design and analysis of a scala benchmark suite for the java virtual machine , 2011, OOPSLA '11.