A Data Stream Processing Optimisation Framework for Edge Computing Applications

Data Stream Processing (DSP) is a widely used programming paradigm to process an unbounded event stream. Often, DSP frameworks are deployed on the cloud with a scalable resource model. One of the key requirements of DSP is to produce results with low latency. With the emergence of IoT, many event sources have been located outside the cloud which can result in higher end-to-end latency due to communication overhead. However, due to the abundance of resources at the IoT layer, Edge computing has emerged as a viable computational paradigm. In this paper, we devise an optimisation framework, consisting of a constraint satisfaction formulation and a system model, that aims to minimise end-to-end latency through appropriate placement of DSP operators either on cloud nodes or edge devices, i.e. deployed in an edge-cloud integrated environment. We test our optimisation framework using OMNeT++, with realistic topologies and power consumption data, and show that it is capable of achieving approx 1.65 times reduction of latency compared to edge-only and cloud-only placements, which in turn also reduces the energy consumption per event by up to approx 4% at the edge layer. To the best of our knowledge our optimisation framework is the first of its kind to integrate power, bandwidth and CPU constraints with latency minimisation.

[1]  Nicolas Hidalgo,et al.  Symbiosis: Sharing mobile resources for stream processing , 2014, 2014 IEEE Symposium on Computers and Communications (ISCC).

[2]  Pavel Smrz,et al.  Scheduling Decisions in Stream Processing on Heterogeneous Clusters , 2014, 2014 Eighth International Conference on Complex, Intelligent and Software Intensive Systems.

[3]  András Varga,et al.  An overview of the OMNeT++ simulation environment , 2008, SimuTools.

[4]  Rajkumar Buyya,et al.  Distributed data stream processing and edge computing: A survey on resource elasticity and future directions , 2017, J. Netw. Comput. Appl..

[5]  Margo I. Seltzer,et al.  Network-Aware Operator Placement for Stream-Processing Systems , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[6]  Walter Binder,et al.  Continuous Bytecode Instruction Counting for CPU Consumption Estimation , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[7]  Thomas Locher,et al.  Task allocation for distributed stream processing , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[8]  Ying Li,et al.  Placement Strategies for Internet-Scale Data Stream Systems , 2008, IEEE Internet Computing.

[9]  Ying Xing,et al.  Dynamic load distribution in the Borealis stream processor , 2005, 21st International Conference on Data Engineering (ICDE'05).

[10]  David Hausheer,et al.  PowerPi: Measuring and modeling the power consumption of the Raspberry Pi , 2014, 39th Annual IEEE Conference on Local Computer Networks.

[11]  Fabian Kaup Energy-efficiency and Performance in Communication Networks: Analyzing Energy-Performance Trade-offs in Communication Networks and their Implications on Future Network Structure and Management , 2017 .

[12]  Weisong Shi,et al.  Edge Computing: Vision and Challenges , 2016, IEEE Internet of Things Journal.

[13]  Mohammad Hosseini,et al.  R-Storm: Resource-Aware Scheduling in Storm , 2015, Middleware.

[14]  Yogesh L. Simmhan,et al.  RIoTBench: An IoT benchmark for distributed stream processing systems , 2017, Concurr. Comput. Pract. Exp..

[15]  Vincenzo Grassi,et al.  Optimal operator placement for distributed stream processing applications , 2016, DEBS.

[16]  Yogesh L. Simmhan,et al.  Distributed Scheduling of Event Analytics across Edge and Cloud , 2016, ACM Trans. Cyber Phys. Syst..

[17]  Michael Till Beck,et al.  Mobile Edge Computing: A Taxonomy , 2014 .

[18]  Héctor Pomares,et al.  mHealthDroid: A Novel Framework for Agile Development of Mobile Health Applications , 2014, IWAAL.

[19]  Vincenzo Grassi,et al.  Optimal Operator Replication and Placement for Distributed Stream Processing Systems , 2017, PERV.

[20]  Peter J. Stuckey,et al.  The MiniZinc Challenge 2008-2013 , 2014, AI Mag..

[21]  Li-Shiuan Peh,et al.  MobiStreams: A Reliable Distributed Stream Processing System for Mobile Devices , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[22]  Peter J. Stuckey,et al.  MiniZinc: Towards a Standard CP Modelling Language , 2007, CP.

[23]  Yogesh L. Simmhan,et al.  Cloud-Based Software Platform for Big Data Analytics in Smart Grids , 2013, Computing in Science & Engineering.