A Pareto-Efficient Algorithm for Data Stream Processing at Network Edges

Data stream processing has received considerable attention from both research community and industry over the last years. Since latency is a key issue in data stream processing environments, the majority of the works existing in the literature focus on minimizing the latency experienced by the users. The aforementioned minimization takes place by assigning the data stream processing components close to data sources. Server consolidation is also a key issue for drastically reducing energy consumption in computing systems. Unfortunately, energy consumption and latency are two objective functions that may be in conflict with each other. Therefore, when the target function is to minimize energy consumption, the delay experienced by users may be considerable high, and the opposite. For the above reason there is a dire need to design strategies such that by targeting the minimization of energy consumption, there is a graceful degradation in latency, as well as the opposite. To achieve the above, we propose a Pareto-efficient algorithm that tackles the problem of data processing tasks placement simultaneously in both dimensions regarding the energy consumption and latency. The proposed algorithm outputs a set of solutions that are not dominated by any solution within the set regarding energy consumption and latency. The experimental results show that the proposed approach is superior against single-solution approaches because by targeting one objective function the other one can be gracefully degraded by choosing the appropriate solution.

[1]  Albert Y. Zomaya,et al.  Data Stream Processing at Network Edges , 2018, 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[2]  Thanasis Loukopoulos,et al.  Application-Aware Workload Consolidation to Minimize Both Energy Consumption and Network Load in Cloud Environments , 2013, 2013 42nd International Conference on Parallel Processing.

[3]  Antti Ylä-Jääski,et al.  Virtual Machine Consolidation with Multiple Usage Prediction for Energy-Efficient Cloud Data Centers , 2020, IEEE Transactions on Services Computing.

[4]  Enda Barrett,et al.  An advanced reinforcement learning approach for energy-aware virtual machine consolidation in cloud data centers , 2017, 2017 12th International Conference for Internet Technology and Secured Transactions (ICITST).

[5]  Tiziano De Matteis,et al.  Proactive elasticity and energy awareness in data stream processing , 2017, J. Syst. Softw..

[6]  Keqin Li,et al.  Re-Stream: Real-time and energy-efficient resource scheduling in big data stream computing environments , 2015, Inf. Sci..

[7]  Ibrahim Matta,et al.  BRITE: an approach to universal topology generation , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[8]  Jian Tang,et al.  A predictive scheduling framework for fast and distributed stream data processing , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[9]  Fei Wang,et al.  An Iterative Budget Algorithm for Dynamic Virtual Machine Consolidation Under Cloud Computing Environment , 2018, IEEE Transactions on Services Computing.

[10]  Samuel Kounev,et al.  Analysis of the Influences on Server Power Consumption and Energy Efficiency for CPU-Intensive Workloads , 2015, ICPE.

[11]  Vincenzo Grassi,et al.  Optimal operator placement for distributed stream processing applications , 2016, DEBS.