BGElasor: Elastic-Scaling Framework for Distributed Streaming Processing with Deep Neural Network

In the face of constant fluctuations and sudden bursts in data streams, the elasticity of distributed stream processing systems has become increasingly important. A proactive policy offers a powerful means of achieving effective elastic scaling. However, existing methods do not capture the latent features of the data stream, which leads to poor prediction. Poor prediction in turn results in high adaptation cost and instability. To address these issues, we propose BGElasor, a proactive, low-cost elastic-scaling framework built on accurate prediction using deep neural networks. It captures potentially complicated patterns to improve prediction accuracy, reduce adaptation cost, and avoid adaptation bumps. Experimental results show that BGElasor not only improves prediction accuracy on three kinds of typical loads, but also ensures that end-to-end latency meets QoS requirements at low cost.
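As a rough illustration of what a proactive policy means in practice, the sketch below forecasts the next input rate and sizes operator parallelism before the load arrives. It is not BGElasor's method: the moving-average `forecast_next` is a stand-in for the deep neural network predictor, and the names `plan_parallelism`, `capacity_per_instance`, and `headroom`, as well as the one-step scale-in rule used to damp adaptation bumps, are assumptions made for this example.

```python
import math


def forecast_next(history, window=3):
    """Placeholder predictor: mean of the most recent `window` rates.

    A real proactive framework would plug a learned model in here.
    """
    recent = history[-window:]
    return sum(recent) / len(recent)


def plan_parallelism(predicted_rate, capacity_per_instance, current,
                     headroom=1.2, min_instances=1):
    """Return the instance count to use for the next interval.

    Scale out directly to the needed count so capacity is ready before
    the burst; scale in by one instance at a time to avoid oscillation.
    """
    needed = max(min_instances,
                 math.ceil(predicted_rate * headroom / capacity_per_instance))
    if needed > current:
        return needed
    if needed < current:
        return current - 1
    return current


# Hypothetical input rates (tuples per second) over three intervals.
rates = [1000, 2000, 3000]
predicted = forecast_next(rates)
next_instances = plan_parallelism(predicted, capacity_per_instance=500, current=2)
```

The asymmetry between scaling out and scaling in is one simple way to trade a little extra resource cost for stability; an inaccurate predictor would trigger more of these adjustments, which is the cost the abstract attributes to poor prediction.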
