State and runtime-aware scheduling in elastic stream computing systems

Abstract State and runtime-aware scheduling is one of the problems that is hard to resolve in elastic big data stream computing systems, as the state of each vertex is different, and the arrival rate of data streams fluctuates over time. A state and runtime-aware scheduling framework should be able to dynamically adapt to the fluctuation of the arrival rate of data streams and be aware of vertex states and resource availability. Currently, there is an increasing number of research work focusing on application scheduling in stream computing systems, however, this problem is still far from being completely solved. In this paper, we focus on the state of vertex in applications and the runtime feature of resources in a data center, and propose a state and runtime-aware scheduling framework (Sra-Stream) for elastic streaming computing systems, which incorporates the following features: (1) Profiling mathematical relationships between the system response time and the arrival rate of data streams, and identifying relevant resource constraints to meet the low response time and high throughput objectives. (2) Classifying vertex into stateless vertex or stateful vertex from a quantitative perspective, and achieving vertex parallelization by considering the state of the vertex. (3) Demonstrating a proposed stream application scheduling scheme consisting of a modified first-fit based runtime-aware data tuple scheduling strategy at the initial stage, and a maximum latency-sensitive based runtime-aware data stream scheduling strategy at the online stage, by considering the current scheduling status of the application. (4) Evaluating the achievement levels of low response time and high throughput objectives in a real-world elastic stream computing system. Experimental results conclusively demonstrate that the proposed Sra-Stream provides significant performance improvements on achieving the low system response time and high system throughput.

[1]  Tiziano De Matteis,et al.  Keep calm and react with foresight: strategies for low-latency and energy-efficient elastic data stream processing , 2016, PPoPP.

[2]  Kenli Li,et al.  A Hybrid Chemical Reaction Optimization Scheme for Task Scheduling on Heterogeneous Computing Systems , 2015, IEEE Transactions on Parallel and Distributed Systems.

[3]  Marko Bertogna,et al.  Schedulability Analysis of Conditional Parallel Task Graphs in Multicore Systems , 2017, IEEE Transactions on Computers.

[4]  Dongyu Qiu,et al.  Analysis and Optimization of Big-Data Stream Processing , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[5]  R. Srikant,et al.  Scheduling Storms and Streams in the Cloud , 2015, SIGMETRICS.

[6]  Valeria Cardellini,et al.  Elastic stateful stream processing in storm , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[7]  Xiang Li,et al.  Task Allocation for Stream Processing with Recovery Latency Guarantee , 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).

[8]  Otto Carlos Muniz Bandeira Duarte,et al.  A Performance Comparison of Open-Source Stream Processing Platforms , 2016, 2016 IEEE Global Communications Conference (GLOBECOM).

[9]  Indranil Gupta,et al.  Stateful Scalable Stream Processing at LinkedIn , 2017, Proc. VLDB Endow..

[10]  Fernando Pedone,et al.  Efficient and Deterministic Scheduling for Parallel State Machine Replication , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[11]  Wann-Yun Shieh,et al.  Energy and transition-aware runtime task scheduling for multicore processors , 2013, J. Parallel Distributed Comput..

[12]  Carlos A. Varela,et al.  Maximum Sustainable throughput Prediction for Data Stream Processing over Public Clouds , 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[13]  Alexandros Tzannes,et al.  Lazy Scheduling: A Runtime Adaptive Scheduler for Declarative Parallelism , 2014, TOPL.

[14]  Denis Trystram,et al.  A New On-line Method for Scheduling Independent Tasks , 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[15]  Peter Kilpatrick,et al.  State access patterns in stream parallel computations , 2018, Int. J. High Perform. Comput. Appl..

[16]  Dong Yang,et al.  S-Storm: A Slot-Aware Scheduling Strategy for Even Scheduler in Storm , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[17]  Nicolas Lumineau,et al.  A Preventive Auto-Parallelization Approach for Elastic Stream Processing , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[18]  Kian-Lee Tan,et al.  ChronoStream: Elastic stateful stream computation in the cloud , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[19]  Marco Danelutto,et al.  Elastic-PPQ: A two-level autonomic system for spatial preference query processing over dynamic data streams , 2018, Future Gener. Comput. Syst..

[20]  Yogesh L. Simmhan,et al.  Model-driven Scheduling for Distributed Stream Processing Systems , 2017, J. Parallel Distributed Comput..

[21]  Jian Tang,et al.  Performance Modeling and Predictive Scheduling for Distributed Stream Data Processing , 2016, IEEE Transactions on Big Data.

[22]  Rajkumar Buyya,et al.  E-Storm: Replication-Based State Management in Distributed Stream Processing Systems , 2017, 2017 46th International Conference on Parallel Processing (ICPP).

[23]  Jiannong Cao,et al.  Multi-User Computation Partitioning for Latency Sensitive Mobile Cloud Applications , 2015, IEEE Transactions on Computers.

[24]  James Demmel,et al.  Runtime Data Layout Scheduling for Machine Learning Dataset , 2017, 2017 46th International Conference on Parallel Processing (ICPP).

[25]  Shang Gao,et al.  Rethinking elastic online scheduling of big data streaming applications over high-velocity continuous data streams , 2018, The Journal of Supercomputing.

[26]  Chunlin Li,et al.  Real-time scheduling based on optimized topology and communication traffic in distributed real-time computation platform of storm , 2017, J. Netw. Comput. Appl..

[27]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.

[28]  Tiziano De Matteis,et al.  Parallel Patterns for Window-Based Stateful Operators on Data Streams: An Algorithmic Skeleton Approach , 2017, International Journal of Parallel Programming.

[29]  Robert Grimm,et al.  A catalog of stream processing optimizations , 2014, ACM Comput. Surv..

[30]  Zhiling Lan,et al.  Job scheduling with adjusted runtime estimates on production supercomputers , 2013, J. Parallel Distributed Comput..

[31]  Kun-Lung Wu,et al.  Elastic Scaling for Data Stream Processing , 2014, IEEE Transactions on Parallel and Distributed Systems.

[32]  Tiziano De Matteis,et al.  Proactive elasticity and energy awareness in data stream processing , 2017, J. Syst. Softw..

[33]  Scott Shenker,et al.  Discretized streams: fault-tolerant streaming computation at scale , 2013, SOSP.

[34]  Risat Pathan,et al.  Scheduling Parallel Real-Time Recurrent Tasks on Multicore Platforms , 2018, IEEE Transactions on Parallel and Distributed Systems.

[35]  Hai Jin,et al.  Runtime‐aware adaptive scheduling in stream processing , 2016, Concurr. Comput. Pract. Exp..

[36]  Takuya Azumi,et al.  Scheduling parallel and distributed processing for automotive data stream management system , 2017, J. Parallel Distributed Comput..

[37]  Seif Haridi,et al.  State Management in Apache Flink®: Consistent Stateful Distributed Stream Processing , 2017, Proc. VLDB Endow..

[38]  Jing Zhang,et al.  The Real-Time Scheduling Strategy Based on Traffic and Load Balancing in Storm , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[39]  Valeria Cardellini,et al.  Decentralized self-adaptation for elastic Data Stream Processing , 2018, Future Gener. Comput. Syst..

[40]  Jignesh M. Patel,et al.  Storm@twitter , 2014, SIGMOD Conference.