Exploiting Application Semantics in Monitoring Real-Time Data Streams

Real-time stream processing applications must be prepared to operate under overload. Existing load shedding techniques are not suitable for processing real-time data streams because their tuple dropping policies may violate application deadlines in an uncontrolled way. We argue that a more precise load shedding model, e.g., the (m, k) deadline model adopted in this paper, is much more appropriate than the commonly used random dropping policy. Based on the (m, k) load shedding model and a novel load shedding approach, we propose a concrete (m, k) scheduling algorithm, called SOSA-DBP, that exploits application semantics. Experimental results show that SOSA-DBP achieves a significant performance gain over the existing (m, k) scheduling algorithm.
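The abstract does not define the (m, k) deadline model. In the (m, k)-firm literature it is usually taken to mean that at least m out of any k consecutive tuples must meet their deadlines. The following is a minimal sketch under that assumption, showing how such a constraint can be checked over a sequence of per-tuple outcomes. The function name `satisfies_mk` and its parameters are hypothetical, and this is only a constraint checker, not the SOSA-DBP algorithm.

```python
from collections import deque


def satisfies_mk(outcomes, m, k):
    """Check an assumed (m, k) constraint over a sequence of outcomes.

    outcomes: iterable of booleans, True if the tuple met its deadline,
              False if it was dropped or missed its deadline.
    Returns True if every window of k consecutive outcomes contains
    at least m True values. Sequences shorter than k pass vacuously.
    """
    window = deque(maxlen=k)  # the last k outcomes seen so far
    met = 0                   # number of True values currently in the window
    for ok in outcomes:
        if len(window) == k:
            # The oldest outcome is about to be evicted by append().
            met -= int(window[0])
        window.append(bool(ok))
        met += int(bool(ok))
        if len(window) == k and met < m:
            return False
    return True


# Dropping every other tuple keeps 1 of any 2 consecutive tuples.
alternating = [True, False, True, False, True, False]
# Two consecutive drops break the (1, 2) constraint.
bursty = [True, False, False, True]
```

Both sequences above drop half of their tuples, yet only the first keeps the drops evenly spaced. This illustrates why a drop pattern chosen at random can violate a constraint that a controlled policy with the same drop rate would satisfy.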
