An indexed non-equijoin algorithm based on sliding windows over data streams

Processing a join over unbounded input streams requires unbounded memory, since every tuple in one infinite stream must be compared with every tuple in the other. In fact, most join queries over unbounded input streams are restricted to finite memory due to sliding window constraints. So far, non-indexed and indexed stream equijoin algorithms based on sliding windows have been proposed in many literatures. However, none of them takes non-equijoin into consideration. In many cases, non-equijoin queries occur frequently. Hence, it is worth to discuss how to process non-equijoin queries effectively and efficiently. In this paper, we propose an indexed join algorithm for supporting non-equijoin queries. The experimental results show that our indexed non-equijoin techniques are more efficient than those without index.

[1]  Jeffrey F. Naughton,et al.  Evaluating window joins over unbounded streams , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[2]  Alon Y. Halevy,et al.  An adaptive query execution system for data integration , 1999, SIGMOD '99.

[3]  Michael Stonebraker,et al.  Optimization of parallel query execution plans in XPRS , 1991, [1991] Proceedings of the First International Conference on Parallel and Distributed Information Systems.

[4]  David Maier,et al.  Applying Punctuation Schemes to Queries Over Continuous Data Streams , 2003, IEEE Data Engineering Bulletin.

[5]  Divesh Srivastava,et al.  On computing correlated aggregates over continual data streams , 2001, SIGMOD '01.

[6]  Rajeev Motwani,et al.  Sampling from a moving window over streaming data , 2002, SODA '02.

[7]  A. Prasad Sistla,et al.  Modeling and querying moving objects , 1997, Proceedings 13th International Conference on Data Engineering.

[8]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[9]  A. N. Wilschut,et al.  Dataflow query execution in a parallel main-memory environment , 1991, Distributed and Parallel Databases.

[10]  Philippe Bonnet,et al.  Towards Sensor Database Systems , 2001, Mobile Data Management.

[11]  Lukasz Golab,et al.  On Indexing Sliding Windows over Online Data Streams , 2004, EDBT.