Optimizations enabled by a relational data model view to querying data streams

We postulate that the popularity and efficiency of SQL for querying relational databases makes the language a viable solution to retrieving data from data streams. In response, we have developed a system, dQUOB, that uses SQL queries to extract data from streaming data in real time. The high performance needs of applications such as scientific visualization motivates our search for optimizations to improve query evaluation efficiency. The purpose of this paper is to discuss the unique optimizations we have realized by a database point of view to streaming data and to show that the enhanced conceptual model of viewing data streams as relations has reasonable overhead.

[1]  John A. Reed,et al.  Development of an intelligent monitoring and control system for a heterogeneous numerical propulsion system simulation , 1995, Proceedings of Simulation Symposium.

[2]  Ling Liu,et al.  Query routing in large-scale digital library systems , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[3]  Karsten Schwan,et al.  dQCOB: managing large data flows using dynamic embedded queries , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[4]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[5]  Karsten Schwan,et al.  Run-time detection in parallel and distributed systems: application to safety-critical systems , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[6]  Christian S. Jensen,et al.  Transitioning Temporal Support in TSQL2 to SQL3 , 1997, Temporal Databases, Dagstuhl.

[7]  Gregg Rothermel,et al.  Performing data flow testing on classes , 1994, SIGSOFT '94.

[8]  Jeffrey S. Vetter,et al.  Autopilot: adaptive control of distributed applications , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[9]  Karsten Schwan,et al.  ACDS: Adapting computational data streams for high performance , 2000, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000.

[10]  Richard T. Snodgrass,et al.  A relational approach to monitoring complex systems , 1988, TOCS.

[11]  Klara Nahrstedt,et al.  2K: a distributed operating system for dynamic heterogeneous environments , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[12]  Calton Pu,et al.  Operational information systems: an example from the airline industry , 2000, WIESS'00.

[13]  Karsten Schwan,et al.  Application-Dependent Dynamic Monitoring of Distributed and Parallel Systems , 1993, IEEE Trans. Parallel Distributed Syst..

[14]  Karsten Schwan,et al.  Event services for high performance computing , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[15]  Calton Pu,et al.  Differential evaluation of continual queries , 1996, Proceedings of 16th International Conference on Distributed Computing Systems.

[16]  Joel H. Saltz,et al.  Object-Relational Queries into Multidimensional Databases with the Active Data Repository , 1999, Parallel Process. Lett..

[17]  Dennis Gannon,et al.  A component based services architecture for building distributed applications , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[18]  David J. DeWitt,et al.  Equi-depth multidimensional histograms , 1988, SIGMOD '88.

[19]  Laks V. S. Lakshmanan,et al.  Revisiting the Hierarchical Data Model , 1999 .