Linear Road : benchmarking stream-based data management systems

This thesis describes the design, implementation, and execution of the Linear Road benchmark for stream-based data management systems. The motivation for benchmarking and the selection of the benchmark application are described. Test harness implementation is discussed, as are experiences using the benchmark to evaluate the Aurora engine. Effects of this work on the evolution of the Aurora engine are also discussed. Streams consist of continuous feeds of data from external data sources such as sensor networks or other monitoring systems. Stream data management systems execute continuous and historical queries over these streams, producing query results in real-time. This benchmark provides a means of comparing the functionality and performance of stream-based data management systems relative to each other and to relational systems. The benchmark presented is motivated by the increasing prevalence of “variable tolling” on highway systems throughout the world. Variable tolling uses dynamically determined factors such as congestion levels and accident proximity to calculate tolls. Linear Road specifies a variable tolling system for a fictional urban area, including such features as accident detection and alerts, traffic congestion measurements, toll calculations, and ad hoc requests for travel time predictions and account balances. This benchmark has already been adopted in the Aurora [ACC03] and STREAM [MWA03] streaming data management systems. Thesis Supervisor: Michael Stonebraker Title: Adjunct Professor

[1]  Michael Stonebraker,et al.  Load Shedding in a Data Stream Manager , 2003, VLDB.

[2]  Jennifer Widom,et al.  A Data Stream Management System for Network Traffic Management , 2001 .

[3]  John T. Kohl,et al.  The Zephyr Notification Service , 1988, USENIX Winter.

[4]  Qi Yang,et al.  A SIMULATION LABORATORY FOR EVALUATION OF DYNAMIC TRAFFIC MANAGEMENT SYSTEM , 1997 .

[5]  David J. DeWitt,et al.  NiagaraCQ: a scalable continuous query system for Internet databases , 2000, SIGMOD '00.

[6]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[7]  Michael Stonebraker,et al.  A measure of transaction processing power , 1985 .

[8]  Samuel Madden,et al.  Fjording the stream: an architecture for queries over streaming sensor data , 2002, Proceedings 18th International Conference on Data Engineering.

[9]  Jennifer Widom,et al.  Query Processing, Resource Management, and Approximation ina Data Stream Management System , 2002 .

[10]  Haris N. Koutsopoulos,et al.  Simulation Laboratory for Evaluating Dynamic Traffic Management Systems , 1997 .

[11]  David J. DeWitt,et al.  The oo7 Benchmark , 1993, SIGMOD Conference.

[12]  Michael Stonebraker,et al.  The Sequoia 2000 Benchmark , 1993, SIGMOD Conference.

[13]  David Maier,et al.  Applying Punctuation Schemes to Queries Over Continuous Data Streams , 2003, IEEE Data Engineering Bulletin.

[14]  StonebrakerMichael,et al.  Aurora: a new model and architecture for data stream management , 2003, VLDB 2003.

[15]  David J. DeWitt,et al.  Benchmarking Database Systems A Systematic Approach , 1983, VLDB.

[16]  David J. DeWitt,et al.  The BUCKY object-relational benchmark , 1997, SIGMOD '97.

[17]  Jim Gray,et al.  The Benchmark Handbook for Database and Transaction Systems , 1993 .

[18]  Jennifer Widom,et al.  Continuous queries over data streams , 2001, SGMD.

[19]  Harilaos N. Koutsopoulos,et al.  A microscopic traffic simulator for evaluation of dynamic traffic management systems , 1996 .

[20]  Bruno Miguel Fernández Ruiz Architecture for the integration of dynamic traffic management systems , 2000 .