Quality Management in Database Systems-A Thesis Proposal

In this proposal, we argue that quality parameters such as delay and uncertainty are more important performance metrics than processing time, which is the only concern in query optimization in traditional DBMS. We explore specific problems on how to maintain quality in multimedia and stream databases, with a focus on the latter. Data stream processing have attracted a great deal of attention from the database community. The continuous feature of both data and queries in data stream management systems (DSMSs) place great demand on system resources. However, queries can be processed with different levels of quality such as timeliness, reliability, and uncertainty. In this proposal, we discuss the problem of how to maintain quality in query processing in DSMSs. We focus on two types of quality metrics: general Quality-of-Service parameters that apply to all types of queries and uncertainty parameter for probabilistic queries. A widely-used approach to maintain QoS (especially tuple delays) in DSMS query processing is load shedding, i.e., dropping data. Current load shedding solutions utilize simple, intuitive ideas in determining the time and amount of load to be discarded and do not work well in the presence of system/environmental disturbances. We propose a solution based on feedback control theory with significantly improved long-term performance. For probabilistic queries, an important optimization is to have the least uncertainty in query results under resource contraints. We propose to approach this problem by exploiting the temporal dependence in stream data.

[1]  Klara Nahrstedt,et al.  Resource Management in Networked Multimedia Systems , 1995, Computer.

[2]  Sunil Prabhakar,et al.  Evaluating probabilistic queries over imprecise data , 2003, SIGMOD '03.

[3]  Christos Faloutsos,et al.  Data mining meets performance evaluation: fast algorithms for modeling bursty traffic , 2002, Proceedings 18th International Conference on Data Engineering.

[4]  Dennis Shasha,et al.  StatStream: Statistical Monitoring of Thousands of Data Streams in Real Time , 2002, VLDB.

[5]  Rajeev Rastogi,et al.  Processing complex aggregate queries over data streams , 2002, SIGMOD '02.

[6]  Sunil Prabhakar,et al.  An analytical study of peer-to-peer media streaming systems , 2005, TOMCCAP.

[7]  David K. Y. Yau,et al.  Operating System Techniques for Distributed Multimedia , 1996 .

[8]  Klara Nahrstedt,et al.  QoS-aware middleware for ubiquitous and heterogeneous environments , 2001, IEEE Commun. Mag..

[9]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[10]  Samuel Madden,et al.  Using Probabilistic Models for Data Management in Acquisitional Environments , 2005, CIDR.

[11]  Jeffrey F. Naughton,et al.  Rate-based query optimization for streaming information sources , 2002, SIGMOD '02.

[12]  Rajeev Motwani,et al.  Chain: operator scheduling for memory minimization in data stream systems , 2003, SIGMOD '03.

[13]  Jianping Fan,et al.  VDBMS: A testbed facility for research in video database benchmarking , 2004, Multimedia Systems.

[14]  Michael Stonebraker,et al.  Load Shedding in a Data Stream Manager , 2003, VLDB.

[15]  Rajeev Motwani,et al.  Load shedding for aggregation queries over data streams , 2004, Proceedings. 20th International Conference on Data Engineering.

[16]  Sang Hyuk Son,et al.  Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms* , 2001, Real-Time Systems.

[17]  Sunil Prabhakar,et al.  Quality-Aware Replication of Multimedia Data , 2005, DEXA.

[18]  Michael Stonebraker,et al.  Monitoring Streams - A New Class of Data Management Applications , 2002, VLDB.

[19]  Edward Y. Chang,et al.  Adaptive stream resource management using Kalman Filters , 2004, SIGMOD '04.

[20]  Qiang Chen,et al.  Aurora : a new model and architecture for data stream management ) , 2006 .

[21]  Yi-Cheng Tu,et al.  Change Point Estimation of Bar Code Signals , 2005, CSC.

[22]  Michael Stonebraker,et al.  Operator Scheduling in a Data Stream Manager , 2003, VLDB.

[23]  Sunil Prabhakar,et al.  Indexing and Querying Constantly Evolving Data Using Time Series Analysis , 2005, DASFAA.

[24]  Shanzhong Zhu,et al.  Stochastic Consistency, and Scalable Pull-Based Caching for Erratic Data Sources , 2004, VLDB.

[25]  Frederick Reiss,et al.  Data Triage: an adaptive architecture for load shedding in TelegraphCQ , 2005, 21st International Conference on Data Engineering (ICDE'05).

[26]  Sunil Prabhakar,et al.  Adaptive Stream Filters for Entity-based Queries with Non-Value Tolerance , 2005, VLDB.

[27]  Leming Qu,et al.  Change Point Estimation of Bilevel Functions , 2005 .

[28]  Jennifer Widom,et al.  Adaptive precision setting for cached approximate values , 2001, SIGMOD '01.

[29]  Radu Sion,et al.  QuaSAQ: An Approach to Enabling End-to-End QoS for Multimedia Databases , 2004, EDBT.

[30]  Klara Nahrstedt,et al.  A control-based middleware framework for quality-of-service adaptations , 1999, IEEE J. Sel. Areas Commun..

[31]  Song Liu,et al.  Control-Based Quality Adaptation in Data Stream Management Systems , 2005, DEXA.

[32]  Sang Hyuk Son,et al.  Managing deadline miss ratio and sensor data freshness in real-time databases , 2004, IEEE Transactions on Knowledge and Data Engineering.

[33]  Song Liu,et al.  Load shedding in stream databases: a control-based approach , 2006, VLDB.

[34]  Jianping Fan,et al.  A distributed database server for continuous media , 2002, Proceedings 18th International Conference on Data Engineering.

[35]  Jennifer Widom,et al.  STREAM: The Stanford Stream Data Manager , 2003, IEEE Data Eng. Bull..

[36]  Sunil Prabhakar,et al.  Performance analysis of a hybrid media streaming system , 2003, IS&T/SPIE Electronic Imaging.

[37]  Jennifer Widom,et al.  Adaptive filters for continuous queries over distributed data streams , 2003, SIGMOD '03.

[38]  Walid G. Aref,et al.  Scheduling for shared window joins over data streams , 2003, VLDB.

[39]  Jennifer Widom,et al.  Query Processing, Resource Management, and Approximation ina Data Stream Management System , 2002 .

[40]  Frederick Reiss,et al.  TelegraphCQ: Continuous Dataflow Processing for an Uncertain World , 2003, CIDR.

[41]  Ying Xing,et al.  The Design of the Borealis Stream Processing Engine , 2005, CIDR.