Query optimization to meet performance targets for wide area applications

Recent technology advances have enabled mediated query processing with Internet accessible WebSources. A characteristic of WebSources is that their access costs exhibit transient behavior These costs depend on the network and server workloads, which are often affected by, the time of day,, day, etc. Given transient behavior, an appropriate performance target (PT) for a noisy, environment will correspond to "at least X percentage of queries will have a latency of less than T units of time". In this paper we propose an optimizer strategy that is sensitive to the objective of meeting such performance targets (PT). For each query plan, a PT sensitive optimizer uses both the expected value of the cost distribution of the plan, as well as the expected delay, of the plan. We validate our strategy using a simulation based study of the optimizers behavior. We also experimentally validate the optimizer using traces of access costs for real WebSources.

[1]  Laurent Amsaleg,et al.  Dynamic Query Operator Scheduling for Wide-Area Remote Access , 1998, Distributed and Parallel Databases.

[2]  Scott Shenker,et al.  Fundamental Design Issues for the Future Internet (Invited Paper) , 1995, IEEE J. Sel. Areas Commun..

[3]  Laura M. Haas,et al.  Capabilities-based query rewriting in mediator systems , 1996 .

[4]  Laurent Amsaleg,et al.  Cost-based query scrambling for initial delays , 1998, SIGMOD '98.

[5]  Asuman Dogac,et al.  Dynamic query optimization on a distributed object management platform , 1996, CIKM '96.

[6]  Luc Bouganim,et al.  A Dynamic Query Processing Architecture for Data Integration Systems , 2000, IEEE Data Eng. Bull..

[7]  Michael J. Franklin,et al.  XJoin: Getting Fast Answers From Slow and Bursty Networks , 1999 .

[8]  Laura M. Haas,et al.  Cost Models DO Matter: Providing Cost Information for Diverse Data Sources in a Federated System , 1999, VLDB.

[9]  Peter J. Haas,et al.  Ripple joins for online aggregation , 1999, SIGMOD '99.

[10]  Vladimir Zadorozhny,et al.  Learning response time for WebSources using query feedback and application in query optimization , 2000, The VLDB Journal.

[11]  Ioana Manolescu,et al.  Query optimization in the presence of limited access patterns , 1999, SIGMOD '99.

[12]  Qiang Zhu,et al.  Developing cost models with qualitative variables for dynamic multidatabase environments , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[13]  Vladimir Zadorozhny,et al.  Efficient evaluation of queries in a mediator for WebSources , 2002, SIGMOD '02.

[14]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[15]  Joseph M. Hellerstein,et al.  Eddies: continuously adaptive query processing , 2000, SIGMOD '00.

[16]  Joseph Y. Halpern,et al.  Least expected cost query optimization: an exercise in utility , 1999, PODS.

[17]  Donald F. Towsley,et al.  Detecting shared congestion of flows via end-to-end measurement , 2000, SIGMETRICS '00.

[18]  Hubert Naacke,et al.  Leveraging mediator cost models with heterogeneous data sources , 1998, Proceedings 14th International Conference on Data Engineering.

[19]  Panos Vassiliadis,et al.  ARKTOS: A Tool For Data Cleaning and Transformation in Data Warehouse Environments , 2000, IEEE Data Eng. Bull..

[20]  Vladimir Zadorozhny,et al.  Learning response times for WebSources: a comparison of a web prediction tool (WebPT) and a neural network , 1999, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384).

[21]  Timos K. Sellis,et al.  Parametric query optimization , 1992, The VLDB Journal.

[22]  KabraNavin,et al.  Efficient mid-query re-optimization of sub-optimal query execution plans , 1998 .

[23]  Miron Livny,et al.  The Case for Enhanced Abstract Data Types , 1997, VLDB.

[24]  Vladimir Zadorozhny,et al.  Validating a Cost Model for Wide Area Applications , 2000 .

[25]  Alon Y. Halevy,et al.  An adaptive query execution system for data integration , 1999, SIGMOD '99.