Minimizing the response time of executing a join between fragmented relations in a distributed database system

The problem of describing minimal response time execution strategies for evaluating the join of several fragmented database relations is considered. The resulting optimization problem takes the convenient form of a min-max integer program. Further generalizations also incorporate the performance objective of total execution cost. Tables of data that logically conform to the relational model are, at the physical level, frequently divided into numerous pieces. These fragments are distributed among the sites of a distributed database system, and each may be replicated at any number of separate sites. A query requesting the join of many such relations is resolved by joining their component fragments in an appropriate manner, as determined by the patterns of overlapping attribute values. The final result is then obtained by concatenating the products of these computations. This process is performed under the supervision of the database management system so as to minimize the time taken, as perceived by the user who issued the request. These developments build on earlier investigations [1-5] that consider only the alternative optimization goal of minimal execution cost. With this in mind, several different approaches may be taken to obtain distinct hybrid models that give due regard to both measures of join query performance.
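
To make the min-max idea concrete, the following is a minimal sketch and not the formulation used in the paper. It assumes a simplified setting in which each fragment join is assigned to exactly one site, sites work in parallel, and the response time is the load of the busiest site. The function name `min_max_assignment` and the `cost` table are hypothetical, and the search is plain brute force rather than an integer programming method.

```python
from itertools import product


def min_max_assignment(cost):
    """Brute-force a toy min-max assignment.

    cost[t][s] is a hypothetical time for site s to execute fragment join t.
    Each task goes to exactly one site; sites run in parallel, so the
    response time of a plan is the largest per-site load.
    """
    n_tasks = len(cost)
    n_sites = len(cost[0])
    best_time, best_plan = float("inf"), None
    # Enumerate every assignment of tasks to sites (n_sites ** n_tasks plans).
    for plan in product(range(n_sites), repeat=n_tasks):
        load = [0] * n_sites
        for task, site in enumerate(plan):
            load[site] += cost[task][site]
        response = max(load)  # inner "max" of the min-max objective
        if response < best_time:  # outer "min" over all plans
            best_time, best_plan = response, plan
    return best_time, best_plan


# Illustrative data only: three fragment joins, two sites.
cost = [
    [4, 6],
    [3, 5],
    [2, 2],
]
best_time, best_plan = min_max_assignment(cost)
# Total execution cost of the plan that minimizes response time.
total_cost = sum(cost[task][site] for task, site in enumerate(best_plan))
print(best_time, best_plan, total_cost)
```

With this illustrative data, running everything at the first site gives the lowest total cost but a longer response time than the plan chosen above, which is the kind of tension between the two objectives that a hybrid model has to weigh.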

[1] Ramez Elmasri, et al. Fundamentals of Database Systems, 1989.

[2] Alfred V. Aho, et al. The theory of joins in relational data bases, 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[3] Jeffrey D. Ullman, et al. Principles of Database Systems, 1980.

[4] Yahiko Kambayashi, et al. Processing Cyclic Queries, 1985, Query Processing in Database Systems.

[5] D. J. Reid. Evaluating multiple join queries in a distributed database system, 1995.

[6] M. J. Garber, et al. Introduction to Linear Programming, 1973.

[7] Maria E. Orlowska, et al. The propagation of updates to relational tables in a distributed database system, 1996.

[8] Masatoshi Yoshikawa, et al. Query processing for distributed databases using generalized semi-joins, 1982, SIGMOD '82.

[9] Philip A. Bernstein, et al. Using Semi-Joins to Solve Relational Queries, 1981, JACM.

[10] Don Batory, et al. Query Processing in Database Systems, 2011, Topics in Information Systems.

[11] Alexander Schrijver, et al. Theory of linear and integer programming, 1986, Wiley-Interscience series in discrete mathematics and optimization.

[12] Stefano Ceri, et al. Distributed Databases: Principles and Systems, 1984.

[13] Chihping Wang. The complexity of processing tree queries in distributed databases, 1990, Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing 1990.

[14] Randall L. Hyde, et al. An Analysis of Degenerate Sharing and False Coherence, 1996, J. Parallel Distributed Comput.

[15] Philip S. Yu, et al. Determining beneficial semijoins for a join sequence in distributed query processing, 1991, Proceedings. Seventh International Conference on Data Engineering.

[16] S. Vajda, et al. Integer Programming and Network Flows, 1970.

[17] Catriel Beeri, et al. On the Desirability of Acyclic Database Schemes, 1983, JACM.

[18] D. J. Reid. Optimal distributed execution of join queries, 1994.

[19] M. W. Orlowski. On Optimisation of Joins in Distributed Database System, 1992, Future Databases.

[20] Hamdy A. Taha, et al. Integer Programming: Theory, Applications, and Computations, 1975.

[21] Ronald Fagin, et al. Degrees of acyclicity for hypergraphs and relational database schemes, 1983, JACM.

[22] Serge Abiteboul, et al. Equivalence and optimization of relational transactions, 1988, JACM.

[23] Salah E. Elmaghraby, et al. Activity networks: Project planning and control by network models, 1977.

[24] Chao-Chih Yang. Relational databases, 1985.

[25] 伊理 正夫, et al. Network flow, transportation and scheduling: theory and algorithms, 1969.

[26] Averill M. Law, et al. The art and theory of dynamic programming, 1977.

[27] D. J. Reid, et al. Executing join queries in an uncertain distributed environment, 1995.

[28] Maria E. Orlowska, et al. Allocating relations in a distributed database system, 1995.

[29] G. A. Garreau, et al. Elementary Dynamic Programming, 1976.

[30] Nathan Goodman, et al. Tree queries: a simple class of relational queries, 1982, TODS.

[31] Dennis Shasha, et al. Optimizing equijoin queries in distributed databases where relations are hash partitioned, 1991, TODS.

[32] Sakti Pramanik, et al. Optimizing Join Queries in Distributed Databases, 1988, IEEE Trans. Software Eng.

[33] Jorma Rissanen, et al. Independent components of relations, 1977, TODS.

[34] D. J. Reid. Incorporating processor costs in optimizing the distributed execution of join queries, 1994.