Executing join queries in an uncertain distributed environment

The uncertainty inherent in the distributed environment poses new challenges to the efficient utilization of system resources in managing database transactions. In response to this realization, the execution of a join query in a system with probabilistic resource and cost parameters is contemplated, leading to the development of stochastic programming models. Information in the form of relational tables and scattered amongst the sites of a distributed database system is to be collated and presented to the appropriate user, in response to an issued request. Performing this task demands the usage of limited resources; the ultimate goal is the determination of an execution strategy incurring minimal cost to the system. The actual state of any network component at the moment of its exploitation cannot be exactly ascertained in advance. Any interrogation of a distant element must be communicated by the network, and this involves a delay, as perceived by the questioner, during which the state of the system may change. Indeed, the time at which a task assigned to any particular component cannot itself be precisely predicted, even if the future state of the component could be known definitively. By considering the uncertain nature of the distributed environment, the earlier model of join query evaluation presented in [1] can be modified in different ways to account for system parameters known only in a stochastic sense. This new level of subjectivity is a revelation of the many different attitudes that may be taken towards the chance of infeasibility in the solution, for the major issue in dealing with uncertainty is the choice of an appropriate measure of risk.

[1]  Alfred V. Aho,et al.  The theory of joins in relational data bases , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[2]  Hendrik W. Lenstra,et al.  Integer Programming with a Fixed Number of Variables , 1983, Math. Oper. Res..

[3]  W. K. Haneveld Duality in Stochastic Linear and Dynamic Programming , 1986 .

[4]  D. Reid Genetic algorithms in constrained optimization , 1996 .

[5]  Paul Olsen Multistage Stochastic Programming with Recourse: The Equivalent Deterministic Problem , 1976 .

[6]  Jeffrey D. Ullman,et al.  Principles of Database Systems , 1980 .

[7]  S. Vajda,et al.  Integer Programming and Network Flows , 1970 .

[8]  Hamdy A. Taha,et al.  Integer Programming: Theory, Applications, and Computations , 1975 .

[9]  H. Kunzi,et al.  Lectu re Notes in Economics and Mathematical Systems , 1975 .

[10]  Chihping Wang The complexity of processing tree queries in distributed databases , 1990, Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing 1990.

[11]  D. J. Laughhunn Quadratic Binary Programming with Application to Capital-Budgeting Problems , 1970, Oper. Res..

[12]  E. L. Lawler,et al.  Branch-and-Bound Methods: A Survey , 1966, Oper. Res..

[13]  Ramez Elmasri,et al.  Fundamentals of Database Systems , 1989 .

[14]  Catriel Beeri,et al.  On the Desirability of Acyclic Database Schemes , 1983, JACM.

[15]  Chao-Chih Yang Relational databases , 1985 .

[16]  M. W. Orlowski On Optimisation of Joins in Distributed Database System , 1992, Future Databases.

[17]  Paul Olsen When Is a Multistage Stochastic Programming Problem Well-Defined? , 1976 .

[18]  Randall L. Hyde,et al.  An Analysis of Degenerate Sharing and False Coherence , 1996, J. Parallel Distributed Comput..

[19]  Ronald Fagin,et al.  Degrees of acyclicity for hypergraphs and relational database schemes , 1983, JACM.

[20]  Dennis Shasha,et al.  Optimizing equijoin queries in distributed databases where relations are hash partitioned , 1991, TODS.

[21]  D. J. Reid Optimal distributed execution of join queries , 1994 .

[22]  R. Rockafellar,et al.  Stochastic Convex Programming: Relatively Complete Recourse and Induced Feasibility , 1976 .

[23]  R. J. Dakin,et al.  A tree-search algorithm for mixed integer programming problems , 1965, Comput. J..

[24]  Sakti Pramanik,et al.  Optimizing Join Queries in Distributed Databases , 1988, IEEE Trans. Software Eng..

[25]  András Prékopa,et al.  Contributions to the theory of stochastic programming , 1973, Math. Program..

[26]  Jorma Rissanen,et al.  Independent components of relations , 1977, TODS.

[27]  D. J. Reid Incorporating processor costs in optimizing the distributed execution of join queries , 1994 .

[28]  D. J. Reid Evaluating multiple join queries in a distributed database system , 1995 .

[29]  Peter Kall,et al.  Stochastic Linear Programming , 1975 .

[30]  John E. Mitchell,et al.  An improved branch and bound algorithm for mixed integer nonlinear programs , 1994, Comput. Oper. Res..

[31]  Philip A. Bernstein,et al.  Using Semi-Joins to Solve Relational Queries , 1981, JACM.

[32]  Yuri Ermoliev,et al.  Numerical techniques for stochastic optimization , 1988 .