Design considerations for data-flow database machines

This paper presents a discussion of the application of data-flow machine concepts to the design and implementation of database machines which execute relational algebra queries. We analyze the performance of multiprocessor nested-loops and sort-merge join algorithms and show that the nested-loops algorithm is generally superior. Three levels of operand granularity for data-flow database machines are introduced and compared using the nested-loops join algorithm. We demonstrate, that relation-level granularity is too coarse and that tuple-level granularity is too fine. The third level of granularity, a page of a relation, is shown to be the best choice from both hardware and software viewpoints. Finally, a preliminary design for a data-flow database machine which utilizes page-level granularity and supports distributed control of instruction execution is presented.

[1]  James E. Rumbaugh,et al.  A Data Flow Multiprocessor , 1977, IEEE Transactions on Computers.

[2]  Jaishankar Menon,et al.  The Post Processing Functions of a Database Computer. , 1979 .

[3]  David J. DeWitt,et al.  Processor allocation strategies for multiprocessor database machines , 1981, TODS.

[4]  D. J. De Witt,et al.  Direct—A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979 .

[5]  Arvind,et al.  A Computer Capable of Exchanging Processors for Time , 1977, IFIP Congress.

[6]  M. W. Blasgen,et al.  Storage and Access in Relational Data Bases , 1977, IBM Syst. J..

[7]  John Miles Smith,et al.  Optimizing the performance of a relational algebra database interface , 1975, CACM.

[8]  S. B. Yao,et al.  Optimization of query evaluation algorithms , 1979, ACM Trans. Database Syst..

[9]  David A. Patterson,et al.  X-Tree: A tree structured multi-processor computer architecture , 1978, ISCA '78.

[10]  David J. DeWitt,et al.  DIRECT - A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979, IEEE Trans. Computers.

[11]  Franco P. Preparata,et al.  New Parallel-Sorting Schemes , 1978, IEEE Transactions on Computers.

[12]  David J. DeWitt,et al.  Query execution in DIRECT , 1979, SIGMOD '79.

[13]  A. L. Davis,et al.  The architecture and system method of DDM1: A recursively structured Data Driven Machine , 1978, ISCA '78.

[14]  David J. DeWitt DIRECT - a multiprocessor organization for supporting relational data base management systems , 1978, ISCA '78.

[15]  W. Donald Frazer Potential Technology Implications for Computers and Telecommunications in the 1980s , 1979, IBM Syst. J..

[16]  Ming T. Liu,et al.  A loop network for simultaneous transmission of variable-length messages , 1974, ISCA '75.