GAMMA - A High Performance Dataflow Database Machine

In this paper, the design, implementation techniques, and initial performance evaluation of Gamma are presented. Gamma is a new relational database machine that exploits dataflow query processing techniques. Ganma is a fully operational prototype consisting of 20 VAX 11/750 computers. The design of Gamma is based on an earlier multiprocessor database machine prototype (DIRECT) and several years of subsequent research on the problems raised by the DIRECT prototype. In addition to demonstrating that parallelism can really be made to work in a database machine context, the Gamma prototype shows how parallelism can be controlled with minimal control overhead through a combination of the use of algorithms based on hashing and the pipelining of data between processes. Except for 2 messages to initiate each operator of a query tree and 1 message when the operator terminates, the execution of a query is entirely self-scheduling. 52 refs., 12 figs.

[1]  Robert E. Wagner,et al.  Indexing Design Considerations , 1973, IBM Syst. J..

[2]  James R. Bell,et al.  Threaded code , 1973, CACM.

[3]  Kenneth C. Smith,et al.  RAP: an associative processor for data base management , 1975, AFIPS '75.

[4]  Robert B. K. Dewar,et al.  Indirect threaded code , 1975, Commun. ACM.

[5]  Michael Stonebraker,et al.  The design and implementation of INGRES , 1976, TODS.

[6]  Irving L. Traiger,et al.  System R: relational approach to database management , 1976, TODS.

[7]  Hans Christoph Zeidler,et al.  A Search Processor for Data Base Management Systems , 1978, VLDB.

[8]  David J. DeWitt DIRECT - a multiprocessor organization for supporting relational data base management systems , 1978, ISCA '78.

[9]  D. J. De Witt,et al.  Direct—A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979 .

[10]  David J. DeWitt,et al.  Query execution in DIRECT , 1979, SIGMOD '79.

[11]  Jim Gray,et al.  The convoy phenomenon , 1979, OPSR.

[12]  David J. DeWitt,et al.  DIRECT - A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979, IEEE Trans. Computers.

[13]  Patricia G. Selinger,et al.  Access path selection in a relational database management system , 1979, SIGMOD '79.

[14]  Michael Stonebraker,et al.  Muffin: A Distributed Database Machine , 1979, University of California at Berkeley.

[15]  Edward Babb,et al.  Implementing a relational database by means of specialzed hardware , 1979, TODS.

[16]  James Richard Goodman An investigation of multiprocessor structures and algorithms for data base management , 1980 .

[17]  Richard W. Watson Timer-Based Mechanisms in Reliable Transport Protocol Connection Management , 1981, Comput. Networks.

[18]  Stanley Y. W. Su,et al.  Parallel Algorithms and Their Implementation in MICRONET , 1982, VLDB.

[19]  R. R. Johnson,et al.  Database machine architecture for performing aggregations , 1982 .

[20]  David J. DeWitt,et al.  Applying Data Flow Techniques to Data Base Machines , 1982, Computer.

[21]  David J. DeWitt,et al.  Benchmarking Database Systems A Systematic Approach , 1983, VLDB.

[22]  David J. DeWitt,et al.  Database Machines: An Idea Whose Time Passed? A Critique of the Future of Database Machines , 1989, IWDM.

[23]  Patrick Valduriez,et al.  Design of a Multiprocessor Relational Database System , 1983, IFIP Congress.

[24]  Michael Stonebraker,et al.  Performance enhancements to a relational database system , 1983, TODS.

[25]  Haruo Yokota,et al.  A relational data base machine: First step to knowledge base machine , 1983, ISCA '83.

[26]  Patrick Valduriez,et al.  Join and Semijoin Algorithms for a Multiprocessor Database Machine , 1984, TODS.

[27]  Kjell Bratbergsengen,et al.  Hashing Methods and Relational Algebra Operations , 1984, VLDB.

[28]  Matthias Jarke,et al.  Query Optimization in Database Systems , 1984, CSUR.

[29]  G. Jason An Idea Whose Time Has Passed. , 1984 .

[30]  Kevin Wilkinson,et al.  Overview of the Jasmin database machine , 1984, SIGMOD '84.

[31]  Chaitanya K. Baru,et al.  Performance evaluation of the statistical aggregation by categorization in the SM3 system , 1984, SIGMOD '84.

[32]  Michael Stonebraker,et al.  Implementation techniques for main memory database systems , 1984, SIGMOD '84.

[33]  David J. DeWitt,et al.  Design and implementation of the wisconsin storage system , 1985, Softw. Pract. Exp..

[34]  J DeWittDavid,et al.  Recovery architectures for multiprocessor database machines , 1985 .

[35]  Michael Ubell The Intelligent Database Machine (IDM) , 1985, Query Processing in Database Systems.

[36]  Haruo Yokota,et al.  The Design and Implementation of Relational Database Machine Delta , 1985, IWDM.

[37]  James C. Browne,et al.  A Parallel Multi-Stage I/O Architecture with Self-Managing Disk Cache for Database Management Applications , 1985, IWDM.

[38]  David J. DeWitt,et al.  Recovery architectures for multiprocessor database machines , 1985, SIGMOD Conference.

[39]  David J. DeWitt,et al.  Multiprocessor Hash-Based Join Algorithms , 1985, VLDB.

[40]  Michelle Y. Kim Parallel Operation of Magnetic Disk Storage Devices: Synchronized Disk Interleaving , 1985, IWDM.

[41]  Hiroshi Sakai,et al.  A hardware pipeline algorithm for relational database operation , 1985, ISCA '85.

[42]  Hector Garcia-Molina,et al.  Disk striping , 1986, 1986 IEEE Second International Conference on Data Engineering.

[43]  David K. Hsiao,et al.  A multi-backend database system for performance gains, capacity growth and hardware upgrade , 1986, 1986 IEEE Second International Conference on Data Engineering.

[44]  David J. DeWitt,et al.  The Crystal Multicomputer: Design and Implementation Experience , 1987, IEEE Transactions on Software Engineering.

[45]  Miron Livny,et al.  Multi-disk management algorithms , 1987, SIGMETRICS '87.