On a high-performance vlsi solution to database problems

Abstract : This thesis explores the design and use of custom-made VLSI hardware in the area of database problems. Our effort differs from most previous ones in that we search for structures and algorithms, directly implementable on silicon, for the solution of computation-intensive database problems. The types of target database systems include the general database management systems and the design database systems. The thesis deals mainly with database systems of the relational model. One common view concerning special-purpose hardware usage is that it performs a specific task. The proposed device is not a hardware solution to a specific problem, but provides a number of useful data structures and basic operations. It can be used to improve the performance of any sequential algorithm which makes extensive use of such data structures and basic operations. The design is based on a few basic cells, interconnected together in the form of a complete binary tree. The proposed device can handle all the basic relational operations: select, join, project, union, and intersection. With a special-purpose device of limited size attached to a host, the overall performance may ultimately be dictated by the I/O between the two sites. The ideal special-purpose device design is one that achieves a balance between computation and I/O. We propose a model to study the I/O complexity for sorting n numbers with any special-purpose hardware device of size s, and show a lower bound result of omega (n log n/log s). We present an optimal design achieving this bound. An important finding is that for practical ranges on the quantity of data to be sorted, systolic sorting devices of small sizes can beat fast sequential sorting algorithms.

[1]  Hans Christoph Zeidler,et al.  A Search Processor for Data Base Management Systems , 1978, VLDB.

[2]  Harold S. Stone,et al.  Parallel Processing with the Perfect Shuffle , 1971, IEEE Transactions on Computers.

[3]  G. Jack Lipovski,et al.  The Architectural Features and Implementation Techniques of the Multicell CASSM , 1979, IEEE Transactions on Computers.

[4]  Jayanta Banerjee,et al.  The Clustering and Security Mechanisms of a Database Computer (DBC). , 1979 .

[5]  Arthur M. Lesk,et al.  Detection of three-dimensional patterns of atoms in chemical structures , 1979, CACM.

[6]  George Coulouris,et al.  Towards content-addressing in data bases , 1972, Comput. J..

[7]  Chyuan Shiun Lin,et al.  The design of a rotating associative memory for relational database applications , 1976, TODS.

[8]  GILLES M. E. LAFUE An approach to automatic maintenance of semantic integrity in large design data bases , 1979, 1979 International Workshop on Managing Requirements Knowledge (MARK).

[9]  Lee A. Hollaar Specialized merge processor networks for combining sorted lists , 1978, TODS.

[10]  E. F. Codd,et al.  Further Normalization of the Data Base Relational Model , 1971, Research Report / RJ / IBM / San Jose, California.

[11]  Irving L. Traiger,et al.  A history and evaluation of System R , 1981, CACM.

[12]  E. F. Codd,et al.  Relational Completeness of Data Base Sublanguages , 1972, Research Report / RJ / IBM / San Jose, California.

[13]  George Stiny,et al.  Pictorial and Formal Aspects of Shape and Shape Grammars , 1975 .

[14]  Carlo H. Séquin,et al.  Communication In X-TREE, A Modular Multiprocessor System , 1978, ACM Annual Conference.

[15]  M. Rem,et al.  Cost and performance of VLSI computing structures , 1979, IEEE Transactions on Electron Devices.

[16]  Richard J. Lipton,et al.  Multidimensional Searching Problems , 1976, SIAM J. Comput..

[17]  Douglas Comer,et al.  Ubiquitous B-Tree , 1979, CSUR.

[18]  Ronald Fagin,et al.  Multivalued dependencies and a new normal form for relational databases , 1977, TODS.

[19]  Charles E. Leiserson,et al.  Area-Efficient VLSI Computation , 1983 .

[20]  Lynn Conway,et al.  Introduction to VLSI systems , 1978 .

[21]  Kenneth C. Smith,et al.  RAP.2—An Associative Processor for Databases and Its Applications , 1979, IEEE Transactions on Computers.

[22]  C. S. Lin Sorting with associative secondary storage devices , 1899 .

[23]  H. T. Kung Why systolic architectures? , 1982, Computer.

[24]  Ronald Fagin,et al.  On an authorization mechanism , 1978, TODS.

[25]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[26]  Charles E. Leiserson Systolic Priority Queues , 1979 .

[27]  Jayanta Banerjee,et al.  DBC—A Database Computer for Very Large Databases , 1979, IEEE Transactions on Computers.

[28]  Glen G. Langdon,et al.  A note on associative processors for data management , 1978, TODS.

[29]  George Stiny Two Exercises in Formal Composition , 1976 .

[30]  G. Stiny Ice-Ray: A Note on the Generation of Chinese Lattice Designs , 1977 .

[31]  Henry S. Baird Fast algorithms for LSI artwork analysis , 1977, DAC '77.

[32]  D. T. Lee,et al.  Location of a point in a planar subdivision and its applications , 1976, STOC '76.

[33]  Eduardo B. Fernández,et al.  An Architectural Extension for a Large Database System Incorporating a Processor for Disk Search , 1977, VLDB.

[34]  David J. DeWitt,et al.  DIRECT - A Multiprocessor Organization for Supporting Relational Database Management Systems , 1979, IEEE Trans. Computers.

[35]  Raymond A. Lorie,et al.  XRM - An Extended (N-ary) Relational Memory , 1974, Research Report / G / IBM / Cambridge Scientific Center.