AMOS-SDDS: A Scalable Distributed Data Manager for Windows Multicomputers

Known parallel DBMS offer at present only static partitioning schemes. Adding a storage node is a cumbersome operation that typically requires the manual data redistribution. We present an architecture termed AMOS-SDDS for a share-nothing multicomputer. We have coupled a high-performance main-memory DBMS AMOS-II and a manager of Scalable Distributed Data Structures (SDDS) into a scalable distributed system. SDDS provides the scalable data partitioning in distributed RAM, supporting parallel scans with function shipping. AMOS-SDDS couples both systems using the AMOS-II foreign function interface. Its scalability abolishes the cumbersome storage limits of a single site RAM DBMS technology. Its distributed RAM query processing and scalable data partitioning is an improvement over the current parallel DBMSs technology. We validate AMOS-SDDS architecture by experiments with distributed nested loop join queries over a file scaling up to 300.000 tuples. It includes performance study of speed-up and scale-up characteristics. The results encourage the use of SDDS for highperformance database systems.

[1]  Witold Litwin,et al.  LH* - Linear Hashing for Distributed Files , 1993, SIGMOD Conference.

[2]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[3]  David Thomas,et al.  The Art in Computer Programming , 2001 .

[4]  Tore Risch,et al.  Scalable Distributed Data Structures for High-Performance Databases , 2000, WDAS.

[5]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[6]  Witold Litwin,et al.  Performance Measurements of RP* : A Scalable Distributed Data Structure For Range Partitioning , 2000 .

[7]  Tore Risch Amos II External Interfaces , 2000 .

[8]  Tore Risch,et al.  Integrating Heterogenous Overlapping Databases through Object-Oriented Transformations , 1999, VLDB.

[9]  Terence Critchlow,et al.  Practical lessons in supporting large-scale computational science , 1999, SGMD.

[10]  Minesh B. Amin,et al.  An Adaptive, Load Balancing Parallel Join Algorithm , 1994, COMAD.

[11]  Clement T. Yu,et al.  Priniples of Database Query Processing for Advanced Applications , 1997 .

[12]  J. Farradane,et al.  Information Science , 1971, Nature.

[13]  Paul N. Weinberg,et al.  SQL, the complete reference , 1999 .

[14]  Witold Litwin,et al.  RP*: A Family of Order Preserving Scalable Distributed Data Structures , 1994, VLDB.

[15]  Andrew S. Grimshaw,et al.  The Legion vision of a worldwide virtual computer , 1997, Commun. ACM.

[16]  Jim Gray Super Servers: Commodity Computer Clusters Pose a Software Challenge , 1995, BTW.

[17]  Tore Risch,et al.  Design Issues for Scalable Availability LH* Schemes with Recor Grouping , 1999, WDAS.