Layering a DBMS on a DHT-Based Storage Engine

This article proposes an architecture for integrating a relational database management system (DBMS) with a distributed hash table (DHT).   Systems developed based on this architecture provide functionalities of DHTs, such as scalability, decentralization and fault tolerance, combined with a high level query language provided by a DBMS. The main component of this architecture is the storage engine,  responsible for implementing the interface between an SQL query processor and a DHT. We have implemented a system based on the MySQL DBMS and the Bamboo DHT, and conducted an experimental study to investigate two issues. The first determines the impact of developing an indexing structure  for processing range queries. The second investigates two approaches for mapping relations to DHT's key-value pairs: vertical and horizontal partition of relations.

[1]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[2]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[3]  Brighten Godfrey,et al.  OpenDHT: a public DHT service and its uses , 2005, SIGCOMM '05.

[4]  Carlo Curino,et al.  Relational Cloud: The Case for a Database Service , 2010 .

[5]  Srinivasan Seshan,et al.  Mercury: supporting scalable multi-attribute range queries , 2004, SIGCOMM '04.

[6]  Sriram Ramabhadran,et al.  A case study in building layered DHT applications , 2005, SIGCOMM '05.

[7]  Marta Mattoso,et al.  Towards an inductive design of distributed object oriented databases , 1998, Proceedings. 3rd IFCIS International Conference on Cooperative Information Systems (Cat. No.98EX122).

[8]  Marta Mattoso Database Clusters , 2009, Encyclopedia of Database Systems.

[9]  Florian Schintke,et al.  Scalaris: reliable transactional p2p key/value store , 2008, ERLANG '08.

[10]  Shipeng Li,et al.  Distributed Segment Tree: Support of Range Query and Cover Query over DHT , 2006, IPTPS.

[11]  A. Rowstron,et al.  Scalable, decentralized object location and routing for large-scale peer-to-peer systems , 2001 .

[12]  Jun Rao,et al.  Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore , 2011, Proc. VLDB Endow..

[13]  Tim Kraska,et al.  An evaluation of alternative architectures for transaction processing in the cloud , 2010, SIGMOD Conference.

[14]  Hans-Arno Jacobsen,et al.  PNUTS: Yahoo!'s hosted data serving platform , 2008, Proc. VLDB Endow..