Chimera: data sharing flexibility, shared nothing simplicity

The current database market is fairly evenly split between shared nothing and data sharing systems. While shared nothing systems are easier to build and scale, data sharing systems have advantages in load balancing. In this paper we explore adding data sharing functionality as an extension to a shared nothing database system. Our approach isolates the data sharing functionality from the rest of the system and relies on well-studied, robust techniques to provide the data sharing extension. This reduces the difficulty in providing data sharing functionality, yet provides much of the flexibility of a data sharing system. We present the design and implementation of Chimera -- a hybrid database system, targeted at load balancing for many workloads, and scale-out for read-mostly workloads. The results of our experiments demonstrate that we can achieve almost linear scalability and effective load balancing with less than 2% overhead during normal operation.

[1]  Brett D. Fleisch,et al.  The Chubby lock service for loosely-coupled distributed systems , 2006, OSDI '06.

[2]  H. Apte,et al.  Serverless Network File Systems , 2006 .

[3]  Peter M. Spiro How the Rdb � VMS Data Sharing System Became Fast , 1992 .

[4]  Frank B. Schmuck,et al.  GPFS: A Shared-Disk File System for Large Computing Clusters , 2002, FAST.

[5]  David R. Cheriton,et al.  Leases: an efficient fault-tolerant mechanism for distributed file cache consistency , 1989, SOSP '89.

[6]  Marc Najork,et al.  Boxwood: Abstractions as the Foundation for Storage Infrastructure , 2004, OSDI.

[7]  Jang-Ping Sheu,et al.  Design and implementation of a distributed file system , 1991, Softw. Pract. Exp..

[8]  Kevin Loney Oracle Database 11g The Complete Reference , 2004 .

[9]  Chandramohan A. Thekkath,et al.  Petal: distributed virtual disks , 1996, ASPLOS VII.

[10]  Mahadev Konar,et al.  ZooKeeper: Wait-free Coordination for Internet-scale Systems , 2010, USENIX ATC.

[11]  C. Mohan,et al.  Efficient Locking and Caching of Data in the Multisystem Shared Disks Transaction Environment , 1992, EDBT.

[12]  Christopher Hertel Implementing CIFS: The Common Internet File System , 2003 .

[13]  David B. Lomet Recovery for Shared Disk Systems Using Multiple Redo Logs , 2002 .

[14]  Hans-Arno Jacobsen,et al.  PNUTS: Yahoo!'s hosted data serving platform , 2008, Proc. VLDB Endow..

[15]  David B. Lomet Private locking and distributed cache management , 1994, Proceedings of 3rd International Conference on Parallel and Distributed Information Systems.

[16]  Murthy V. Devarakonda,et al.  Recovery in the Calypso file system , 1996, TOCS.

[17]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[18]  Michael Stonebraker,et al.  The Case for Shared Nothing , 1985, HPTS.

[19]  Chandramohan A. Thekkath,et al.  Frangipani: a scalable distributed file system , 1997, SOSP.

[20]  GhemawatSanjay,et al.  The Google file system , 2003 .

[21]  David A. Goldberg,et al.  Design and Implementation of the Sun Network Filesystem , 1985, USENIX Conference Proceedings.

[22]  Paul C. Zikopoulos,et al.  DB2: The Complete Reference , 2001 .

[23]  Leslie Lamport,et al.  The part-time parliament , 1998, TOCS.