论文信息 - Modular data storage with Anvil

Modular data storage with Anvil

Databases have achieved orders-of-magnitude performance improvements by changing the layout of stored data -- for instance, by arranging data in columns or compressing it before storage. These improvements have been implemented in monolithic new engines, however, making it difficult to experiment with feature combinations or extensions. We present Anvil, a modular and extensible toolkit for building database back ends. Anvil's storage modules, called dTables, have much finer granularity than prior work. For example, some dTables specialize in writing data, while others provide optimized read-only formats. This specialization makes both kinds of dTable simple to write and understand. Unifying dTables implement more comprehensive functionality by layering over other dTables -- for instance, building a read/write store from read-only tables and a writable journal, or building a general-purpose store from optimized special-purpose stores. The dTable design leads to a flexible system powerful enough to implement many database storage layouts. Our prototype implementation of Anvil performs up to 5.5 times faster than an existing B-tree-based database back end on conventional workloads, and can easily be customized for further gains on specific data and workloads.

Eddie Kohler | Mike Mammarella | Shant Hovsepian

[1] Michael Stonebraker,et al. C-Store: A Column-oriented DBMS , 2005, VLDB.

[2] Rudolf Bayer,et al. Organization and maintenance of large ordered indexes , 1972, Acta Informatica.

[3] Patrick E. O'Neil,et al. The log-structured merge-tree (LSM-tree) , 1996, Acta Informatica.

[4] Erez Zadok,et al. Versatility and Unix semantics in namespace unification , 2006, TOS.

[5] RosenblumMendel,et al. The design and implementation of a log-structured file system , 1991 .

[6] Jason Flinn,et al. Rethink the sync , 2006, OSDI '06.

[7] Lei Zhang,et al. Generalized file system dependencies , 2007, SOSP.

[8] Michael Stonebraker,et al. The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[9] Mendel Rosenblum,et al. The design and implementation of a log-structured file system , 1991, SOSP '91.

[10] Peter Boncz,et al. UvA-DARE ( Digital Academic Repository ) Monet ; a next-Generation DBMS Kernel For Query-Intensive Applications , 2007 .

[11] Don S. Batory,et al. GENESIS: An Extensible Database Management System , 1988, IEEE Trans. Software Eng..