...like Commanding an Anthill: A Case for Micro-Distributed (Data) Management Systems

Computer system architecture has changed: an assembly of autonomous components has replaced the omnipotent CPU and its legion of dumb devices. Database Management System (DBMS) architecture, however, does not yet reflect this change: it is still dominated by a centralized kernel that limits the autonomy of the devices and, thus, their ability to exploit their increased â smartnessâ . Distributed data management research can serve as an inspiration for an architecture that addresses this problem. However, the respective algorithms were never designed with CPU efficiency in mind implementing principles like dynamic programming and recursion. More than two decades ago, the transition to memory resident databases spawned a plethora of research on CPU-efficient query processors. We predict that hardware heterogeneity will trigger a similar line of research on CPU-efficient distributed algorithms and architectures. In this paper, we examine benefits and challenges that come with such a micro-distributed database management system. We also discuss a number of approaches that we consider steps towards a micro-distributed system

[1]  Joel H. Saltz,et al.  Active disks: programming model, algorithms and evaluation , 1998, ASPLOS VIII.

[2]  Gustavo Alonso,et al.  Ibex - An Intelligent Storage Engine with Support for Advanced SQL Off-loading , 2014, Proc. VLDB Endow..

[3]  Bingsheng He,et al.  Relational query coprocessing on graphics processors , 2009, TODS.

[4]  Anastasia Ailamaki,et al.  StagedDB: Designing Database Servers for Modern Hardware , 2005, IEEE Data Eng. Bull..

[5]  Douglas C. Schmidt,et al.  Reactor: an object behavioral pattern for concurrent event demultiplexing and event handler dispatching , 1995 .

[6]  James O. Coplien,et al.  Pattern languages of program design , 1995 .

[7]  Sanjeev Saxena,et al.  On Parallel Prefix Computation , 1994, Parallel Process. Lett..

[8]  Michael Cornwell,et al.  Anatomy of a solid-state drive , 2012, CACM.

[9]  Hector Garcia-Molina,et al.  Main Memory Database Systems: An Overview , 1992, IEEE Trans. Knowl. Data Eng..

[10]  David Sidler Column Storage for FPGA-accelerated Data Analytics , 2013 .

[11]  Douglas C. Schmidt,et al.  An Object Behavioral Pattern for Demultiplexing and Dispatching Handlers for Asynchronous Events , 1998 .

[12]  Adrian Schüpbach,et al.  The multikernel: a new OS architecture for scalable multicore systems , 2009, SOSP '09.

[13]  Gustavo Alonso,et al.  Database engines on multicores, why parallelize when you can distribute? , 2011, EuroSys '11.

[14]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[15]  Martin L. Kersten,et al.  Breaking the memory wall in MonetDB , 2008, CACM.