Icarus: Towards a multistore database system

The last years have seen a vast diversification on the database market. In contrast to the “one-size-fits-all” paradigm according to which systems have been designed in the past, today's database management systems (DBMS) are tuned for particular workloads. This has led to DBMSs optimized for high performance, high throughput read/write workloads in online transaction processing (OLTP) and systems optimized for complex analytical queries (OLAP). However, this approach reaches a limit when systems have to deal with mixed workloads that are neither pure OLAP nor pure OLTP workloads. In such cases, multistores are increasingly gaining popularity. Rather than supporting one single database paradigm and addressing one particular workload, multistores encompass several DBMSs that store data in different schemas and allow to route requests on a per-query level to the most appropriate system. In this paper, we introduce the multistore ICARUS. In our evaluation based on a workload that combines OLTP and OLAP elements, we show that ICARUS is able to speed-up queries up to a factor of three by properly routing queries to the best underlying DBMS.

[1]  Sami Bhiri,et al.  ODBAPI: A Unified REST API for Relational and NoSQL Data Stores , 2014, 2014 IEEE International Congress on Big Data.

[2]  Michael Stonebraker,et al.  The BigDAWG Polystore System , 2015, SGMD.

[3]  Dawoon Jung,et al.  HeteroDrive : Reshaping the Storage Access Pattern of OLTP Workload Using SSD , 2009 .

[4]  Peiquan Jin,et al.  Optimizing B+-tree for hybrid storage systems , 2014, Distributed and Parallel Databases.

[5]  Lin Ma,et al.  Self-Driving Database Management Systems , 2017, CIDR.

[6]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[7]  J. T. Robinson,et al.  On optimistic methods for concurrency control , 1979, TODS.

[8]  Kenneth Salem,et al.  Hybrid Storage Management for Database Systems , 2013, Proc. VLDB Endow..

[9]  Ioana Manolescu,et al.  Flexible hybrid stores: Constraint-based rewriting to the rescue , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[10]  Wolfgang Lehner,et al.  SAP HANA database: data management for modern business applications , 2012, SGMD.

[11]  Ioana Manolescu,et al.  Invisible Glue: Scalable Self-Tunning Multi-Stores , 2015, CIDR.

[12]  Michael Stonebraker Technical perspectiveOne size fits all: an idea whose time has come and gone , 2008, CACM.

[13]  Dimitrios Tsoumakos,et al.  MuSQLE: Distributed SQL query execution over multiple engine environments , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[14]  Alexander Zeier,et al.  HYRISE - A Main Memory Hybrid Storage Engine , 2010, Proc. VLDB Endow..

[15]  Martin Grund,et al.  Hybrid graph and relational query processing in main memory , 2013, 2013 IEEE 29th International Conference on Data Engineering Workshops (ICDEW).

[16]  Le Gruenwald,et al.  Towards a hybrid row-column database for a cloud-based medical data management system , 2012, Cloud-I '12.

[17]  Martin Grund,et al.  A Demonstration of HYRISE - A Main Memory Hybrid Storage Engine , 2011, Proc. VLDB Endow..

[18]  Barzan Mozafari,et al.  SnappyData: A Unified Cluster for Streaming, Transactions and Interactice Analytics , 2017, CIDR.

[19]  Norbert Ritter,et al.  Towards a Scalable and Unified REST API for Cloud Data Stores , 2014, GI-Jahrestagung.

[20]  Philipp Rösch,et al.  A Storage Advisor for Hybrid-Store Databases , 2012, Proc. VLDB Endow..