Incremental join view maintenance on distributed log-structured storage

Modern database systems desperate for the ability to support highly scalable transactions and efficient queries simultaneously for real-time applications. One solution is to utilize query optimization techniques on the on-line transaction processing (OLTP) systems. The materialized view is considered as a panacea to decrease query latency. However, it also involves the significant cost of maintenance which trades away transaction performance. In this paper, we examine the design space and conclude several design features for the implementation of a view on a distributed log-structured merge-tree (LSM-tree), which is a well-known structure for improving data write performance. As a result, we develop two incremental view maintenance (IVM) approaches on LSM-tree. One avoids join computation in view maintenance transactions. Another with two optimizations is proposed to decouple the view maintenance with the transaction process. Under the asynchronous update, we also provide consistency queries for views. Experiments on TPC-H benchmark show our methods achieve better performance than straightforward methods on different workloads.

[1]  Liana L. Fong,et al.  Diff-Index: Differentiated Index in Distributed Log-Structured Data Stores , 2014, EDBT.

[2]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[3]  Rada Chirkova,et al.  Materialized Views , 2012, Found. Trends Databases.

[4]  Yannis Papakonstantinou,et al.  Utilizing IDs to Accelerate Incremental View Maintenance , 2015, SIGMOD Conference.

[5]  Vivek R. Narasayya,et al.  Automatic physical design tuning: workload as a sequence , 2006, SIGMOD Conference.

[6]  Jim Gray,et al.  A critique of ANSI SQL isolation levels , 1995, SIGMOD '95.

[7]  GrayJim,et al.  A critique of ANSI SQL isolation levels , 1995 .

[8]  Raghu Ramakrishnan,et al.  bLSM: a general purpose log structured merge tree , 2012, SIGMOD Conference.

[9]  Milos Nikolic,et al.  DBToaster: Higher-order Delta Processing for Dynamic, Frequently Fresh Views , 2012, Proc. VLDB Endow..

[10]  LarsonPer-Åke,et al.  Optimizing queries using materialized views , 2001 .

[11]  GhemawatSanjay,et al.  The Google file system , 2003 .

[12]  Milos Nikolic,et al.  How to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with Batch Updates , 2016, SIGMOD Conference.

[13]  Parag Agrawal,et al.  Asynchronous view maintenance for VLSD databases , 2009, SIGMOD Conference.

[14]  Werner Vogels,et al.  Dynamo: amazon's highly available key-value store , 2007, SOSP.

[15]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[16]  Jennifer Widom,et al.  Database System Implementation , 2000 .

[17]  Rada Chirkova,et al.  Materialized Views , 2012, Found. Trends Databases.

[18]  Sudipta Sengupta,et al.  The Bw-Tree: A B-tree for new hardware platforms , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[19]  Surajit Chaudhuri,et al.  Automated Selection of Materialized Views and Indexes in SQL Databases , 2000, VLDB.

[20]  Wei Cao,et al.  X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing , 2019, SIGMOD Conference.

[21]  Patrick E. O'Neil,et al.  The log-structured merge-tree (LSM-tree) , 1996, Acta Informatica.

[22]  Jonathan Goldstein,et al.  Optimizing queries using materialized views: a practical, scalable solution , 2001, SIGMOD '01.

[23]  Zhe Chen,et al.  AnalyticDB: Real-time OLAP Database System at Alibaba Cloud , 2019, Proc. VLDB Endow..

[24]  Ralph Deters,et al.  Terms analytics service for CouchDB: a document-based NoSQL , 2015, Int. J. Big Data Intell..

[25]  Alfons Kemper,et al.  HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[26]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[27]  César A. Galindo-Legaria,et al.  Outerjoins as disjunctions , 1994, SIGMOD '94.

[28]  Jingren Zhou,et al.  Efficient Maintenance of Materialized Outer-Join Views , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[29]  Daniel J. Abadi,et al.  Column-stores vs. row-stores: how different are they really? , 2008, SIGMOD Conference.

[30]  Hamid Pirahesh,et al.  Answering complex SQL queries using automatic summary tables , 2000, SIGMOD 2000.

[31]  Mohamed Ziauddin,et al.  Materialized Views in Oracle , 1998, VLDB.

[32]  Aoying Zhou,et al.  Incremental Materialized View Maintenance on Distributed Log-Structured Merge-Tree , 2018, DASFAA.

[33]  Hicham G. Elmongui,et al.  Lazy Maintenance of Materialized Views , 2007, VLDB.