论文信息 - Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems

Evaluating the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for In-Memory Database Systems

In-memory column-store database systems are state of the art for the efficient processing of analytical workloads. In these systems, data compression as well as vectorization play an important role. Currently, the vectorized processing is done using regular SIMD (Single Instruction Multiple Data) extensions of modern processors. For example, Intel’s latest SIMD extension supports 512-bit vector registers which allows the parallel processing of 8× 64-bit values. From a database system perspective, this vectorization technique is not only very interesting for compression and decompression to reduce the computational overhead, but also for all database operators like joins, scan, as well as groupings. In contrast to these SIMD extensions, NEC Corporation has recently introduced a novel pure vector engine (supercomputer) as a co-processor called SX-Aurora TSUBASA. This vector engine features a vector length of 16.384 bits with the world’s highest bandwidth of up to 1.2 TB/s, which perfectly fits to data-intensive applications like in-memory database systems. Therefore, we describe the unique architecture and properties of this novel vector engine in this paper. Moreover, we present selected in-memory column-store-specific evaluation results to show the benefits of this vector engine compared to regular SIMD extensions. Finally, we conclude the paper with an outlook on our ongoing research activities in this direction.

Wolfgang Lehner | Dirk Habich | Patrick Damme | Johannes Pietrzyk | Erich Focht

[1] Johannes Gehrke,et al. Query optimization in compressed database systems , 2001, SIGMOD '01.

[2] Michael Stonebraker,et al. C-Store: A Column-oriented DBMS , 2005, VLDB.

[3] Wolfgang Lehner,et al. From a Comprehensive Experimental Survey to a Cost-based Selection Strategy for Lightweight Integer Compression Algorithms , 2019, ACM Trans. Database Syst..

[4] Alfons Kemper,et al. Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation , 2016, SIGMOD Conference.

[5] Wolfgang Lehner,et al. Fighting the Duplicates in Hashing: Conflict Detection-aware Vectorization of Linear Probing , 2019, BTW.

[6] Wolfgang Lehner,et al. MorphStore - In-Memory Query Processing based on Morphing Compressed Intermediates LIVE , 2019, SIGMOD Conference.

[7] Wolfgang Lehner,et al. Compression-Aware In-Memory Query Processing: Vision, System Design and Beyond , 2016, ADMS/IMDM@VLDB.

[8] Wolfgang Lehner,et al. Lightweight Data Compression Algorithms: An Experimental Survey (Experiments and Analyses) , 2017, EDBT.

[9] Jae-Gil Lee,et al. Joins on Encoded and Partitioned Data , 2014, Proc. VLDB Endow..

[10] Daniel J. Abadi,et al. Integrating compression and execution in column-oriented database systems , 2006, SIGMOD Conference.

[11] Samuel Madden,et al. Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware , 2016, Proc. VLDB Endow..

[12] Hector Garcia-Molina,et al. Main Memory Database Systems: An Overview , 1992, IEEE Trans. Knowl. Data Eng..

[13] Andreas Kipf,et al. Make the most out of your SIMD investments: counter control flow divergence in compiled query pipelines , 2018, DaMoN.

[14] Carsten Binnig,et al. Dictionary-based order-preserving string compression for main memory column stores , 2009, SIGMOD Conference.

[15] Kenneth A. Ross,et al. Rethinking SIMD Vectorization for In-Memory Databases , 2015, SIGMOD Conference.

[16] Hiroaki Kobayashi,et al. Performance Evaluation of a Vector Supercomputer SX-Aurora TSUBASA , 2018, SC18: International Conference for High Performance Computing, Networking, Storage and Analysis.

[17] Wolfgang Lehner,et al. Column Scan Acceleration in Hybrid CPU-FPGA Systems , 2018, ADMS@VLDB.

[18] Wolfgang Lehner,et al. Conflict Detection-Based Run-Length Encoding - AVX-512 CD Instruction Set in Action , 2018, 2018 IEEE 34th International Conference on Data Engineering Workshops (ICDEW).

[19] Martin L. Kersten,et al. MonetDB: Two Decades of Research in Column-oriented Database Architectures , 2012, IEEE Data Eng. Bull..

[20] Martin L. Kersten,et al. Breaking the memory wall in MonetDB , 2008, CACM.

[21] Wolfgang Lehner,et al. First Investigations of the Vector Supercomputer SX-Aurora TSUBASA as a Co-Processor for Database Systems , 2019, BTW.

[22] Wolfgang Lehner,et al. Direct Transformation Techniques for Compressed Data: General Approach and Application Scenarios , 2015, ADBIS.

[23] Leonid Boytsov,et al. Decoding billions of integers per second through vectorization , 2012, Softw. Pract. Exp..

[24] Wolfgang Lehner,et al. Make Larger Vector Register Sizes New Challenges?: Lessons Learned from the Area of Vectorized Lightweight Compression Algorithms , 2018, DBTest@SIGMOD.

[25] Wolfgang Lehner,et al. Adaptive Work Placement for Query Processing on Heterogeneous Computing Resources , 2017, Proc. VLDB Endow..

[26] Daniel J. Abadi,et al. Column oriented Database Systems , 2009, Proc. VLDB Endow..

[27] Jignesh M. Patel,et al. BitWeaving: fast scans for main memory data processing , 2013, SIGMOD '13.

[28] Marcin Zukowski,et al. Vectorwise: A Vectorized Analytical DBMS , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[29] Marcin Zukowski,et al. Super-Scalar RAM-CPU Cache Compression , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[30] Patrick Damme. Query Processing Based on Compressed Intermediates , 2017, PhD@VLDB.

[31] Bingsheng He,et al. In-Cache Query Co-Processing on Coupled CPU-GPU Architectures , 2014, Proc. VLDB Endow..

[32] Wolfgang Lehner,et al. QPPT: Query Processing on Prefix Trees , 2013, CIDR.

[33] Feng Li,et al. Accelerating Relational Databases by Leveraging Remote Memory and RDMA , 2016, SIGMOD Conference.

[34] Setrag Khoshafian,et al. A decomposition storage model , 1985, SIGMOD Conference.

[35] Ismail Oukid,et al. Memory Management Techniques for Large-Scale Persistent-Main-Memory Systems , 2017, Proc. VLDB Endow..

[36] Guy M. Lohman,et al. Optimizing GPU-accelerated Group-By and Aggregation , 2015, ADMS@VLDB.