Multi-core vs. I/O Wall: The Approaches to Conquer and Cooperate

Multi-core comes to be the mainstream of processor techniques. The data-intensive OLAP relies on inexpensive disks as massive data storage device, so the enhanced processing power oppose to I/O bottleneck in big data OLAP applications becomes more critical because the latency gap between I/O and multi-core gets even larger. In this paper, we focus on the disk resident OLAP with large dataset, exploiting the power of multi-core processing under I/O bottleneck. We propose optimizations for schema-aware storage layout, parallel accessing and I/O latency aware concurrent processing. On the one hand I/O bottleneck should be conquered to reduce latency for multi-core processing, on the other hand we can make good use of I/O latency for heavy concurrent query workload with multi-core power. We design experiments to exploit parallel and concurrent processing power for multi-core with DDTA-OLAP engine which minimizes the star-join cost by directly dimension tuple accessing technique. The experimental results show that we can achieve maximal speedup ratio of 103 for multi-core concurrent query processing in DRDB scenario.

[1]  Ryan Johnson,et al.  Row-wise parallel predicate evaluation , 2008, Proc. VLDB Endow..

[2]  Roger MacNicol,et al.  Sybase IQ Multiplex - Designed For Analytics , 2004, VLDB.

[3]  David J. DeWitt,et al.  Data page layouts for relational databases on deep memory hierarchies , 2002, The VLDB Journal.

[4]  Phillip M. Fernandez Red brick warehouse: a read-mostly RDBMS for open SMP platforms , 1994, SIGMOD '94.

[5]  Martin L. Kersten,et al.  Database Architecture Optimized for the New Bottleneck: Memory Access , 1999, VLDB.

[6]  Kenneth Salem,et al.  Workload-aware storage layout for database systems , 2010, SIGMOD Conference.

[7]  Anastasia Ailamaki,et al.  StagedDB: Designing Database Servers for Modern Hardware , 2005, IEEE Data Eng. Bull..

[8]  Shan Wang,et al.  MOSS-DB: A Hardware-Aware OLAP Database , 2010, WAIM.

[9]  Marcin Zukowski,et al.  Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS , 2007, VLDB.

[10]  Michael Stonebraker,et al.  C-Store: A Column-oriented DBMS , 2005, VLDB.

[11]  Daniel J. Abadi,et al.  Column-stores vs. row-stores: how different are they really? , 2008, SIGMOD Conference.

[12]  Carsten Binnig,et al.  Dictionary-based order-preserving string compression for main memory column stores , 2009, SIGMOD Conference.

[13]  Marcin Zukowski,et al.  DSM vs. NSM: CPU performance tradeoffs in block-oriented query processing , 2008, DaMoN '08.

[14]  George Candea,et al.  A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses , 2009, Proc. VLDB Endow..

[15]  Nicolas Bruno Teaching an Old Elephant New Tricks , 2009, CIDR.

[16]  Subramanian Arumugam,et al.  The DataPath system: a data-centric analytic processing engine for large data warehouses , 2010, SIGMOD Conference.