论文信息 - From A to E: analyzing TPC's OLTP benchmarks: the obsolete, the ubiquitous, the unexplored

From A to E: analyzing TPC's OLTP benchmarks: the obsolete, the ubiquitous, the unexplored

Introduced in 2007, TPC-E is the most recently standardized OLTP benchmark by TPC. Even though TPC-E has already been around for six years, it has not gained the popularity of its predecessor TPC-C: all the published results for TPC-E use a single database vendor's product. TPC-E is significantly different than its predecessors. Some of its distinguishing characteristics are the non-uniform input creation, longer-running and more complicated transactions, more difficult partitioning etc. These factors slow down the adoption of TPC-E. In turn, there is little knowledge in the community about how TPC-E behaves micro-architecturally and within the database engine. To shed light on TPC-E, we implement it on top of a scalable open-source database engine, Shore-MT, and perform a workload characterization study, comparing it with the previous, much better known OLTP benchmarks of TPC: TPC-B and TPC-C. In parallel, we study the evolution of the OLTP benchmarks throughout the decades. Our results demonstrate that TPC-E exhibits similar micro-architectural behavior to TPC-B and TPC-C, even though it incurs less stall time and higher instructions per cycle. On the other hand, within the database engine it suffers more from logical lock contention. Therefore, we argue that, on the hardware side, TPC-E needs less aggressive processors. Whereas on the software side it can benefit from designs based on intra-transaction parallelism, logical partitioning, and optimistic concurrency control to minimize the effects of lock contention without introducing distributed transactions.

[1] Ippokratis Pandis,et al. Improving OLTP Scalability using Speculative Lock Inheritance , 2009, Proc. VLDB Endow..

[2] David A. Patterson,et al. Performance characterization of a Quad Pentium Pro SMP using OLTP workloads , 1998, ISCA.

[3] Sarita V. Adve,et al. Performance of database workloads on shared-memory systems with out-of-order processors , 1998, ASPLOS VIII.

[4] Ippokratis Pandis,et al. Data-oriented transaction execution , 2010, Proc. VLDB Endow..

[5] Qi Zhang,et al. Characterization of storage workload traces from production Windows Servers , 2008, 2008 IEEE International Symposium on Workload Characterization.

[6] Michael Stonebraker,et al. The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[7] Babak Falsafi,et al. Database Servers on Chip Multiprocessors: Limitations and Opportunities , 2007, CIDR.

[8] Luiz André Barroso,et al. Memory system characterization of commercial workloads , 1998, ISCA.

[9] R. Stets,et al. A detailed comparison of two transaction processing workloads , 2002, 2002 IEEE International Workshop on Workload Characterization.

[10] Ippokratis Pandis,et al. Aether: A Scalable Approach to Logging , 2010, Proc. VLDB Endow..

[11] James E. Smith,et al. A performance counter architecture for computing accurate CPI components , 2006, ASPLOS XII.

[12] C. Mohan,et al. ARIES/KVL: A Key-Value Locking Method for Concurrency Control of Multiaction Transactions Operating on B-Tree Indexes , 1990, VLDB.

[13] Goetz Graefe,et al. Foster b-trees , 2012, TODS.

[14] Anastasia Ailamaki,et al. SLICC: Self-Assembly of Instruction Cache Collectives for OLTP Workloads , 2012, 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture.

[15] Ippokratis Pandis,et al. OLTP on Hardware Islands , 2012, Proc. VLDB Endow..

[16] Babak Falsafi,et al. Shore-MT: a scalable storage manager for the multicore era , 2009, EDBT '09.

[17] Koushik Chakraborty,et al. Computation spreading: employing hardware migration to specialize CMP cores on-the-fly , 2006, ASPLOS XII.

[18] Ippokratis Pandis,et al. PLP: Page Latch-free Shared-everything OLTP , 2011, Proc. VLDB Endow..

[19] David J. DeWitt,et al. Data page layouts for relational databases on deep memory hierarchies , 2002, The VLDB Journal.

[20] C. Mohan,et al. ARIES/IM: an efficient and high concurrency index management method using write-ahead logging , 1992, SIGMOD '92.

[21] Philip A. Bernstein,et al. Categories and Subject Descriptors: H.2.4 [Database Management]: Systems. , 2022 .

[22] Jignesh M. Patel,et al. Wimpy node clusters: what about non-wimpy workloads? , 2010, DaMoN '10.

[23] David J. DeWitt,et al. Shoring up persistent applications , 1994, SIGMOD '94.

[24] Ippokratis Pandis,et al. TPC-E vs. TPC-C: characterizing the new TPC-E benchmark via an I/O comparison study , 2011, SGMD.

[25] Anastasia Ailamaki,et al. Optimistic Intra-Transaction Parallelism on Chip Multiprocessors , 2005, VLDB.

[26] David J. DeWitt,et al. DBMSs on a Modern Processor: Where Does Time Go? , 1999, VLDB.

[27] Jignesh M. Patel,et al. High-Performance Concurrency Control Mechanisms for Main-Memory Databases , 2011, Proc. VLDB Endow..

[28] Babak Falsafi,et al. Clearing the clouds: a study of emerging scale-out workloads on modern hardware , 2012, ASPLOS XVII.

[29] Michael Stonebraker,et al. A measure of transaction processing power , 1985 .