Scheduling OLTP transactions via learned abort prediction

Current main memory database system architectures are still challenged by high contention workloads and this challenge will continue to grow as the number of cores in processors continues to increase [23]. These systems schedule transactions randomly across cores to maximize concurrency and to produce a uniform load across cores. Scheduling never considers potential conflicts. Performance could be improved if scheduling balanced between concurrency to maximize throughput and scheduling transactions linearly to avoid conflicts. In this paper, we present the design of several intelligent transaction scheduling algorithms that consider both potential transaction conflicts and concurrency. To incorporate reasoning about transaction conflicts, we develop a supervised machine learning model that estimates the probability of conflict. This model is incorporated into several scheduling algorithms. In addition, we integrate an unsupervised machine learning algorithm into an intelligent scheduling algorithm. We then empirically measure the performance impact of different scheduling algorithms on OLTP and social networking workloads. Our results show that, with appropriate settings, intelligent scheduling can increase throughput by 54% and reduce abort rate by 80% on a 20-core machine, relative to random scheduling. In summary, the paper provides preliminary evidence that intelligent scheduling significantly improves DBMS performance.

[1]  Ippokratis Pandis,et al.  Data-oriented transaction execution , 2010, Proc. VLDB Endow..

[2]  Haibo Chen,et al.  Scaling Multicore Databases via Constrained Parallel Execution , 2016, SIGMOD Conference.

[3]  Carlo Curino,et al.  Schism , 2010, Proc. VLDB Endow..

[4]  Michael Stonebraker,et al.  Staring into the Abyss: An Evaluation of Concurrency Control with One Thousand Cores , 2014, Proc. VLDB Endow..

[5]  Bruce M. Maggs,et al.  Holistic Query Transformations for Dynamic Web Applications , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[6]  Alvin Cheung,et al.  Improving High Contention OLTP Performance via Transaction Scheduling , 2018, ArXiv.

[7]  Alvin Cheung,et al.  Automatic Partitioning of Database Applications , 2012, Proc. VLDB Endow..

[8]  Alvin Cheung,et al.  Leveraging Lock Contention to Improve OLTP Application Performance , 2016, Proc. VLDB Endow..

[9]  Daniel J. Abadi,et al.  Calvin: fast distributed transactions for partitioned database systems , 2012, SIGMOD Conference.

[10]  Srinivas Devadas,et al.  TicToc: Time Traveling Optimistic Concurrency Control , 2016, SIGMOD Conference.

[11]  Mor Harchol-Balter,et al.  Size-based scheduling to improve web performance , 2003, TOCS.

[12]  Ali Ghodsi,et al.  Coordination Avoidance in Database Systems , 2014, Proc. VLDB Endow..

[13]  Patrick Valduriez,et al.  Transaction chopping: algorithms and performance studies , 1995, TODS.

[14]  Carlo Curino,et al.  Skew-aware automatic database partitioning in shared-nothing, parallel OLTP systems , 2012, SIGMOD Conference.

[15]  Andrew Pavlo,et al.  Performance of OLTP via Intelligent Scheduling , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[16]  Eddie Kohler,et al.  Speedy transactions in multicore in-memory databases , 2013, SOSP.

[17]  Grant Schoenebeck,et al.  Contention-Aware Lock Scheduling for Transactional Databases , 2018, Proc. VLDB Endow..

[18]  Lin Ma,et al.  Self-Driving Database Management Systems , 2017, CIDR.

[19]  Alfons Kemper,et al.  HyPer: A hybrid OLTP&OLAP main memory database system based on virtual memory snapshots , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[20]  Michael Stonebraker,et al.  The End of an Architectural Era (It's Time for a Complete Rewrite) , 2007, VLDB.

[21]  R. Weisberg A-N-D , 2011 .

[22]  Daniel J. Abadi,et al.  Lightweight Locking for Main Memory Database Systems , 2012, Proc. VLDB Endow..

[23]  Ippokratis Pandis,et al.  ERMIA: Fast Memory-Optimized Database System for Heterogeneous Workloads , 2016, SIGMOD Conference.

[24]  Richard D. Secrest Data Base Administration , 1975, ACM Pacific.

[25]  Alfons Kemper,et al.  Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems , 2015, SIGMOD Conference.