A methodology for auto-recognizing DBMS workloads

The type of the workload on a database management system (DBMS) is a key consideration in tuning the system. Allocations for resources such as main memory can be very different depending on whether the workload type is Online Transaction Processing (OLTP) or Decision Support System (DSS). A DBMS also typically experiences changes in the type of workload it handles during its normal processing cycle. Database administrators must, therefore, recognize the significant shifts of workload type that demand reconfiguring the system in order to maintain acceptable levels of performance. We envision autonomous, self-tuning DBMSs that have the capability to manage their own performance by automatically recognizing the workload type and then reconfiguring their resources accordingly. In this paper, we present an approach to automatically identifying a DBMS workload as either OLTP or DSS. We build a classification model based on the most significant workload characteristics that differenti ate OLTP from DSS and then use the model to identify any change in the workload type. We construct and compare classifiers built from two different sets of industry-standard workloads, namely the TPC-C and TPC-H benchmarks, and the Browsing and Ordering profiles from the TPC-W benchmark. We conduct various sets of experiments that show that our workload classifiers are reliable, and have high accuracy in recognizing the type of the workload mix and in estimating the degree of its concentration.

[1]  Philip S. Yu,et al.  On Workload Characterization of Relational Database Environments , 1992, IEEE Trans. Software Eng..

[2]  Luiz André Barroso,et al.  Memory system characterization of commercial workloads , 1998, ISCA.

[3]  Alan Jay Smith,et al.  Characteristics of production database workloads and the TPC benchmarks , 2001, IBM Syst. J..

[4]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[5]  David A. Patterson,et al.  Towards a Simplified Database Workload for Computer Architecture Evaluations , 2000 .

[6]  Carsten Sapia,et al.  PROMISE: Predicting Query Behavior to Enable Predictive Caching Strategies for OLAP Systems , 2000, DaWaK.

[7]  BarrosoLuiz Andre,et al.  Memory system characterization of commercial workloads , 1998 .

[8]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[9]  Petr Jan Horn,et al.  Autonomic Computing: IBM's Perspective on the State of Information Technology , 2001 .

[10]  David A. Patterson,et al.  Performance characterization of a Quad Pentium Pro SMP using OLTP workloads , 1998, ISCA.

[11]  Michael Stonebraker,et al.  The Asilomar report on database research , 1998, SGMD.

[12]  Allan N. Packer Configuring and Tuning Databases on the Solaris Platform , 2001 .

[13]  Philip S. Yu,et al.  Performance Analysis of Affinity Clustering on Transaction Processing Coupling Architecture , 1994, IEEE Trans. Knowl. Data Eng..

[14]  David J. DeWitt,et al.  DBMSs on a Modern Processor: Where Does Time Go? , 1999, VLDB.

[15]  Said Elnaffar,et al.  Characterizing Computer Systems' Workloads , 2002 .