Predicting system performance for multi-tenant database workloads

Database consolidation is gaining wide acceptance as a means to reduce the cost and complexity of managing database systems. However, this new trend poses many interesting challenges for understanding and predicting system performance. The consolidated databases in multi-tenant settings share resources and compete with each other for these resources. In this work we present an experimental study to highlight how these interactions can be fairly complex. We argue that individual database staging or workload profiling is not an adequate approach to understanding the performance of the consolidated system. Our initial investigations suggest that machine learning approaches that use monitored data to model the system can work well for important tasks.

[1]  Ivan T. Bowman,et al.  A whitepaper from iAnywhere Solutions, Inc., a subsidiary of Sybase Inc. Capacity planning with SQL Anywhere , 2008 .

[2]  Carlo Curino,et al.  Relational Cloud: a Database Service for the cloud , 2011, CIDR.

[3]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[4]  Gregory R. Ganger,et al.  Argon: Performance Insulation for Shared Storage Servers , 2007, FAST.

[5]  Prashant J. Shenoy,et al.  Dynamic Provisioning of Multi-tier Internet Applications , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[6]  Shivnath Babu,et al.  Interaction-aware prediction of business intelligence workload completion times , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[7]  Yuan Zhou,et al.  Supporting Database Applications as a Service , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[8]  Torsten Grust,et al.  Multi-tenant databases for software as a service: schema-mapping techniques , 2008, SIGMOD Conference.

[9]  Sonja Kuhnt,et al.  Design and analysis of computer experiments , 2010 .

[10]  Archana Ganapathi,et al.  Predicting Multiple Metrics for Queries: Better Decisions Enabled by Machine Learning , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[11]  Qi Zhang,et al.  R-Capriccio: A Capacity Planning and Anomaly Detection Tool for Enterprise Services with Live Workloads , 2007, Middleware.

[12]  Shivnath Babu,et al.  Predicting completion times of batch query workloads using interaction-aware models and simulation , 2011, EDBT/ICDT '11.

[13]  Shivnath Babu,et al.  Query interactions in database workloads , 2009, DBTest '09.

[14]  Kamesh Munagala,et al.  Interaction-aware scheduling of report-generation workloads , 2011, The VLDB Journal.

[15]  PacificiGiovanni,et al.  An analytical model for multi-tier internet services and its applications , 2005 .

[16]  Kamesh Munagala,et al.  Modeling and exploiting query interactions in database systems , 2008, CIKM '08.

[17]  Kamesh Munagala,et al.  QShuffler: Getting the Query Mix Right , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[18]  Anastasia Ailamaki,et al.  Lachesis: Robust Database Storage Management Based on Device-specific Performance Characteristics , 2003, VLDB.