CubeLoad: A Parametric Generator of Realistic OLAP Workloads

Differently from OLTP workloads, OLAP workloads are hardly predictable due to their inherently extemporary nature. Besides, obtaining real OLAP workloads by monitoring the queries actually issued in companies and organizations is quite hard. On the other hand, hardware and software benchmarking in the industrial world, as well as comparative evaluation of novel approaches in the research community, both need reference databases and workloads. In this paper we present CubeLoad, a parametric generator of workloads in the form of OLAP sessions, based on a realistic profile-based model. After describing the main features of CubeLoad, we discuss the results of some tests that show how workloads with very different features can be generated.

[1]  Stefano Rizzi,et al.  Predicting Your Next OLAP Query Based on Recent Analytical Sessions , 2013, DaWaK.

[2]  Hakan Hacigümüs,et al.  Towards a workload for evolutionary analytics , 2013, DanaC '13.

[3]  Matteo Golfarelli,et al.  Similarity measures for OLAP sessions , 2013, Knowledge and Information Systems.

[4]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[5]  Meikel Pöss,et al.  TPC-DS, taking decision support benchmarking to the next level , 2002, SIGMOD '02.

[6]  Matteo Golfarelli,et al.  Efficient derivation of numerical dependencies , 2013, Inf. Syst..

[7]  J. G. Skellam,et al.  A New Method for determining the Type of Distribution of Plant Individuals , 1954 .

[8]  Sunita Sarawagi,et al.  User-Adaptive Exploration of Multidimensional Data , 2000, VLDB.

[9]  Carsten Sapia,et al.  PROMISE: Predicting Query Behavior to Enable Predictive Caching Strategies for OLAP Systems , 2000, DaWaK.

[10]  Meikel Pöss Controlled SQL query evolution for decision support benchmarks , 2007, WOSP '07.

[11]  Matteo Golfarelli,et al.  The workload you have, the workload you would like , 2003, DOLAP '03.

[12]  Omar Boussaïd,et al.  DWEB: A Data Warehouse Engineering Benchmark , 2005, DaWaK.

[13]  Xuedong Chen,et al.  The Star Schema Benchmark and Augmented Fact Table Indexing , 2009, TPCTC.

[14]  Matteo Golfarelli,et al.  Mining Preferences from OLAP Query Logs for Proactive Personalization , 2011, ADBIS.