A Performance Prediction Model for Database Environments: A Preliminary Analysis

Properly addressing the performance issues presented in database systems is and has been a significant technological challenge, this due to the uncontrolled fluctuation of user requests. Being able to predict the behaviour of such systems can greatly improve their performance. Several prediction methods, such as linear regression and autoregressive moving average, among others, have extensively been used to predict performance in shared environments where a workload is involved. However, not all them produce accurate predictions when the system is working under different workloads. In this paper, we present our preliminary results on exploring the accuracy of two different approaches (exact and approximate methods) used to predict the response time of a database system subject to different workloads in a controlled environment. Our results show that approximate methods present better prediction accuracy when compared to exact methods. Hence, we consider the main contributions of this work the following: (a) the results obtained from comparing exact and approximate methods, since they can be used as a basis for further works addressing similar problems, and (b) a preliminary prediction model also based on our findings.

[1]  Ian Witten,et al.  Data Mining , 2000 .

[2]  S. Upadhyaya,et al.  Comparison of NN and LR classifiers in the context of screening native American elders with diabetes , 2013, Expert Syst. Appl..

[3]  Yuhui Deng,et al.  Self-similarity: Behind workload reshaping and prediction , 2012, Future Gener. Comput. Syst..

[4]  A. Sarangi,et al.  Comparison of Artificial Neural Network and regression models for sediment loss prediction from Banha watershed in India , 2005 .

[5]  Wolfgang Lehner,et al.  Database as a service (DBaaS) , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[6]  Samik Raychaudhuri,et al.  Introduction to Monte Carlo simulation , 2008, 2008 Winter Simulation Conference.

[7]  Hubert Cardot,et al.  SOM time series clustering and prediction with recurrent neural networks , 2011, Neurocomputing.

[8]  Kaushik Dutta,et al.  Application performance modeling in a virtualized environment , 2010, HPCA - 16 2010 The Sixteenth International Symposium on High-Performance Computer Architecture.

[9]  Si-Qing Zheng,et al.  Online System for Grid Resource Monitoring and Machine Learning-Based Prediction , 2012, IEEE Transactions on Parallel and Distributed Systems.

[10]  Siegfried Benkner,et al.  A Service-Oriented Grid Environment with On-demand QoS Support , 2009, 2009 Congress on Services - I.

[11]  Enrico Schumann,et al.  Heuristic Methods in a Nutshell , 2011 .

[12]  Paolo Missier,et al.  Predicting the Execution Time of Workflow Activities Based on Their Input Features , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[13]  Wei Xu,et al.  Advances and challenges in log analysis , 2011, Commun. ACM.

[14]  Nikos Parlavantzas,et al.  Using Lightweight Virtual Machines to Run High Performance Computing Applications: The Case of the Weather Research and Forecasting Model , 2011, 2011 Fourth IEEE International Conference on Utility and Cloud Computing.

[15]  Abir Awad Abir Awad , 2022 .

[16]  Lizy Kurian John,et al.  Predictive Heterogeneity-Aware Application Scheduling for Chip Multiprocessors , 2014, IEEE Transactions on Computers.

[17]  Enrico Schumann,et al.  Numerical Methods and Optimization in Finance , 2011 .

[18]  Stefan Wesner,et al.  Service level agreements for job control in high-performance computing , 2010, Proceedings of the International Multiconference on Computer Science and Information Technology.

[19]  Ali Afzal,et al.  Capacity planning and scheduling in Grid computing environments , 2008, Future Gener. Comput. Syst..

[20]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[21]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .