Paper information - A CPU Contention Predictor for Business-Critical Workloads in Cloud Datacenters

Resource contention is one of the major problems in cloud datacenters. Several types of resource contention occur, and they significantly affect the performance, and sometimes even the reliability, of applications running in cloud datacenters. Cloud applications with different workloads run together on the same physical machines, which results in non-synchronized accesses to shared resources. As a result, co-hosted applications contend for common resources and do not receive the resource amounts they demand. In this work, we investigate contention for CPU resources, because typical SLAs allow the CPU to be over-committed. We propose a CPU-contention predictor for demanding business-critical workloads, which require low resource contention to deliver the required performance to customers. Our predictor is based on a set of regression models and metrics, which we evaluate extensively. We tune the predictor with data collected from a real-world cloud operation that spans multiple datacenters and services business-critical workloads.

Alexandru Iosup | Vincent van Beek | Giorgos Oikonomou
