Host-Based Virtual Machine Workload Characterization Using Hypervisor Trace Mining

Cloud computing is a fast-growing technology that provides on-demand access to a pool of shared resources. This type of distributed and complex environment requires advanced resource management solutions that could model virtual machine (VM) behavior. Different workload measurements, such as CPU, memory, disk, and network usage, are usually derived from each VM to model resource utilization and group similar VMs. However, these course workload metrics require internal access to each VM with the available performance analysis toolkit, which is not feasible with many cloud environments privacy policies. In this article, we propose a non-intrusive host-based virtual machine workload characterization using hypervisor tracing. VM blockings duration, along with virtual interrupt injection rates, are derived as features to reveal multiple levels of resource intensiveness. In addition, the VM exit reason is considered, as well as the resource contention rate due to the host and other VMs. Moreover, the processes and threads preemption rates in each VM are extracted using the collected tracing logs. Our proposed approach further improves the selected features by exploiting a page ranking based algorithm to filter non-important processes running on each VM. Once the metric features are defined, a two-stage VM clustering technique is employed to perform both coarse- and fine-grain workload characterization. The inter-cluster and intra-cluster similarity metrics of the silhouette score is used to reveal distinct VM workload groups, as well as the ones with significant overlap. The proposed framework can provide a detailed vision of the underlying behavior of the running VMs. This can assist infrastructure administrators in efficient resource management, as well as root cause analysis.

[1]  Michel Dagenais,et al.  Fine-grained preemption analysis for latency investigation across virtual machines , 2014, Journal of Cloud Computing.

[2]  Renata Spolon Lobato,et al.  Improving Virtual Machine Consolidation for Heterogeneous Cloud Computing Datacenters , 2019, 2019 31st International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD).

[3]  Ravi S. Sandhu,et al.  Malware Detection in Cloud Infrastructures Using Convolutional Neural Networks , 2018, 2018 IEEE 11th International Conference on Cloud Computing (CLOUD).

[4]  Xiaohui Gu,et al.  Ieee Transactions on Parallel and Distributed Systems (tpds) Perfcompass: Online Performance Anomaly Fault Localization and Inference in Infrastructure-as-a-service Clouds , 2022 .

[5]  Michel Dagenais,et al.  An SVM-based framework for detecting DoS attacks in virtualized clouds under changing environment , 2018, Journal of Cloud Computing.

[6]  Seyed Vahid Azhari,et al.  Host Hypervisor Trace Mining for Virtual Machine Workload Characterization , 2019, 2019 IEEE International Conference on Cloud Engineering (IC2E).

[7]  Xiao Zhang,et al.  PerfInsight: A Robust Clustering-Based Abnormal Behavior Detection System for Large-Scale Cloud , 2018, 2018 IEEE 11th International Conference on Cloud Computing (CLOUD).

[8]  Michel Dagenais,et al.  Virtual CPU State Detection and Execution Flow Analysis by Host Tracing , 2016, 2016 IEEE International Conferences on Big Data and Cloud Computing (BDCloud), Social Computing and Networking (SocialCom), Sustainable Computing and Communications (SustainCom) (BDCloud-SocialCom-SustainCom).

[9]  David Lie,et al.  Manitou: a layer-below approach to fighting malware , 2006, ASID '06.

[10]  Claudia Canali,et al.  Detecting similarities in virtual machine behavior for cloud monitoring using smoothed histograms , 2014, J. Parallel Distributed Comput..

[11]  Ying Wang,et al.  A workload prediction-based multi-VM provisioning mechanism in cloud computing , 2013, 2013 15th Asia-Pacific Network Operations and Management Symposium (APNOMS).

[12]  M. Desnoyers,et al.  The LTTng tracer: A low impact performance and behavior monitor for GNU/Linux , 2006 .

[13]  Andrea C. Arpaci-Dusseau,et al.  VMM-based hidden process detection and identification using Lycosid , 2008, VEE '08.

[14]  Ravi S. Sandhu,et al.  Clustering-Based IaaS Cloud Monitoring , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[15]  Claudia Canali,et al.  Exploiting Classes of Virtual Machines for Scalable IaaS Cloud Management , 2015, 2015 IEEE Fourth Symposium on Network Cloud Computing and Applications (NCCA).

[16]  Inderjit S. Dhillon,et al.  Kernel k-means: spectral clustering and normalized cuts , 2004, KDD.

[17]  Michel Dagenais,et al.  State History Tree: An Incremental Disk-Based Data Structure for Very Large Interval Data , 2013, 2013 International Conference on Social Computing.

[18]  Michel Dagenais,et al.  VM processes state detection by hypervisor tracing , 2018, 2018 Annual IEEE International Systems Conference (SysCon).

[19]  Michel Dagenais,et al.  Multilayer Virtualized Systems Analysis with Kernel Tracing , 2016, 2016 IEEE 4th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW).

[20]  Robert G. Gallager,et al.  A new distributed algorithm to find breadth first search trees , 1987, IEEE Trans. Inf. Theory.

[21]  Wenpu Xing,et al.  Weighted PageRank algorithm , 2004, Proceedings. Second Annual Conference on Communication Networks and Services Research, 2004..

[22]  Wenbin Yao,et al.  A Weighted PageRank-Based Algorithm for Virtual Machine Placement in Cloud Computing , 2019, IEEE Access.

[23]  Michel Dagenais,et al.  Critical Path Analysis through Hierarchical Distributed Virtualized Environments using Host Kernel Tracing , 2020 .

[24]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[25]  Michel Dagenais,et al.  virtFlow: Guest Independent Execution Flow Analysis Across Virtualized Environments , 2020, IEEE Transactions on Cloud Computing.

[26]  Guofei Jiang,et al.  CLUE: System trace analytics for cloud service performance diagnosis , 2014, 2014 IEEE Network Operations and Management Symposium (NOMS).

[27]  Claudia Canali,et al.  Exploiting ensemble techniques for automatic virtual machine clustering in cloud systems , 2013, Automated Software Engineering.