Performance Evaluation of HPC Benchmarks on VMware's ESXi Server

A major obstacle to virtualizing HPC workloads is a concern about the performance loss due to virtualization. We will demonstrate that new features significantly enhance the performance and scalability of virtualized HPC workloads on VMware's virtualization platform. Specifically, we will discuss VMware's ESXi Server performance for virtual machines with up to 64 virtual CPUs as well as support for exposing virtual NUMA topology to guest operating systems, enabling the operating system and applications to make intelligent NUMA aware decisions about memory allocation and process/thread placement. NUMA support is especially important for large VMs which necessarily span host NUMA nodes on all modern hardware. We will show how the virtual NUMA topology is chosen to closely match physical host topology, while preserving the now expected virtualization benefits of portability and load balancing. We show that the benefit of exposing the virtual NUMA topology can lead to performance gains of up to 167%. Overall, we will show close to native performance on applications from SPEC MPI V2.0 and SPEC OMP V3.2 benchmarks virtualized on our prototype VMware's ESXi Server.

[1]  Vishakha Gupta,et al.  High-Performance Hypervisor Architectures: Virtualization in HPC Systems , 2007 .

[2]  Karsten Schwan,et al.  vNUMA-mgr: Managing VM memory on NUMA platforms , 2010, 2010 International Conference on High Performance Computing.

[3]  Christian Engelmann,et al.  Proactive fault tolerance for HPC with Xen virtualization , 2007, ICS '07.

[4]  Alex Garthwaite,et al.  The evolution of an x86 virtual machine monitor , 2010, OPSR.

[5]  Richard Wolski,et al.  The impact of paravirtualized memory hierarchy on linear algebra computational kernels and software , 2008, HPDC '08.

[6]  Adit Ranadive,et al.  Performance implications of virtualizing multicore cluster machines , 2008, HPCVirt '08.

[7]  Orran Krieger,et al.  Virtualization for high-performance computing , 2006, OPSR.

[8]  vSphereTM Vmware® Vsphere™: the Cpu Scheduler in Vmware Esx® 4.1 Scalable Infrastructure with the Cpu Scheduler in Vmware Esx 4.1 , 2010 .

[9]  Hong Ong,et al.  System-Level Virtualization for High Performance Computing , 2008, 16th Euromicro Conference on Parallel, Distributed and Network-Based Processing (PDP 2008).

[10]  Joshua E. Simons,et al.  Virtualizing high performance computing , 2010, OPSR.

[11]  Carl A. Waldspurger,et al.  Memory resource management in VMware ESX server , 2002, OSDI '02.

[12]  Wei Huang,et al.  High performance virtual machine migration with RDMA over modern interconnects , 2007, 2007 IEEE International Conference on Cluster Computing.