We investigate the effects of shared resources for high-performance computing in a commercial cloud environment where multiple virtual machines share a single hardware node. Although good performance is occasionally obtained, contention degrades the expected performance and introduces significant variance. Using the DGEMM kernel and the HPL benchmark, we show that the underutilization of resources considerably improves expected performance by reducing contention for the CPU and cache space. For instance, for some cluster configurations, the solution is reached almost an order of magnitude earlier on average when the available resources are underutilized. The performance benefits for single node computations are even more impressive: Underutilization improves the expected execution time by two orders of magnitude. Finally, in contrast to unshared clusters, extending underutilized clusters by adding more nodes often improves the execution time due to an increased parallelism even with a slow interconnect. In the best case, by underutilizing the nodes performance was improved enough to entirely offset the cost of an extra node in the cluster.
[1]
Alexander Stage,et al.
An economic decision model for business software application deployment on hybrid Cloud environments
,
2010,
MKWI.
[2]
Edward Walker,et al.
Benchmarking Amazon EC2 for High-Performance Scientific Computing
,
2008,
login Usenix Mag..
[3]
T. S. Eugene Ng,et al.
The Impact of Virtualization on Network Performance of Amazon EC2 Data Center
,
2010,
2010 Proceedings IEEE INFOCOM.
[4]
Paolo Bientinesi,et al.
HPC on Competitive Cloud Resources
,
2010,
Handbook of Cloud Computing.
[5]
Robert H. Halstead,et al.
Matrix Computations
,
2011,
Encyclopedia of Parallel Computing.
[6]
Paolo Bientinesi,et al.
Can cloud computing reach the top500?
,
2009,
UCHPC-MAW '09.
[7]
Robert A. van de Geijn,et al.
Scalability Issues Affecting the Design of a Dense Linear Algebra Library
,
1994,
J. Parallel Distributed Comput..
[8]
Gene H. Golub,et al.
Matrix computations (3rd ed.)
,
1996
.