Virtual-machine-based emulation of future generation high-performance computing systems

This paper describes the design of a system to enable research, development, and testing of new software stacks and hardware features for future high-end computing systems. Motivating uses include both small-scale research and development on simulated individual nodes of proposed high-performance computing systems, and large scaling studies that emulate a sizeable fraction of a future supercomputing system. The proposed architecture combines system virtualization, architectural simulation, time dilation, and slack simulation to provide scalable emulation of hypothetical systems. Virtualization-based full-system measurement and monitoring tools are also included to aid in using the proposed system for co-design of high-performance computing system software and architectural features for future systems. Finally, this paper provides a description of the implementation strategy and status of the system.

[1]  Lixin Zhang,et al.  Mambo: a full system simulator for the PowerPC architecture , 2004, PERV.

[2]  Torsten Hoefler,et al.  AM++: A generalized active message framework , 2010, 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT).

[3]  Amin Vahdat,et al.  DieCast: Testing Distributed Systems with an Accurate Scale Model , 2008, TOCS.

[4]  Arthur B. Maccabe,et al.  Lightweight Online Performance Monitoring and Tuning with Embedded Gossip , 2009, IEEE Transactions on Parallel and Distributed Systems.

[5]  Peter A. Dinda,et al.  Minimal-overhead virtualization of a large scale supercomputer , 2011, VEE '11.

[6]  Jianwei Chen,et al.  Exploiting Simulation Slack to Improve Parallel Simulation Speed , 2009, 2009 International Conference on Parallel Processing.

[7]  Amin Vahdat,et al.  To infinity and beyond: time warped network emulation , 2005, SOSP '05.

[8]  Peter A. Dinda,et al.  Palacios and Kitten: New high performance operating systems for scalable virtualized and native supercomputing , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[9]  Arthur B. Maccabe,et al.  Light-weight application monitoring and tuning with embedded gossip , 2007, PERV.

[10]  Bernd Mohr,et al.  TAU: A Portable Parallel Program Analysis Environment for pC++ , 1994, CONPAR.

[11]  Standardview staff Author's biographies , 1997, STAN.

[12]  Somayeh Sardashti,et al.  The gem5 simulator , 2011, CARN.

[13]  R. M. Fujimoto,et al.  Parallel discrete event simulation , 1989, WSC '89.

[14]  Rolf Riesen,et al.  Designing and implementing lightweight kernels for capability computing , 2009 .

[15]  Keith D. Underwood,et al.  The structural simulation toolkit: exploring novel architectures , 2006, SC.

[16]  Jack J. Dongarra,et al.  A Portable Programming Interface for Performance Evaluation on Modern Processors , 2000, Int. J. High Perform. Comput. Appl..

[17]  Rolf Riesen,et al.  Instruction-level simulation of a cluster at scale , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[18]  Milo M. K. Martin,et al.  Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset , 2005, CARN.