On deciding between conservative and optimistic approaches on massively parallel platforms

Over 5000 publications on parallel discrete event simulation (PDES) have appeared in the literature to date. Nevertheless, few articles have focused on empirical studies of PDES performance on large supercomputer-based systems. This gap is bridged here, by undertaking a parameterized performance study on thousands of processor cores of a Blue Gene supercomputing system. In contrast to theoretical insights from analytical studies, our study is based on actual implementation in software, incurring the actual messaging and computational overheads for both conservative and optimistic synchronization approaches of PDES. Complex and counter-intuitive effects are uncovered and analyzed, with different event timestamp distributions and available levels of concurrency in the synthetic benchmark models. The results are intended to provide guidance to the PDES community in terms of how the synchronization protocols behave at high processor core counts using a state-of-the-art supercomputing systems.

[1]  David M. Nicol,et al.  Composite Synchronization in Parallel Discrete-Event Simulation , 2002, IEEE Trans. Parallel Distributed Syst..

[2]  Murat Yuksel,et al.  Seven-O'Clock: a new distributed GVT algorithm using network atomic operations , 2005, Workshop on Principles of Advanced and Distributed Simulation (PADS'05).

[3]  Sudip K. Seal,et al.  Reversible Parallel Discrete-Event Execution of Large-Scale Epidemic Outbreak Models , 2010, 2010 IEEE Workshop on Principles of Advanced and Distributed Simulation.

[4]  Fabian Gomes,et al.  Optimizing incremental state-saving and restoration , 1996 .

[5]  Fabian Gomes,et al.  A fast asynchronous GVT algorithm for shared memory multiprocessor architectures , 1995, PADS.

[6]  David R. Jefferson,et al.  Virtual time , 1985, ICPP.

[7]  Jeff S. Steinman,et al.  Breathing Time Warp , 1993, PADS '93.

[8]  Kalyan S. Perumalla,et al.  μπ: a scalable and transparent system for simulating MPI programs , 2010, SimuTools.

[9]  Friedemann Mattern,et al.  Efficient Algorithms for Distributed Snapshots and Global Virtual Time Approximation , 1993, J. Parallel Distributed Comput..

[10]  Susan Coghlan,et al.  Benchmarking the effects of operating system interference on extreme-scale parallel machines , 2008, Cluster Computing.

[11]  Christopher D. Carothers,et al.  Analysis of time warp on a 32,768 processor ibm blue Gene/L supercomputer , 2008 .

[12]  Richard M. Fujimoto,et al.  Computing global virtual time in shared-memory multiprocessors , 1997, TOMC.

[13]  K. Mani Chandy,et al.  Distributed Simulation: A Case Study in Design and Verification of Distributed Programs , 1979, IEEE Transactions on Software Engineering.

[14]  David F. Heidel,et al.  An Overview of the BlueGene/L Supercomputer , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[15]  Christopher D. Carothers,et al.  Efficient optimistic parallel simulations using reverse computation , 1999, Workshop on Parallel and Distributed Simulation.

[16]  Alan Weiss,et al.  An analysis of rollback-based simulation , 1991, TOMC.

[17]  John G. Cleary,et al.  Scheduling critical channels in conservative parallel discrete event simulation , 1999, Proceedings Thirteenth Workshop on Parallel and Distributed Simulation. PADS 99. (Cat. No.PR00155).

[18]  Richard J. Lipton,et al.  T ime Warp vs. Chandy-Misra: A worst-case comparison , 1990 .

[19]  Richard M. Fujimoto,et al.  Virtual time synchronization over unreliable network transport , 2001, Proceedings 15th Workshop on Parallel and Distributed Simulation.

[20]  Kalyan S. Perumalla Scaling time warp-based discrete event execution to 104 processors on a Blue Gene supercomputer , 2007, CF '07.

[21]  Steven F. Bellenot State skipping performance with the time warp operating system , 1991 .

[22]  David M. Nicol,et al.  Analysis of bounded time warp and comparison with YAWNS , 1996, TOMC.

[23]  Randal E. Bryant,et al.  SIMULATION OF PACKET COMMUNICATION ARCHITECTURE COMPUTER SYSTEMS , 1977 .

[24]  Paul F. Reynolds,et al.  Disseminating critical target-specific synchronization information in parallel discrete event simulations , 1993, PADS '93.

[25]  Christopher D. Carothers,et al.  Scalable Time Warp on Blue Gene Supercomputers , 2009, 2009 ACM/IEEE/SCS 23rd Workshop on Principles of Advanced and Distributed Simulation.

[26]  Philip A. Wilsey,et al.  pGVT: an algorithm for accurate GVT estimation , 1994, PADS '94.