Time Warp cancellation optimizations on high latency networks

We investigate the performance of the time warp kernel APSIS when running on various communication layers, in particular on a wide-area grid. Several cancellation strategies are tried, among them the lazy cancellation and a little known bulk anti-messages optimization. Our experiments with an Ising spin simulation indicate that the slowdown caused by high latency networks, while significant, is not catastrophic; and that it can be significantly reduced using the lazy cancellation. Experiments suggest that further improvements can be expected if a more elaborate communication infrastructure is put in place.

[1]  Christopher D. Carothers,et al.  Effect of communication overheads on Time Warp performance: an experimental study , 1994, PADS '94.

[2]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[3]  Peter M. A. Sloot,et al.  Spatio-temporal correlations and rollback distributions in optimistic simulations , 2001, Proceedings 15th Workshop on Parallel and Distributed Simulation.

[4]  Christopher D. Carothers,et al.  Efficient Execution of Time Warp Programs on Heterogeneous, NOW Platforms , 2000, IEEE Trans. Parallel Distributed Syst..

[5]  R.M. Fujimoto,et al.  Parallel and distributed simulation systems , 2001, Proceeding of the 2001 Winter Simulation Conference (Cat. No.01CH37304).

[6]  Stephen John Turner,et al.  Performance analysis of packet bundling techniques in DIS , 1999, Proceedings 3rd IEEE International Workshop on Distributed Interactive Simulation and Real-Time Applications.

[7]  Divyakant Agrawal,et al.  Replicated objects in time warp simulations , 1992, WSC '92.

[8]  Carl Tropper,et al.  On Process Migration and Load Balancing in Time Warp , 1993, IEEE Trans. Parallel Distributed Syst..

[9]  Nael B. Abu-Ghazaleh,et al.  Optimizing communication in time-warp simulators , 1998, Workshop on Parallel and Distributed Simulation.

[10]  David R. Jefferson,et al.  Virtual time , 1985, ICPP.

[11]  Simonetta Balsamo,et al.  Rollback overhead reduction methods for time warp distributed simulation , 1998, Simul. Pract. Theory.

[12]  B. J. Overeinder,et al.  Distributed Event-driven Simulation - Scheduling Strategies and Resource Management , 2000 .

[13]  Vijay K. Garg,et al.  Fault-tolerant distributed simulation , 1998, Workshop on Parallel and Distributed Simulation.

[14]  Azzedine Boukerche,et al.  Distributed simulation over loosely coupled domains , 2000, Proceedings Fourth IEEE International Workshop on Distributed Simulation and Real-Time Applications (DS-RT 2000).

[15]  Peter M. A. Sloot,et al.  The distributed ASCI Supercomputer project , 2000, OPSR.