In Time Warp parallel simulation, a process executes every message as soon as it arrives. If a message with a smaller timestamp subsequently arrives, the process rolls back its state to the time of the earlier message and re-executes from that point. Clearly, the state of each process must be saved (checkpointed) regularly in case a rollback is necessary. Although most existing Time Warp implementations checkpoint after every state transition, this is not necessary, and the checkpoint interval is in reality a tuning parameter of the simulation. Lin and Lazowska[7] proposed a model to derive the optimal checkpoint interval by assuming that the rollback behavior of Time Warp is not affected by the frequency of checkpointing. An experimental study conducted by Preiss et al.[11] indicates that the behavior of rollback is affected by the frequency of checkpointing in general, and that the Lin-Lazowska model may not reflect the real situations in general. This paper extends the Lin-Lazowska model to include the effect of the checkpoint interval on the rollback behavior. The relationship among the checkpoint interval, the rollback behavior, and the overhead associated with state saving and restoration is described. A checkpoint interval selection algorithm which quickly determines the optimal checkpoint interval during the execution of Time Warp simulation is proposed. Empirical results indicate that the algorithm converges quickly and always selects the optimal checkpoint interval.
[1]
Ganesh Gopalakrishnan,et al.
Design and Evaluation of the Rollback Chip: Special Purpose Hardware for Time Warp
,
1992,
IEEE Trans. Computers.
[2]
David R. Jefferson,et al.
Virtual time
,
1985,
ICPP.
[3]
Yi-Bing Lin,et al.
Understanding the limits of optimistic and conservative parallel simulation
,
1990
.
[4]
Steven F. Bellenot.
State skipping performance with the time warp operating system
,
1991
.
[5]
Richard M. Fujimoto,et al.
Optimistic approaches to parallel discrete event simulation
,
1990
.
[6]
Yi-Bing Lin,et al.
Optimality considerations of 'Time Warp' parallel simulation
,
1990
.
[7]
Bruno R. Preiss.
Performance of Discrete Event Simulation on a Multiprocessor Using Optimistic and Conservative Synchronization
,
1990,
ICPP.
[8]
Wayne M. Loucks,et al.
On the Trade-off between Time and Space in Optimistic Parallel Discrete-Event Simulation
,
1992
.
[9]
B. R. Preiss.
THE YADDES DISTRIBUTED DISCRETE EVENT SIMULATION SPECIFICATION LANGUAGE AND EXECUTION ENVIRONMENTS
,
1989
.
[10]
John W. Young,et al.
A first order approximation to the optimum checkpoint interval
,
1974,
CACM.
[11]
David M. Nicol,et al.
Parallel discrete-event simulation of FCFS stochastic queueing networks
,
1988,
PPEALS '88.