Large-scale network simulation: leveraging the strengths of modern SMP-based compute clusters

Parallelization is crucial for efficient execution of large-scale network simulation. Today's computing clusters commonly used for that purpose are built from a large amount of multi-processor machines. The traditional approach to utilize all CPU cores in such a system is to partition the network and distribute the partitions to the cores. This, however, does not incorporate the presence of shared memory into the design, such that messages between partitions on the same computing node have to be serialized and synchronization becomes more complex. In this paper, we present an approach that combines the shared-memory parallelization scheme Horizon [9] with the standard approach to distributed simulation to leverage the strengths of today's computing clusters. To further reduce the synchronization overhead, we introduce a novel synchronization algorithm that takes domain knowledge into account to reduce the number of synchronization points. In a case study with a UMTS LTE model, we show that both contributions combined enable much higher scalability achieving almost linear speedup when simulating 1,536 LTE cells on 1,536 CPU cores.

[1]  R. M. Fujimoto,et al.  Parallel discrete event simulation , 1989, WSC '89.

[2]  Gregory K. Egan,et al.  Implementing MPI Based Portable Parallel Discrete Event Simulation Support in the OMNeT + + Framework , 2002 .

[3]  Christopher D. Carothers,et al.  Warp speed: executing time warp on 1,966,080 cores , 2013, SIGSIM-PADS.

[4]  David M. Nicol,et al.  Conservative Parallel Simulation of Continuous Time Markov Chains Using Uniformization , 1993, IEEE Trans. Parallel Distributed Syst..

[5]  Boris D. Lubachevsky,et al.  Efficient distributed event driven simulations of multiple-loop networks , 1988, SIGMETRICS '88.

[6]  John G. Cleary,et al.  Scheduling critical channels in conservative parallel discrete event simulation , 1999, Proceedings Thirteenth Workshop on Parallel and Distributed Simulation. PADS 99. (Cat. No.PR00155).

[7]  Nael B. Abu-Ghazaleh,et al.  Optimization of Parallel Discrete Event Simulator for Multi-core Systems , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[8]  Rajive L. Bagrodia,et al.  Path lookahead: a data flow view of PDES models , 1999, Proceedings Thirteenth Workshop on Parallel and Distributed Simulation. PADS 99. (Cat. No.PR00155).

[9]  Klaus Wehrle,et al.  Runtime efficient event scheduling in multi-threaded network simulation , 2011, SimuTools.

[10]  A. Varga,et al.  THE OMNET++ DISCRETE EVENT SIMULATION SYSTEM , 2003 .

[11]  K M Chandy,et al.  The Conditional-Event Approach to Distributed Simulation , 1989 .

[12]  R. Fujimoto,et al.  Parallel and distributed simulation , 1995, Winter Simulation Conference Proceedings, 1995..

[13]  Jason Liu,et al.  Hierarchical Composite Synchronization , 2012, 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation.

[14]  Hassan Rajaei,et al.  The local Time Warp approach to parallel simulation , 1993, PADS '93.

[15]  Matthias Pätzold,et al.  Stochastic Modeling and Simulation of Frequency-Correlated Wideband Fading Channels , 2007, IEEE Transactions on Vehicular Technology.

[16]  Klaus Wehrle,et al.  Know thy simulation model: analyzing event interactions for probabilistic synchronization in parallel simulations , 2012, SimuTools.

[17]  David M. Nicol,et al.  Composite Synchronization in Parallel Discrete-Event Simulation , 2002, IEEE Trans. Parallel Distributed Syst..

[18]  David M. Nicol Parallel discrete-event simulation of FCFS stochastic queueing networks , 1988, PPoPP 1988.

[19]  Rajive L. Bagrodia,et al.  Improving lookahead in parallel wireless network simulation , 1998, Proceedings. Sixth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (Cat. No.98TB100247).

[20]  David M. Nicol,et al.  Learning not to share , 2001, Workshop on Parallel and Distributed Simulation.

[21]  Klaus Wehrle,et al.  Expanding the Event Horizon in Parallelized Network Simulations , 2010, 2010 IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[22]  K. Mani Chandy,et al.  Distributed Simulation: A Case Study in Design and Verification of Distributed Programs , 1979, IEEE Transactions on Software Engineering.