Comparison of message aggregation strategies for parallel simulations on a high performance cluster

Parallel simulations of fine grain applications usually generate a large amount of messages. The overhead for sending these messages over an interconnection network can dramatically limit the speedup of a parallel simulation. In this case, message aggregation techniques can increase the granularity of the application and reduce the communication overhead. This paper compares sender-initiated and receiver-initiated message aggregation strategies and addresses their performance when applied to parallel simulations of high-speed communication network models on a high performance cluster. Receiver-initiated strategies were initially motivated by the cyclic behavior of the parallel simulation kernel and by the capabilities of the test-bed environment to handle large messages up to 2000 bytes with small overheads.

[1]  Philip A. Wilsey,et al.  Adressing Comminication Latency Issues on Clusters for Fine Grained Asynchronous Applications - A Case Study , 1999, IPPS/SPDP Workshops.

[2]  CongDuc Pham,et al.  Optimizing message aggregation for parallel simulation on high performance clusters , 1999, MASCOTS '99. Proceedings of the Seventh International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[3]  Bernard Tourancheau,et al.  BIP: A New Protocol Designed for High Performance Networking on Myrinet , 1998, IPPS/SPDP Workshops.

[4]  Nael B. Abu-Ghazaleh,et al.  Optimizing communication in time-warp simulators , 1998, Workshop on Parallel and Distributed Simulation.

[5]  K. Mani Chandy,et al.  Distributed Simulation: A Case Study in Design and Verification of Distributed Programs , 1979, IEEE Transactions on Software Engineering.

[6]  Nael B. Abu-Ghazaleh,et al.  An Active Layer Extension to MPI , 1998, PVM/MPI.

[7]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.