Parallel Simulation of Large-Scale Parallel Applications

Accurate and efficient simulation of large parallel applications can be facilitated by combining direct execution with parallel discrete-event simulation. This paper describes MPI-SIM, a direct-execution-driven parallel simulator designed to predict the performance of existing MPI and MPI-IO applications. MPI-SIM can predict the performance of these programs as a function of architectural characteristics, including the number of processors, message communication latencies, caching algorithms, and alternative implementations of collective I/O operations. Results are presented that show the use of MPI-SIM in a scalability study of real-world applications. The benchmarks chosen for the study are Sweep3D, one of the ASCI benchmarks, and BTIO, an I/O-intensive benchmark from the NAS Parallel Benchmark suite. MPI-SIM is shown to predict the performance of Sweep3D running on an Origin 2000 accurately and efficiently. It is also used to demonstrate the impact of the number of I/O nodes on BTIO’s performance.
