Performance evaluation of parallel systems

This paper reviews the performance evaluation methodologies that have been applied to the analysis of parallel systems, together with the performance metrics specific to them. We concentrate on a few selected performance studies of parallel system components: processor, memory, interconnection network, input/output, and operating system. We demonstrate how these methodologies help identify system bottlenecks, forecast performance, and guide the design of future systems.
