Performance Modeling and Prediction of Parallel and Distributed Computing Systems: A Survey of the State of the Art

Performance is one of the key features of parallel and distributed computing systems. Therefore, in the past a significant research effort was invested in the development of approaches for performance modeling and prediction of parallel and distributed computing systems. In this paper we identify the trends, contributions, and drawbacks of the state of the art approaches. We describe a wide range of the performance modeling approaches that spans from the high-level mathematical modeling to the detailed instruction-level simulation. For each approach we describe how the program and machine are modeled and estimate the model development and evaluation effort, the efficiency, and the accuracy. Furthermore, we present an overall evaluation of the presented approaches

[1]  R.M. Fujimoto,et al.  Parallel and distributed simulation systems , 2001, Proceeding of the 2001 Winter Simulation Conference (Cat. No.01CH37304).

[2]  Laxmikant V. Kalé,et al.  Performance prediction using simulation of large-scale interconnection networks in POSE , 2005, Workshop on Principles of Advanced and Distributed Simulation (PADS'05).

[3]  Marc Snir,et al.  GETTING UP TO SPEED THE FUTURE OF SUPERCOMPUTING , 2004 .

[4]  Mary K. Vernon,et al.  Poems: end-to-end performance design of large parallel adaptive computational systems , 1998, WOSP '98.

[5]  R. Bagrodia,et al.  Parallel Simulation of Parallel File Systems and I/O Programs , 1997, ACM/IEEE SC 1997 Conference (SC'97).

[6]  John Donners Simulating planet Earth , 2007 .

[7]  Christopher J. Hughes,et al.  RSIM: Simulating Shared-Memory Multiprocessors with ILP Processors , 2002, Computer.

[8]  Adolfy Hoisie,et al.  Use of Predictive Performance Modeling during Large-scale System Installation , 2005, Parallel Process. Lett..

[9]  Thomas Fahringer,et al.  Performance Prophet: a performance modeling and prediction tool for parallel and distributed programs , 2005, 2005 International Conference on Parallel Processing Workshops (ICPPW'05).

[10]  Jack J. Dongarra An Overview of High Performance Computing and Challenges for the Future , 2008, VECPAR.

[11]  Todd M. Austin,et al.  The SimpleScalar tool set, version 2.0 , 1997, CARN.

[12]  Thomas Fahringer,et al.  UML based modeling of performance oriented parallel and distributed applications , 2002, Proceedings of the Winter Simulation Conference.

[13]  Vikram S. Adve,et al.  Compiler-supported simulation of highly scalable parallel applications , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[14]  Toru Sugiyama,et al.  The Earth Simulator Center , 2009 .

[15]  Fabrizio Petrini,et al.  Predictive Performance and Scalability Modeling of a Large-Scale Application , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[16]  Laxmikant V. Kalé,et al.  BigSim: a parallel simulator for performance prediction of extremely large parallel machines , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..