Performance Prediction of HPC Applications on Intel Processors

It is commonly the case that a small number of widely used applications make up a large fraction of the workload of HPC centers. Predicting the performance of important applications running on specific processors enables HPC centers to design best performing system configurations and to insure good performance for the most popular applications on new systems. In the analyses presented in this paper we use applications that are widely used on current open science HPC systems. We characterize the performance of these applications across a spectrum of modern processors and then create a mathematical model to predict their behavior on possible future processors. The hardware sensitivity studies required to build the predictive model are carried out in an HPC cloud resource with bare metal access, and we describe the process and advantages of this approach in detail. We define and discuss the mathematical model that we have designed and compare the predicted performance of these codes with the empirical results obtained in different chips. Finally, we also use the model to estimate the efficiency of future chips. The results indicate that the model is able to estimate the performance of these codes with a relatively small error across a fairly wide spectrum of chips.

[1]  Wu Jigang,et al.  Practical techniques for performance estimation of processors , 2005, Fifth International Workshop on System-on-Chip for Real-Time Applications (IWSOC'05).

[2]  Ruay-Shiung Chang,et al.  A performance estimation model for high-performance computing on clouds , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[3]  Mark R. Fahey,et al.  User Environment Tracking and Problem Detection with XALT , 2014, 2014 First International Workshop on HPC User Support Tools.

[4]  Yanli Wang,et al.  Quantum ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009 .

[5]  Yuichi Inadomi,et al.  Performance prediction of large-scale parallell system and application using macro-level simulation , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[6]  Yen-Chen Liu,et al.  Knights Landing: Second-Generation Intel Xeon Phi Product , 2016, IEEE Micro.

[7]  D. van der Spoel,et al.  GROMACS: A message-passing parallel molecular dynamics implementation , 1995 .

[8]  Rod A. Fatoohi Performance evaluation of NSF application benchmarks on parallel systems , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[9]  Darren J. Kerbyson A look at application performance sensitivity to the bandwidth and latency of InfiniBand networks , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[10]  Martin Schulz,et al.  Modeling the Impact of Reduced Memory Bandwidth on HPC Applications , 2014, Euro-Par.

[11]  Xiaofeng Gao,et al.  Performance Sensitivity Studies for Strategic Applications , 2005, 2005 Users Group Conference (DOD-UGC'05).

[12]  Laxmikant V. Kalé,et al.  Understanding Application Performance via Micro-benchmarks on Three Large Supercomputers: Intrepid, Ranger and Jaguar , 2010, Int. J. High Perform. Comput. Appl..

[13]  Seyong Lee,et al.  COMPASS: A Framework for Automated Performance Modeling and Prediction , 2015, ICS.

[14]  G. Powers,et al.  A Description of the Advanced Research WRF Version 3 , 2008 .

[15]  Anurag Kumar,et al.  Performance Analysis and Scheduling of Stochastic Fork-Join Jobs in a Multicomputer System , 1993, IEEE Trans. Parallel Distributed Syst..

[16]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[17]  Wu Jigang,et al.  Performance Estimation: IPC , 2008, 2008 The 9th International Conference for Young Computer Scientists.

[18]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[19]  Jesús Labarta,et al.  A Framework for Performance Modeling and Prediction , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[20]  Dong Li,et al.  Quantifying Architectural Requirements of Contemporary Extreme-Scale Scientific Applications , 2013, PMBS@SC.