Efficiency Analysis of Intel and AMD x86_64 Architectures for Ab Initio Calculations: A Case Study of VASP

Nowadays, the wide spectrum of Intel Xeon processors is available. The new Zen CPU architecture developed by AMD has extended the number of options for x86_64 HPC hardware. This large number of options makes the optimal CPU choice for HPC systems not a straightforward procedure. Such a co-design procedure should follow the requests from the end-users community. Modern computational materials science studies are among the major consumers of HPC resources worldwide. The VASP code is perhaps the most popular tool for these research. In this work, we discuss the benchmark metric and results based on a VASP test model that give us the possibility to compare different CPUs and to select best options with respect to time-to-solution and energy-to-solution criteria.

[1]  Vladimir V. Stegailov,et al.  HPC Hardware Efficiency for Quantum and Classical Molecular Dynamics , 2015, PaCT.

[2]  Kari Laasonen Ab initio molecular dynamics. , 2013, Methods in molecular biology.

[3]  G. Kresse,et al.  Ab initio molecular dynamics for liquid metals. , 1993 .

[4]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.

[5]  Zhengji Zhao,et al.  Estimating the Performance Impact of the MCDRAM on KNL Using Dual-Socket Ivy Bridge Nodes on Cray XC 30 , 2016 .

[6]  Torsten Hoefler,et al.  Scientific Benchmarking of Parallel Computing Systems Twelve ways to tell the masses when reporting performance results , 2017 .

[7]  Leonel Sousa,et al.  Energy‐aware mechanism for stencil‐based MPDATA algorithm with constraints , 2017, Concurr. Comput. Pract. Exp..

[8]  John Shalf,et al.  Exascale Computing Trends: Adjusting to the "New Normal"' for Computer Architecture , 2013, Computing in Science & Engineering.

[9]  Hafner,et al.  Ab initio molecular-dynamics simulation of the liquid-metal-amorphous-semiconductor transition in germanium. , 1994, Physical review. B, Condensed matter.

[10]  Martin Fechner,et al.  Best bang for your buck: GPU nodes for GROMACS biomolecular simulations , 2015, J. Comput. Chem..

[11]  G. Kresse,et al.  Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set , 1996 .

[12]  Raffaele Tripiccione,et al.  Energy-Performance Tradeoffs for HPC Applications on Low Power Processors , 2015, Euro-Par Workshops.

[13]  Vladimir V. Stegailov,et al.  Efficiency of the Tegra K1 and X1 systems-on-chip for classical molecular dynamics , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[14]  Vladimir V. Stegailov,et al.  GPU-Accelerated Molecular Dynamics: Energy Consumption and Performance , 2016 .

[15]  Fabiano Corsetti,et al.  Performance Analysis of Electronic Structure Codes on HPC Systems: A Case Study of SIESTA , 2014, PloS one.

[16]  Mariana Rossi,et al.  Ab initio molecular dynamics , 2013 .

[17]  James C. Browne,et al.  Enhancing Performance Optimization of Multicore/Multichip Nodes with Data Structure Metrics , 2014, ACM Trans. Parallel Comput..

[18]  Vladimir V. Stegailov,et al.  Floating-point performance of ARM cores and their efficiency in classical molecular dynamics , 2016 .

[19]  Lars Koesterke,et al.  PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[20]  Thomas Scogland,et al.  Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[21]  Lucas Mello Schnorr,et al.  Characterizing the Performance of Modern Architectures Through Opaque Benchmarks: Pitfalls Learned the Hard Way , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).