Prediction of Time-to-Solution in Material Science Simulations Using Deep Learning

Predicting the time to solution for massively parallel scientific codes is a complex task. The reason for this is the presence of multiple, strongly interconnected algorithms that possibly react differently to the changes in compute power, vectorization length, memory and network bandwidth and latency and I/O throughput. A reliable prediction of execution time is however of great importance to the user who wants to plan on large scale simulations or virtual screening procedures characteristic of high throughput computing. In this article we present a practical approach based on machine learning techniques to achieve very accurate predictions of the time to solution for a DFT-based material science code. We compare our results with the predictions provided by a parametrized analytical performance model showing that deep learning solutions allow for a greater accuracy without the need of domain knowledge to introduce an explicit description of the algorithms implemented in the code.

[1]  P. Schwaller,et al.  Two-dimensional materials from high-throughput computational exfoliation of experimentally known compounds , 2016, Nature Nanotechnology.

[2]  W. Kohn,et al.  Self-Consistent Equations Including Exchange and Correlation Effects , 1965 .

[3]  Stefano de Gironcoli,et al.  Advanced capabilities for materials modelling with Quantum ESPRESSO , 2017, Journal of physics. Condensed matter : an Institute of Physics journal.

[4]  Stefano de Gironcoli,et al.  QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009, Journal of physics. Condensed matter : an Institute of Physics journal.

[5]  E. Davidson The iterative calculation of a few of the lowest eigenvalues and corresponding eigenvectors of large real-symmetric matrices , 1975 .

[6]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[7]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[8]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[9]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[10]  Seyong Lee,et al.  COMPASS: A Framework for Automated Performance Modeling and Prediction , 2015, ICS.

[11]  Luca Benini,et al.  COUNTDOWN: a run-time library for application-agnostic energy saving in MPI communication primitives , 2018, ANDARE '18.

[12]  Luca Benini,et al.  COUNTDOWN - three, two, one, low power! A Run-time Library for Energy Saving in MPI Communication Primitives , 2018, ArXiv.

[13]  D. Vanderbilt,et al.  Soft self-consistent pseudopotentials in a generalized eigenvalue formalism. , 1990, Physical review. B, Condensed matter.

[14]  Richard M. Martin Electronic Structure: Frontmatter , 2004 .

[15]  Maryam Amiri,et al.  Survey on prediction models of applications for resources provisioning in cloud , 2017, J. Netw. Comput. Appl..

[16]  G. Kresse,et al.  From ultrasoft pseudopotentials to the projector augmented-wave method , 1999 .

[17]  T. Arias,et al.  Iterative minimization techniques for ab initio total energy calculations: molecular dynamics and co , 1992 .

[18]  Paolo Missier,et al.  Predicting the Execution Time of Workflow Activities Based on Their Input Features , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[19]  Daniyal M. Alghazzawi,et al.  Modeling and predicting execution time of scientific workflows in the Grid using radial basis function neural network , 2017, Cluster Computing.

[20]  Stephen A. Jarvis,et al.  WARPP: a toolkit for simulating high-performance parallel scientific codes , 2009, SimuTools.

[21]  K. Schwarz,et al.  WIEN2k: An Augmented Plane Wave Plus Local Orbitals Program for Calculating Crystal Properties , 2019 .

[22]  R. Martin,et al.  Electronic Structure: Basic Theory and Practical Methods , 2004 .

[23]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[24]  Allan,et al.  Solution of Schrödinger's equation for large systems. , 1989, Physical review. B, Condensed matter.

[25]  Kresse,et al.  Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set. , 1996, Physical review. B, Condensed matter.