ELPA: A Parallel Solver for the Generalized Eigenvalue Problem

For symmetric (hermitian) (dense or banded) matrices the computation of eigenvalues and eigenvectors Ax = λBx is an important task, e.g. in electronic structure calculations. If a larger number of eigenvectors are needed, often direct solvers are applied. On parallel architectures the ELPA implementation has proven to be very efficient, also compared to other parallel solvers like EigenExa or MAGMA. The main improvement that allows better parallel efficiency in ELPA is the two-step transformation of dense to band to tridiagonal form. This was the achievement of the ELPA project. The continuation of this project has been targeting at additional improvements like allowing monitoring and autotuning of the ELPA code, optimizing the code for different architectures, developing curtailed algorithms for banded A and B, and applying the improved code to solve typical examples in electronic structure calculations. In this paper we will present the outcome of this project.

[1]  L. Kaufman,et al.  Squeezing the most out of eigenvalue solvers on high-performance computers , 1986 .

[2]  A Marek,et al.  The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science , 2014, Journal of physics. Condensed matter : an Institute of Physics journal.

[3]  Pavel Kůs,et al.  GPU Optimization of Large-Scale Eigenvalue Solver , 2017, Lecture Notes in Computational Science and Engineering.

[4]  W. Marsden I and J , 2012 .

[5]  Mathias Jacquelin,et al.  ELSI: A unified software interface for Kohn-Sham electronic structure solvers , 2017, Comput. Phys. Commun..

[6]  Thorsten Kurth,et al.  Eigensolver performance comparison on Cray XC systems , 2018, Concurr. Comput. Pract. Exp..

[7]  W. Kohn,et al.  Self-Consistent Equations Including Exchange and Correlation Effects , 1965 .

[8]  Bruno Lang,et al.  Cannon-type triangular matrix multiplication for the reduction of generalized HPD eigenproblems to standard form , 2020, Parallel Comput..

[9]  Lynn Elliot Cannon,et al.  A cellular computer to implement the kalman filter algorithm , 1969 .

[10]  R. Fischer,et al.  Doping of metal-organic frameworks with functional guest molecules and nanoparticles. , 2010, Topics in current chemistry.

[11]  P. Hohenberg,et al.  Inhomogeneous electron gas , 1964 .

[12]  Andreas Marek,et al.  Optimizations of the Eigensolvers in the ELPA Library , 2018, Parallel Comput..

[13]  Jack Dongarra,et al.  Parallel matrix transpose algorithms on distributed memory concurrent computers , 1993, Proceedings of Scalable Parallel Libraries Conference.

[14]  Bruno Lang Efficient Reduction of Banded Hermitian Positive Definite Generalized Eigenvalue Problems to Banded Standard Eigenvalue Problems , 2019, SIAM J. Sci. Comput..

[15]  C. R. Crawford,et al.  Reduction of a band-symmetric generalized eigenvalue problem , 1973, Commun. ACM.

[16]  Lukas Krämer,et al.  Parallel solution of partial symmetric eigenvalue problems from electronic structure calculations , 2011, Parallel Comput..

[17]  Hyuk-Jae Lee,et al.  Generalized Cannon's algorithm for parallel matrix multiplication , 1997, ICS '97.

[18]  Robert A. van de Geijn,et al.  Elemental: A New Framework for Distributed Memory Dense Matrix Computations , 2013, TOMS.

[19]  Thomas Huckle,et al.  Parallel eigenvalue computation for banded generalized eigenvalue problems , 2019, Parallel Comput..

[20]  Matthias Scheffler,et al.  Ab initio molecular simulations with numeric atom-centered orbitals , 2009, Comput. Phys. Commun..