On the achievement of high fidelity and scalability for large‐scale diagonalizations in grid‐based DFT simulations

Recent advance in high performance computing (HPC) resources has opened the possibility to expand the scope of density functional theory (DFT) simulations toward large and complex molecular systems. This work proposes a numerically robust method that enables scalable diagonalizations of large DFT Hamiltonian matrices, particularly with thousands of computing CPUs (cores) that are usual these days in terms of sizes of HPC resources. The well-known Lanczos method is extensively refactorized to overcome its weakness for evaluation of multiple degenerate eigenpairs that is the substance of DFT simulations, where a multilevel parallelization is adopted for scalable simulations in as many cores as possible. With solid benchmark tests for realistic molecular systems, the fidelity of our method are validated against the locally optimal block preconditioned conjugated gradient (LOBPCG) method that is widely used to simulate electronic structures. Our method may waste computing resources for simulations of molecules whose degeneracy cannot be reasonably estimated. But, compared to LOBPCG method, it is fairly excellent in perspectives of both speed and scalability, and particularly has remarkably less (< 10%) sensitivity of performance to the random nature of initial basis vectors. As a promising candidate for solving electronic structures of highly degenerate systems, the proposed method can make a meaningful contribution to migrating DFT simulations toward extremely large computing environments that normally have more than several tens of thousands of computing cores.

[1]  W. Kim,et al.  Update to ACE‐molecule: Projector augmented wave method on lagrange‐sinc basis set , 2016 .

[2]  N. A. Romero,et al.  Electronic structure calculations with GPAW: a real-space implementation of the projector augmented-wave method , 2010, Journal of physics. Condensed matter : an Institute of Physics journal.

[3]  Woo Youn Kim,et al.  Feature of Exact Exchange Kohn–Sham Orbitals with Krieger–Li–Iafrate Approximation# , 2015 .

[4]  Sang-Yeon Hwang,et al.  Configuration interaction singles based on the real-space numerical grid method: Kohn-Sham versus Hartree-Fock orbitals. , 2015, Physical chemistry chemical physics : PCCP.

[5]  Hong Zhang,et al.  Shift‐and‐invert parallel spectral transformation eigensolver: Massively parallel performance for density‐functional based tight‐binding , 2016, J. Comput. Chem..

[6]  A. Nakano,et al.  Divide-and-conquer density functional theory on hierarchical real-space grids: Parallel implementation and applications , 2008 .

[7]  Woo Youn Kim,et al.  Accuracy of Lagrange-sinc functions as a basis set for electronic structure calculations of atoms and molecules. , 2015, The Journal of chemical physics.

[8]  B. K. Bhagavan,et al.  Generalized eigenvalue problems: Lanczos algorithm with a recursive partitioning method , 2000 .

[9]  Seongok Ryu,et al.  Supersampling method for efficient grid-based electronic structure calculations. , 2016, The Journal of chemical physics.

[10]  Gerhard Klimeck,et al.  A tight-binding study of single-atom transistors. , 2015, Small.

[11]  Hoon Ryu,et al.  Atomistic study on dopant-distributions in realistically sized, highly P-doped Si nanowires. , 2015, Nano letters.

[12]  Chao Yang,et al.  Parallel eigenvalue calculation based on multiple shift-invert Lanczos and contour integral based spectral projection method , 2014, Parallel Comput..

[13]  S. Goedecker Linear scaling electronic structure methods , 1999 .

[14]  Tejs Vegge,et al.  Implementation of Constrained DFT for Computing Charge Transfer Rates within the Projector Augmented Wave Method. , 2016, Journal of chemical theory and computation.

[15]  Masato Kobayashi,et al.  Two‐level hierarchical parallelization of second‐order Møller–plesset perturbation calculations in divide‐and‐conquer method , 2011, J. Comput. Chem..

[16]  Thierry Deutsch,et al.  Challenges in large scale quantum mechanical calculations , 2016, 1609.00252.

[17]  C. Lanczos An iteration method for the solution of the eigenvalue problem of linear differential and integral operators , 1950 .

[18]  Y. Saad On the Rates of Convergence of the Lanczos and the Block-Lanczos Methods , 1980 .

[19]  B. Parlett,et al.  The Lanczos algorithm with selective orthogonalization , 1979 .

[20]  Ji-Hoon Kang,et al.  Time-efficient simulations of tight-binding electronic structures with Intel Xeon PhiTM many-core processors , 2016, Comput. Phys. Commun..

[21]  Franccois Bottin,et al.  Large scale ab initio calculations based on three levels of parallelization , 2007, 0707.3405.

[22]  C. Rajakumar,et al.  The Lanczos algorithm applied to unsymmetric generalized eigenvalue problem , 1991 .

[23]  M. E. Casida,et al.  Wavelet-Based Linear-Response Time-Dependent Density-Functional Theory , 2011, 1108.3475.

[24]  Andrew V. Knyazev,et al.  Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method , 2001, SIAM J. Sci. Comput..

[25]  Phanish Suryanarayana,et al.  SPARC: Accurate and efficient finite-difference formulation and parallel implementation of Density Functional Theory: Isolated clusters , 2016, Comput. Phys. Commun..

[26]  Daniel Jean Baye,et al.  Lagrange‐mesh method for quantum‐mechanical problems , 2006 .

[27]  Johnson,et al.  Modified Broyden's method for accelerating convergence in self-consistent calculations. , 1988, Physical review. B, Condensed matter.

[28]  Xavier Andrade,et al.  Real-space grids and the Octopus code as tools for the development of new simulation approaches for electronic systems. , 2015, Physical chemistry chemical physics : PCCP.

[29]  J. Burdett,et al.  Orbital Interactions in Chemistry: Albright/Orbital Interactions in Chemistry , 2013 .

[30]  Paweł Sałek,et al.  Linear-scaling implementation of molecular electronic self-consistent field theory. , 2007, The Journal of chemical physics.

[31]  Jaechang Lim,et al.  Improvement of initial guess via grid-cutting for efficient grid-based density functional calculations , 2016 .

[32]  Jianwei Sun,et al.  Accurate first-principles structures and energies of diversely bonded systems from an efficient density functional. , 2016, Nature chemistry.

[33]  P. Pulay Convergence acceleration of iterative sequences. the case of scf iteration , 1980 .

[34]  Jaechang Lim,et al.  Outstanding performance of configuration interaction singles and doubles using exact exchange Kohn-Sham orbitals in real-space numerical grid method. , 2016, The Journal of chemical physics.

[35]  Richard B. Lehoucq,et al.  Anasazi software for the numerical solution of large-scale eigenvalue problems , 2009, TOMS.

[36]  U. Hetmaniuk,et al.  A comparison of eigensolvers for large‐scale 3D modal analysis using AMG‐preconditioned iterative methods , 2005 .

[37]  C. Y. Fong,et al.  Real-space local polynomial basis for solid-state electronic-structure calculations: A finite-element approach , 1999, cond-mat/9903313.

[38]  A. Pecchia,et al.  A comprehensive study of popular eigenvalue methods employed for quantum calculation of energy eigenstates in nanostructures using GPUs , 2015 .

[39]  Luigi Genovese,et al.  A wavelet-based Projector Augmented-Wave (PAW) method: Reaching frozen-core all-electron precision with a systematic, adaptive and localized wavelet basis set , 2016, Comput. Phys. Commun..

[40]  G. Klimeck,et al.  Eigenvalue solvers for atomistic simulations of electronic structures with NEMO-3D , 2008 .

[41]  Burke,et al.  Generalized Gradient Approximation Made Simple. , 1996, Physical review letters.