An out-of-core implementation of the COLUMBUS massively-parallel multireference configuration interaction program

In this paper, we describe a novel parallelization approach we developed to solve the largest multireference configuration interaction (MRCI) problem ever attempted. From the mathematical perspective, the program solves the eigenvalue problem for a very large, sparse, symmetric Hamilton matrix. Using an out-of-core approach, shared memory programming model, improved data compression algorithms, and dynamic load balancing we were able to solve a problem six times larger than previously reported. The potential curve for the chromium dimer was calculated with a Hamilton matrix of dimension 1.3 billion (1,295,937,374). This task involved moving 1.5 terabytes of data between main memory and secondary storage per MRCI iteration. Furthermore, by employing Active Messages and user-level striping to combine multiple files on local disks on the IBM SP into a single logically-shared file, the execution time of the program was reduced by a factor of three, as compared to our initial implementation on top of the IBM PIOFS parallel filesystem.

[1]  J. Hinze,et al.  The Unitary group for the evaluation of electronic energy matrix elements , 1981 .

[2]  Robert J. Harrison,et al.  Performance and experience with LAPI-a new high-performance communication library for the IBM RS/6000 SP , 1998, Proceedings of the First Merged International Parallel Processing Symposium and Symposium on Parallel and Distributed Processing.

[3]  Ian T. Foster,et al.  Distant I/O: one-sided access to secondary storage on remote processors , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[4]  Harry Partridge,et al.  Cr 2 revisited , 1994 .

[5]  Holger Dachsel,et al.  An efficient data compression method for the Davidson subspace diagonalization scheme , 1995 .

[6]  Russell M. Pitzer,et al.  A progress report on the status of the COLUMBUS MRCI program system , 1988 .

[7]  Rodney J. Bartlett,et al.  Approximately extensive modifications of the multireference configuration interaction method: A theoretical and practical analysis , 1995 .

[8]  Emily A. Carter,et al.  Pseudospectral correlation methods on distributed memory parallel architectures , 1995 .

[9]  Hermann Stoll,et al.  The Cr2 potential curve: a multireference pair functional treatment , 1996 .

[10]  Ian T. Foster,et al.  ChemIo: High Performance Parallel I/o for Computational Chemistry Applications , 1998, Int. J. High Perform. Comput. Appl..

[11]  Roberto Ansaloni,et al.  A full CI algorithm on the CRAY T3D. Application to the NH3 molecule , 1995 .

[12]  Isaiah Shavitt,et al.  The Graphical Unitary Group Approach and Its Application to Direct Configuration Interaction Calculations , 1981 .

[13]  Hans Lischka,et al.  Implementation of an electronic structure program system on the CYBER 205 , 1985 .

[14]  E. Davidson The iterative calculation of a few of the lowest eigenvalues and corresponding eigenvectors of large real-symmetric matrices , 1975 .