Divide-Conquer-Recombine: An Algorithmic Pathway toward Metascalability

We propose an extension of the divide-and-conquer (DC) algorithmic paradigm called divide-conquer-recombine (DCR) in order to develop O(N) applications that will continue to scale on future parallel supercomputing, i.e., making them metascalable (or "design once, scale on new architectures"). In DCR, the DC phase constructs globally informed local solutions, which in the recombine phase are synthesized into a global solution. Innovative recombination algorithms allow the synthesis of a variety of global properties in broad applications. To enable large spatial-scale molecular dynamics (MD) simulations, DCR-in-space is empowered by globally scalable and locally fast (GSLF) hybrid solvers based on spatial locality. In addition, DCR-in-time is used to predict long-time dynamics based on temporal locality, while utilizing space-time-ensemble parallelism (STEP). We have used DCR to perform quantum molecular dynamics (QMD) and reactive molecular dynamics (RMD) simulations that encompass unprecedented spatiotemporal scales. Our 50.3 million-atom QMD benchmark achieved a parallel efficiency of 0.984 and 50.5% of the peak floating-point performance on 786,432 IBM Blue Gene/Q cores. Production QMD simulation involving 16,661 atoms for 21,140 time steps (or 129,208 self-consistent-field iterations) revealed a novel nanostructural design for on-demand hydrogen production from water, advancing renewable energy technologies. Nonadiabatic QMD simulation of photoexcitation dynamics involving 6,400 atoms reached the experimental time scales, elucidating molecular mechanisms of a novel singlet-fission phenomenon to realize low-cost, high-efficiency solar cells. Our billion-atom RMD simulation revealed the role of focused nanojet for the damage of solid surface caused by shock-induced collapse of nanobubbles in water, and suggested how to mitigate the damage by filling the bubble with inert gas.

[1]  Vijay S. Pande,et al.  Everything you wanted to know about Markov State Models but were afraid to ask. , 2010, Methods.

[2]  A. Nakano,et al.  Multiresolution molecular dynamics algorithm for realistic materials modeling on parallel computers , 1994 .

[3]  R. Gutierrez,et al.  Charge migration in organic materials: Can propagating charges affect the key physical quantities controlling their motion? , 2012, 1201.4652.

[4]  G. Henkelman,et al.  Improved tangent estimate in the nudged elastic band method for finding minimum energy paths and saddle points , 2000 .

[5]  Aiichiro Nakano,et al.  Nanoscopic mechanisms of singlet fission in amorphous molecular solid , 2013 .

[6]  Nir Goldman,et al.  Catalytic behaviour of dense hot water. , 2009, Nature chemistry.

[7]  Rajiv K. Kalia,et al.  A scalable parallel algorithm for dynamic range-limited n-tuple computation in many-body molecular dynamics simulation , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[8]  C. Wilmer,et al.  Large-scale screening of hypothetical metal-organic frameworks. , 2012, Nature chemistry.

[9]  John Shalf,et al.  The International Exascale Software Project roadmap , 2011, Int. J. High Perform. Comput. Appl..

[10]  Aiichiro Nakano,et al.  Parallel multilevel preconditioned conjugate-gradient approach to variable-charge molecular dynamics , 1997 .

[11]  Priya Vashishta,et al.  A divide-conquer-recombine algorithmic paradigm for large spatiotemporal quantum molecular dynamics simulations. , 2014, The Journal of chemical physics.

[12]  Priya Vashishta,et al.  Molecular dynamics simulations of rapid hydrogen production from water using aluminum clusters as catalyzers. , 2010, Physical review letters.

[13]  S. L. Dixon,et al.  Fast, accurate semiempirical molecular orbital calculations for macromolecules , 1997 .

[14]  Rajiv K. Kalia,et al.  A scalable parallel algorithm for large-scale reactive force-field molecular dynamics simulations , 2008, Comput. Phys. Commun..

[15]  Martin Head-Gordon,et al.  Derivation and efficient implementation of the fast multipole method , 1994 .

[16]  Koji Ando,et al.  Fragment molecular orbital study on electron tunneling mechanisms in bacterial photosynthetic reaction center. , 2012, The journal of physical chemistry. B.

[17]  G. Lu,et al.  First-principles simulations of exciton diffusion in organic semiconductors , 2011 .

[18]  Takao Otsuka,et al.  Trimer effects in fragment molecular orbital-linear combination of molecular orbitals calculation of one-electron orbitals for biomolecules. , 2013, The Journal of chemical physics.

[19]  Arthur F Voter,et al.  Parallel replica dynamics with a heterogeneous distribution of barriers: application to n-hexadecane pyrolysis. , 2004, The Journal of chemical physics.

[20]  Rajiv K. Kalia,et al.  Scalable and portable implementation of the fast multipole method on parallel computers , 2003 .

[21]  Priya Vashishta,et al.  Hydrogen-on-demand using metallic alloy nanoparticles in water. , 2014, Nano letters.

[22]  A. Nakano,et al.  Molecular control of photoexcited charge transfer and recombination at a quaterthiophene/zinc oxide interface , 2012 .

[23]  Adri C. T. van Duin,et al.  Connectivity-Based Parallel Replica Dynamics for Chemically Reactive Systems: From Femtoseconds to Microseconds , 2013 .

[24]  Aiichiro Nakano,et al.  Pathfinder: A parallel search algorithm for concerted atomistic events , 2007, Comput. Phys. Commun..

[25]  Paul Messina,et al.  Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[26]  Shigenori Tanaka,et al.  Statistical correction to effective interactions in the fragment molecular orbital method , 2013 .

[27]  V. Pande,et al.  Calculation of the distribution of eigenvalues and eigenvectors in Markovian state models for molecular dynamics. , 2007, The Journal of chemical physics.

[28]  Aiichiro Nakano,et al.  A space-time-ensemble parallel nudged elastic band algorithm for molecular kinetics simulation , 2008, Comput. Phys. Commun..

[29]  A. Voter Parallel replica method for dynamics of infrequent events , 1998 .

[30]  Wilfried N. Gansterer,et al.  An extension of the divide-and-conquer method for a class of symmetric block-tridiagonal eigenproblems , 2002, TOMS.

[31]  Priya Vashishta,et al.  Nanobubble collapse on a silica surface in water: billion-atom reactive molecular dynamics simulations. , 2013, Physical review letters.

[32]  Laxmikant V. Kalé,et al.  NAMD: Biomolecular Simulation on Thousands of Processors , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[33]  Weiqiang Wang,et al.  A metascalable computing framework for large spatiotemporal-scale atomistic simulations , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[34]  John L. Klepeis,et al.  Anton, a special-purpose machine for molecular dynamics simulation , 2007, ISCA '07.

[35]  Kiyoyuki Terakura,et al.  Molecular orbital calculation of biomolecules with fragment molecular orbitals , 2009 .

[36]  Masato Kobayashi,et al.  Extension of linear-scaling divide-and-conquer-based correlation method to coupled cluster theory with singles and doubles excitations. , 2008, The Journal of chemical physics.

[37]  P. Wipf,et al.  Stochastic voyages into uncharted chemical space produce a representative library of all possible drug-like compounds. , 2013, Journal of the American Chemical Society.

[38]  A. Voter,et al.  Extending the Time Scale in Atomistic Simulation of Materials Annual Re-views in Materials Research , 2002 .

[39]  Taisuke Ozaki O(N) Krylov-subspace method for large-scale ab initio electronic structure calculations , 2006 .

[40]  Subhash Saini,et al.  Scalable Atomistic Simulation Algorithms for Materials Research , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[41]  A. Nakano,et al.  Divide-and-conquer density functional theory on hierarchical real-space grids: Parallel implementation and applications , 2008 .

[42]  Rajiv K. Kalia,et al.  Embedded divide-and-conquer algorithm on hierarchical real-space grids: parallel molecular dynamics simulation based on linear-scaling density functional theory , 2005, Comput. Phys. Commun..

[43]  Kaori Fukuzawa,et al.  Modeling of peptide–silica interaction based on four-body corrected fragment molecular orbital (FMO4) calculations , 2013 .

[44]  W. Kohn,et al.  Nearsightedness of electronic matter. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[45]  J. Cuppen A divide and conquer method for the symmetric tridiagonal eigenproblem , 1980 .

[46]  Ryo Kobayashi,et al.  Linear scaling algorithm of real-space density functional theory of electrons with correlated overlapping domains , 2012, Comput. Phys. Commun..

[47]  Zhongwei Zhao,et al.  Hydrogen generation by splitting water with Al–Li alloys , 2013 .

[48]  Leslie Greengard,et al.  A fast algorithm for particle simulations , 1987 .

[49]  Yang,et al.  Direct calculation of electron density in density-functional theory. , 1991, Physical review letters.