Early Experiences Porting the NAMD and VMD Molecular Simulation and Analysis Software to GPU-Accelerated OpenPOWER Platforms

All-atom molecular dynamics simulations of biomolecules provide a powerful tool for exploring the structure and dynamics of large protein complexes within realistic cellular environments. Unfortunately, such simulations are computationally very demanding, and they pose many challenges in preparation, simulation methodology, and the analysis and visualization of results. We describe our early experiences porting the popular molecular dynamics simulation program NAMD and the simulation preparation, analysis, and visualization tool VMD to GPU-accelerated OpenPOWER hardware platforms. We report our experiences with compiler-provided autovectorization and compare it with hand-coded vector intrinsics for the POWER8 CPU. We explore the performance benefits obtained from unique POWER8 architectural features such as 8-way SMT, and assess the value of those features for particular molecular modeling tasks. Finally, we evaluate the performance of several GPU-accelerated molecular modeling kernels and relate it to the performance of the same kernels on other hardware platforms.
