Evaluation of Emerging Energy-Efficient Heterogeneous Computing Platforms for Biomolecular and Cellular Simulation Workloads

Many continuing scientific advances in computational biology depend on ongoing increases in the computational power required for detailed simulation and analysis of cellular processes on biologically relevant timescales. A critical challenge for future exascale supercomputer systems is the development of new computing hardware and associated scientific applications that dramatically improve on the energy efficiency of existing solutions while providing increased simulation, analysis, and visualization performance. Mobile computing platforms have recently become powerful enough to support interactive molecular visualization tasks that were previously possible only on laptops and workstations, creating opportunities for their convenient use in meetings and remote collaboration, and as head-mounted displays for immersive stereoscopic viewing. We describe early experiences adapting several biomolecular simulation and analysis applications to emerging heterogeneous computing platforms that combine power-efficient system-on-chip multi-core CPUs with high-performance massively parallel GPUs. We present low-cost power monitoring instrumentation that provides sufficient temporal resolution to evaluate the power consumption of individual CPU algorithms and GPU kernels. We compare the performance and energy efficiency of scientific applications running on emerging platforms with results obtained on traditional platforms, identify hardware and algorithmic performance bottlenecks that affect the usability of these platforms, and describe avenues for improving both the hardware and the applications to meet the needs of molecular modeling tasks on mobile devices and future exascale computers.
