Performance characteristics of biomolecular simulations on high-end systems with multi-core processors

Biological processes occurring inside cell involve multiple scales of time and length; many popular theoretical and computational multi-scale techniques utilize biomolecular simulations based on molecular dynamics. Till recently, the computing power required for simulating the relevant scales was even beyond the reach of fastest supercomputers. The availability of petaFLOPS-scale computing power in near future holds great promise. Unfortunately, the biosimulations software technology has not kept up with the changes in hardware. In particular, with the introduction of multi-core processing technologies in systems with tens of thousands of processing cores, it is unclear whether the existing biomolecular simulation frameworks will be able to scale and to utilize these resources effectively. While the multi-core processing systems provide higher processing capabilities, their memory and network subsystems are posing new challenges to application and system software developers. In this study, we attempt to characterize computation, communication and memory efficiencies of biomolecular simulations on Teraflops-scale Cray XT systems, which contain dual-core Opteron processors. We identify that the application efficiencies using the multi-core processors reduce with the increase of the simulated system size. Further, we measure the communication overhead of using both cores in the processor simultaneously and identify that the slowdown in the MPI communication performance can significantly lower the achievable performance in the dual-core execution mode. We conclude that not only the biomolecular simulations need to be aware of the underlying multi-core hardware in order to achieve maximum performance but also the system software needs to provide processor and memory placement features in the high-end systems. Our results on stand-alone multi-core AMD and Intel systems confirm that combinations of processor and memory affinity schemes cause significant performance variations for our target test cases.

[1]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[2]  Franz Franchetti,et al.  Large-Scale First-Principles Molecular Dynamics simulations on the BlueGene/L Platform using the Qbox code , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[3]  Stephen L. Olivier,et al.  Porting the GROMACS Molecular Dynamics Code to the Cell Processor , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[4]  P. Agarwal Role of protein dynamics in reaction rate enhancement by enzymes. , 2005, Journal of the American Chemical Society.

[5]  Laxmikant V. Kalé,et al.  Scalable molecular dynamics with NAMD , 2005, J. Comput. Chem..

[6]  A. Geist,et al.  Protein dynamics and enzymatic catalysis: investigating the peptidyl-prolyl cis-trans isomerization activity of cyclophilin A. , 2004, Biochemistry.

[7]  Sadaf R. Alam,et al.  Characterization of Scientific Workloads on Systems with Multi-Core Processors , 2006, 2006 IEEE International Symposium on Workload Characterization.

[8]  Sadaf R. Alam,et al.  Performance characterization of molecular dynamics techniques for biomolecular simulations , 2006, PPoPP '06.

[9]  Sadaf R. Alam,et al.  An Evaluation of the Oak Ridge National Laboratory Cray XT3 , 2008, Int. J. High Perform. Comput. Appl..

[10]  Peter A. Kollman,et al.  AMBER, a package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules , 1995 .

[11]  Sadaf R. Alam,et al.  Using FPGA Devices to Accelerate Biomolecular Simulations , 2007, Computer.

[12]  Sadaf R. Alam,et al.  Biomolecular simulations on petascale: promises and challenges , 2006 .

[13]  P. Agarwal Enzymes: An integrated view of structure, dynamics and function , 2006, Microbial cell factories.

[14]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[15]  Laxmikant V. Kalé,et al.  Achieving strong scaling with NAMD on Blue Gene/L , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[16]  P. Agarwal Cis/trans isomerization in HIV‐1 capsid protein catalyzed by cyclophilin A: Insights from computational and theoretical studies , 2004, Proteins.

[17]  Daniel A. Menascé Workload Characterization , 2003, IEEE Internet Comput..