Accelerating Molecular Structure Determination Based on Inter-Atomic Distances Using OpenCL

Fast and accurate determination of the 3D structure of molecules is essential for better understanding their physical, chemical, and biological properties. We focus on an existing method for molecular structure determination: restrained molecular dynamics with simulated annealing. In this method a hybrid function, composed by a physical model and experimental restraints, is minimized by simulated annealing. Our goal is to accelerate computation time using commodity multi-core CPUs and GPUs in a heterogeneous computing model. We present a parallel and portable OpenCL implementation of this method. Experimental results are discussed in terms of accuracy, execution time, and parallel scalability. With respect to the XPLOR-NIH professional software package, compared to the single CPU core implementation, we obtain speedups of three to five times (increasing with problem size) on commodity GPUs. We achieve these performances by writing specialized kernels for different problem sizes and hardware architectures.

[1]  Michael Griebel,et al.  Numerical Simulation in Molecular Dynamics: Numerics, Algorithms, Parallelization, Applications , 2007 .

[2]  Rajeev Raman,et al.  Converting to and from Dilated Integers , 2008, IEEE Transactions on Computers.

[3]  Diwakar Shukla,et al.  OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. , 2013, Journal of chemical theory and computation.

[4]  L. Verlet Computer "Experiments" on Classical Fluids. I. Thermodynamical Properties of Lennard-Jones Molecules , 1967 .

[5]  D. Baker,et al.  De novo protein structure generation from incomplete chemical shift assignments , 2009, Journal of biomolecular NMR.

[6]  Duncan Poole,et al.  Routine Microsecond Molecular Dynamics Simulations with AMBER on GPUs. 1. Generalized Born , 2012, Journal of chemical theory and computation.

[7]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculations , 1988, FEBS letters.

[8]  M. Nilges,et al.  Influence of non-bonded parameters on the quality of NMR structures: A new force field for NMR structure calculation , 1999, Journal of biomolecular NMR.

[9]  Christodoulos A. Floudas,et al.  A review of recent advances in global optimization , 2009, J. Glob. Optim..

[10]  Joshua A. Anderson,et al.  General purpose molecular dynamics simulations fully implemented on graphics processing units , 2008, J. Comput. Phys..

[11]  M. Karplus,et al.  Stochastic boundary conditions for molecular dynamics simulations of ST2 water , 1984 .

[12]  A. Bax,et al.  Dipolar couplings in macromolecular structure determination. , 2001, Methods in enzymology.

[13]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[14]  M Karplus,et al.  Three-dimensional structure of proteins determined by molecular dynamics with interproton distance restraints: application to crambin. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Vijay S. Pande,et al.  Efficient nonbonded interactions for molecular dynamics on a graphics processing unit , 2010, J. Comput. Chem..

[16]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[17]  Klaus Schulten,et al.  Adapting a message-driven parallel application to GPU-accelerated clusters , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[18]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[19]  Charles D Schwieters,et al.  The Xplor-NIH NMR molecular structure determination package. , 2003, Journal of magnetic resonance.

[20]  Q. Teng Structural Biology: Practical NMR Applications , 2005 .

[21]  G M Clore,et al.  Exploring the limits of precision and accuracy of protein structures determined by nuclear magnetic resonance spectroscopy. , 1993, Journal of molecular biology.

[22]  Klaus Schulten,et al.  GPU-accelerated molecular modeling coming of age. , 2010, Journal of molecular graphics & modelling.

[23]  Leo Liberti,et al.  Molecular distance geometry methods: from continuous to discrete , 2010, Int. Trans. Oper. Res..

[24]  Peng Wang,et al.  Implementing molecular dynamics on hybrid high performance computers - short range forces , 2011, Comput. Phys. Commun..

[25]  Leonard M. Blumenthal,et al.  Theory and applications of distance geometry , 1954 .

[26]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[27]  A T Brünger,et al.  Sampling and efficiency of metric matrix distance geometry: A novel partial metrization algorithm , 1992, Journal of biomolecular NMR.

[28]  Steven J. Plimpton,et al.  Implementing molecular dynamics on hybrid high performance computers - Particle-particle particle-mesh , 2012, Comput. Phys. Commun..

[29]  I. Lorentz,et al.  Solving molecular distance geometry problems in OpenCL , 2012, 2012 13th International Conference on Optimization of Electrical and Electronic Equipment (OPTIM).

[30]  M. Nilges,et al.  Sampling Properties of Simulated Annealing and Distance Geometry , 1991 .

[31]  Michael Habeck,et al.  Probabilistic Structure Calculation , 2007 .

[32]  W. Torgerson Multidimensional scaling: I. Theory and method , 1952 .

[33]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1983, Bulletin of Mathematical Biology.

[34]  A M Gronenborn,et al.  Determination of three-dimensional structures of proteins and nucleic acids in solution by nuclear magnetic resonance spectroscopy. , 1989, Critical reviews in biochemistry and molecular biology.

[35]  Henk Sips,et al.  Quantifying the performance impacts of using local memory for many-core processors , 2013, 2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS).

[36]  Michael Nilges,et al.  Materials and Methods Som Text Figs. S1 to S6 References Movies S1 to S5 Inferential Structure Determination , 2022 .

[37]  A. Brünger X-ray crystallography and NMR reveal complementary views of structure and dynamics. , 1997, Nature structural biology.

[38]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[39]  G. Marius Clore,et al.  Using Xplor-NIH for NMR molecular structure determination , 2006 .

[40]  N Go,et al.  Calculation of protein conformations by proton-proton distance constraints. A new efficient algorithm. , 1985, Journal of molecular biology.

[41]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[42]  Lu-Yun Lian,et al.  Protein NMR spectroscopy : practical techniques and applications , 2011 .

[43]  John E. Stone,et al.  OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems , 2010, Computing in Science & Engineering.

[44]  Russell W. Quong,et al.  ANTLR: A predicated‐LL(k) parser generator , 1995, Softw. Pract. Exp..

[45]  Carsten Kutzner,et al.  GROMACS 4:  Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation. , 2008, Journal of chemical theory and computation.

[46]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by dynamical simulated annealing from a random array of atoms Circumventing problems associated with folding , 1988, FEBS letters.

[47]  J. Ponder,et al.  Force fields for protein simulations. , 2003, Advances in protein chemistry.

[48]  W. L. Jorgensen,et al.  The OPLS [optimized potentials for liquid simulations] potential functions for proteins, energy minimizations for crystals of cyclic peptides and crambin. , 1988, Journal of the American Chemical Society.

[49]  K. Wüthrich Protein structure determination in solution by NMR spectroscopy. , 1990, The Journal of biological chemistry.

[50]  Sharon C. Glotzer,et al.  Rigid body constraints realized in massively-parallel molecular dynamics on graphics processing units , 2011, Comput. Phys. Commun..

[51]  M. Karplus Contact Electron‐Spin Coupling of Nuclear Magnetic Moments , 1959 .

[52]  K Wüthrich,et al.  NMR - this other method for protein and nucleic acid structure determination. , 1995, Acta crystallographica. Section D, Biological crystallography.

[53]  Klaus Schulten,et al.  Accelerating Molecular Modeling Applications with GPU Computing , 2009 .

[54]  G. Vriend,et al.  Prediction of protein conformational freedom from distance constraints , 1997, Proteins.

[55]  A. Gronenborn,et al.  A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. , 1993, Science.

[56]  A. Bax,et al.  TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts , 2009, Journal of biomolecular NMR.

[57]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[58]  Razvan Andonie,et al.  Molecular distance geometry optimization using geometric build-up and evolutionary techniques on GPU , 2012, 2012 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).