Optimizing van der Waals calculi using Cell-lists and MPI

Van der Waals's energy models attraction and repulsion effects between pairs of atoms. This energy is used by ab initio methods to find the tertiary structure of a protein based only on its amino acid sequence and on a force field model. Several researches suggests Genetic Algorithms (GAs), are adequate for the development of ab initio approaches for protein structure prediction. A GA generates thousands of potential structures for a protein conformation, and evaluates the van der Waals' interaction in each generated structure. In practice, 99% of running time of the GA is used with the computation of van der Waals' energy. To compute the van der Waals energy for a given structure, we need to calculate effects of the interactions of all pairs of atoms in the structure. Using this cutoff, the complexity of the algorithm is O(n2) per conformation, where n is the number of atoms of the protein. For atoms separated by more than 8A˚ the van der Waals effect is relatively weak. Thus, we apply a Cell-lists method to the van der Waals function reducing the complexity of algorithm to O(n). Furthermore, we applied parallel programming to the Cell-lists method using MPI, reducing significatively the running time. The combination of the Cell-lists and MPI techniques resulted in a speedup of 1000 for a protein with 147,900 atoms.

[1]  Mohammed J. Zaki,et al.  Protein Structure Prediction , 2008, Methods in Molecular Biology™.

[2]  R. Friesner Computational Methods for Protein Folding , 2002 .

[3]  P. Strevens Iii , 1985 .

[4]  A. Baxevanis,et al.  A Practical Guide to the Analysis of Genes and Proteins , 1998 .

[5]  Enrique Alba,et al.  Heterogeneous Computing and Parallel Genetic Algorithms , 2002, J. Parallel Distributed Comput..

[6]  El-Ghazali Talbi,et al.  A parallel hybrid genetic algorithm for protein structure prediction on the computational grid , 2007, Future Gener. Comput. Syst..

[7]  M. J. Quinn,et al.  Parallel Computing: Theory and Practice , 1994 .

[8]  Enrique Alba,et al.  Parallel Metaheuristics: A New Class of Algorithms , 2005 .

[9]  Alexander D. MacKerell,et al.  Computational Biochemistry and Biophysics , 2001 .

[10]  Malcolm Atkinson,et al.  Memory Management , 2021, Professional C++.

[11]  George Karypis,et al.  Introduction to Parallel Computing Solution Manual , 2003 .

[12]  David M. Webster,et al.  Protein structure prediction : methods and protocols , 2000 .

[13]  Michael J. Quinn,et al.  Parallel programming in C with MPI and OpenMP , 2003 .

[14]  D. Nelson,et al.  Lehninger Principles of Biochemistry (5th edition) , 2008 .

[15]  J. Banavar,et al.  Computer Simulation of Liquids , 1988 .

[16]  D. Webster Protein Structure Prediction , 2000 .

[17]  William D. Mattson,et al.  Near-neighbor calculations using a modified cell-linked list method , 1999 .

[18]  Lubert Stryer,et al.  Biochemistry 5th ed , 2002 .

[19]  Y. Cui,et al.  Protein folding simulation with genetic algorithm and supersecondary structure constraints , 1998, Proteins.

[20]  Vipin Kumar,et al.  Introduction to Parallel Computing , 1994 .

[21]  Alex A. Freitas,et al.  Evolutionary Computation , 2002 .