An Analysis of Dynamic Mutation Operators for Conformational Sampling

A comparison analysis of dynamic mutation operators is proposed, having the conformational sampling problem as a case study. The analysis is sustained by a parallel Optimal Computing Budget Allocation (OCBA) selection procedure, employed in order to attain computational speedup. A Pearson system distribution based mutation operator is proposed, allowing for a highly flexible construction. As defined by a set of four parameters, the mean, variance, skewness and kurtosis, a large number of distributions can be simulated. As determined by the analysis outcomes, the class of operators exhibiting significant energy minimization or Root Mean Square Deviation (RMSD) bias is identified. Experiments are carried out on a large number of computational resources, allowing for the outline of an automatic a priori operator tuning and selection methodology. Although not presented in this chapter, similar complementary studies have been conducted on intensification operators and local search algorithms.

[1]  J. Ponder,et al.  Force fields for protein simulations. , 2003, Advances in protein chemistry.

[2]  D. Osguthorpe,et al.  Structure and energetics of ligand binding to proteins: Escherichia coli dihydrofolate reductase‐trimethoprim, a drug‐receptor system , 1988, Proteins.

[3]  Jack Dongarra,et al.  Proceedings of the 13th European PVM/MPI User's Group conference on Recent advances in parallel virtual machine and message passing interface , 1997 .

[4]  Hagit Attiya,et al.  Wiley Series on Parallel and Distributed Computing , 2004, SCADA Security: Machine Learning Concepts for Intrusion Detection and Prevention.

[5]  F. Herrera,et al.  Dynamic and heuristic fuzzy connectives‐based crossover operators for controlling the diversity and convergence of real‐coded genetic algorithms , 1996 .

[6]  Franck Cappello,et al.  Grid'5000: a large scale and highly reconfigurable grid experimental testbed , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[7]  Francisco Herrera,et al.  A taxonomy for the crossover operator for real‐coded genetic algorithms: An experimental study , 2003, Int. J. Intell. Syst..

[8]  Rolf Drechsler,et al.  Applications of Evolutionary Computing , 2004, Lecture Notes in Computer Science.

[9]  El-Ghazali Talbi,et al.  ParadisEO: A Framework for the Reusable Design of Parallel and Distributed Metaheuristics , 2004, J. Heuristics.

[10]  Enrique Alba,et al.  Metaheuristics and Parallelism , 2005 .

[11]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[12]  Jürgen Branke,et al.  Integrating Techniques from Statistical Ranking into Evolutionary Algorithms , 2006, EvoWorkshops.

[13]  Chun-Hung Chen A lower bound for the correct subset-selection probability and its application to discrete-event system simulations , 1996, IEEE Trans. Autom. Control..

[14]  Stephen E. Chick,et al.  New results on procedures that select the best system using CRN , 2000, 2000 Winter Simulation Conference Proceedings (Cat. No.00CH37165).

[15]  El-Ghazali Talbi,et al.  A Taxonomy of Hybrid Metaheuristics , 2002, J. Heuristics.

[16]  Matthias S. Müller,et al.  Progress Towards Petascale Applications in Biology: Status in 2006 , 2006, Euro-Par Workshops.

[17]  Chun-Hung Chen,et al.  Simulation Budget Allocation for Further Enhancing the Efficiency of Ordinal Optimization , 2000, Discret. Event Dyn. Syst..

[18]  Arnold Neumaier,et al.  Molecular Modeling of Proteins and Mathematical Prediction of Protein Structure , 1997, SIAM Rev..

[19]  C. Anfinsen,et al.  The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Erick Cantú-Paz,et al.  Efficient and Accurate Parallel Genetic Algorithms , 2000, Genetic Algorithms and Evolutionary Computation.

[21]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[22]  Jürgen Branke,et al.  Selecting a Selection Procedure , 2007, Manag. Sci..

[23]  El-Ghazali Talbi,et al.  Grid-based evolutionary strategies applied to the conformational sampling problem , 2007, 2007 IEEE Congress on Evolutionary Computation.

[24]  Francisco Herrera,et al.  Fuzzy connectives based crossover operators to model genetic algorithms population diversity , 1997, Fuzzy Sets Syst..

[25]  William Gropp,et al.  Mpi the complete reference: the mpi-2 extensions , 1998 .

[26]  René Thomsen,et al.  Flexible ligand docking using evolutionary algorithms: investigating the effects of variation operators and local search hybrids. , 2003, Bio Systems.

[27]  El-Ghazali Talbi,et al.  An enabling framework for parallel optimization on the computational grid , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[28]  William E. Hart,et al.  A Comparison of Global and Local Search Methods in Drug Docking , 1997, ICGA.

[29]  Yuichi Nagahara,et al.  The PDF and CF of Pearson type IV distributions and the ML estimation of the parameters , 1999 .

[30]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[31]  Lester Ingber,et al.  Adaptive simulated annealing (ASA): Lessons learned , 2000, ArXiv.

[32]  Griffin Caprio,et al.  Parallel Metaheuristics , 2008, IEEE Distributed Systems Online.

[33]  T. L. Blundell,et al.  Knowledge-based prediction of protein structures and the design of novel molecules , 1987, Nature.

[34]  Linlin Qiu,et al.  Internal friction in the ultrafast folding of the tryptophan cage , 2004 .

[35]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[36]  Jack Dongarra,et al.  MPI: The Complete Reference , 1996 .

[37]  El-Ghazali Talbi,et al.  A parallel hybrid genetic algorithm for protein structure prediction on the computational grid , 2007, Future Gener. Comput. Syst..

[38]  Alain J. Cozzone Proteins: Fundamental Chemical Properties , 2002 .

[39]  Xin Yao,et al.  Evolutionary programming made faster , 1999, IEEE Trans. Evol. Comput..

[40]  Lester Ingber,et al.  Adaptive Simulated Annealing (ASA) and Path-Integral (PATHINT) Algorithms: Generic Tools for Complex Systems , 2001 .

[41]  William Gropp,et al.  Mpi - The Complete Reference: Volume 2, the Mpi Extensions , 1998 .

[42]  William Gropp,et al.  MPICH2: A New Start for MPI Implementations , 2002, PVM/MPI.

[43]  Enrique Alba,et al.  Parallelism and evolutionary algorithms , 2002, IEEE Trans. Evol. Comput..

[44]  El-Ghazali Talbi,et al.  The Influence of Mutation on Protein-Ligand Docking Optimization: A Locality Analysis , 2008, PPSN.

[45]  Simon M. Lucas,et al.  Parallel Problem Solving from Nature - PPSN X, 10th International Conference Dortmund, Germany, September 13-17, 2008, Proceedings , 2008, PPSN.

[46]  El-Ghazali Talbi,et al.  A grid-based genetic algorithm combined with an adaptive simulated annealing for protein structure prediction , 2008, Soft Comput..

[47]  Magnus R. Hestenes,et al.  Iterative methods for solving linear equations , 1973 .

[48]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[49]  H A Scheraga,et al.  Improved genetic algorithm for the protein folding problem by use of a Cartesian combination operator , 1996, Protein science : a publication of the Protein Society.

[50]  Vijay S Pande,et al.  Trp zipper folding kinetics by molecular dynamics and temperature-jump spectroscopy , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[51]  M. Hestenes,et al.  Methods of conjugate gradients for solving linear systems , 1952 .

[52]  Yuichi Nagahara,et al.  A method of simulating multivariate nonnormal distributions by the Pearson distribution system and estimation , 2004, Comput. Stat. Data Anal..

[53]  Ramkumar Hariharan,et al.  Detailed Comparison of the Protein-Ligand Docking Efficiencies of GOLD, a Commercial Package and ArgusLab, a Licensable Freeware , 2006, Silico Biol..

[54]  Jürgen Branke,et al.  New developments in ranking and selection: an empirical comparison of the three main approaches , 2005, Proceedings of the Winter Simulation Conference, 2005..