A Grid-Based Hybrid Hierarchical Genetic Algorithm for Protein Structure Prediction

This chapter presents a hybrid hierarchical evolutionary algorithm for conformational sampling, built on several parallelization models. It first reviews general aspects of conformational sampling (existing approaches, complexity issues, and force field functions) and then focuses on the protein structure prediction problem. Because the energy landscape is highly multimodal, a hybrid evolutionary approach is defined that incorporates conjugate gradient and adaptive simulated annealing components. An island model is employed, so that the conformational sampling process is conducted collaboratively. Although this approach produced low-energy conformations, none were close to the native conformation. Consequently, a more complex hierarchical paradigm was constructed, with encouraging results.
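The island (insular) hybrid scheme summarized above can be illustrated with a minimal sketch. It is not the chapter's implementation: the Rastrigin function stands in for a force-field energy, a greedy hill climber stands in for the conjugate gradient and adaptive simulated annealing components, and all names and parameter values (`run`, `migrate_every`, population sizes, mutation rates) are assumptions chosen for illustration. The islands are also evolved sequentially here rather than in parallel.

```python
import math
import random

def energy(x):
    """Rastrigin function: a toy multimodal stand-in for a force-field energy."""
    return 10 * len(x) + sum(v * v - 10 * math.cos(2 * math.pi * v) for v in x)

def local_refine(x, rng, steps=20, scale=0.05):
    """Greedy hill climbing; a placeholder for conjugate gradient or ASA."""
    best, best_e = list(x), energy(x)
    for _ in range(steps):
        cand = [v + rng.gauss(0, scale) for v in best]
        e = energy(cand)
        if e < best_e:
            best, best_e = cand, e
    return best

def evolve_island(pop, rng, mut_sigma=0.3):
    """One generation: tournament selection, uniform crossover, mutation,
    local refinement of each child, and elitism."""
    elite = min(pop, key=energy)
    children = [list(elite)]  # the best individual survives unchanged
    while len(children) < len(pop):
        p1 = min(rng.sample(pop, 2), key=energy)
        p2 = min(rng.sample(pop, 2), key=energy)
        child = [a if rng.random() < 0.5 else b for a, b in zip(p1, p2)]
        child = [v + rng.gauss(0, mut_sigma) if rng.random() < 0.2 else v
                 for v in child]
        children.append(local_refine(child, rng))
    return children

def migrate(islands):
    """Ring migration: each island's best replaces the worst of the next island."""
    bests = [list(min(pop, key=energy)) for pop in islands]
    for i, pop in enumerate(islands):
        worst = max(range(len(pop)), key=lambda k: energy(pop[k]))
        pop[worst] = bests[i - 1]

def run(n_islands=4, pop_size=12, dim=4, generations=30, migrate_every=5, seed=0):
    """Evolve several islands with periodic migration; return
    (best initial energy, best final energy, best individual)."""
    rng = random.Random(seed)
    islands = [[[rng.uniform(-5.12, 5.12) for _ in range(dim)]
                for _ in range(pop_size)] for _ in range(n_islands)]
    start = min(energy(x) for pop in islands for x in pop)
    for g in range(1, generations + 1):
        islands = [evolve_island(pop, rng) for pop in islands]
        if g % migrate_every == 0:
            migrate(islands)
    best = min((x for pop in islands for x in pop), key=energy)
    return start, energy(best), best

if __name__ == "__main__":
    start, final, best = run()
    print(f"best initial energy: {start:.3f}, best final energy: {final:.3f}")
```

Because each island keeps its elite and migration only overwrites the worst individual, the best energy found never increases from one generation to the next. In a grid setting each island would typically run on its own node and exchange migrants by message passing; that distribution layer is omitted here.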
