A Novel EA-based Memetic Approach for Efficiently Mapping Complex Fitness Landscapes

Recent work in computational structural biology focuses on modeling intrinsically dynamic proteins important to human biology and health. The energy landscapes of these proteins are rich in minima that correspond to alternative structures with which a dynamic protein binds to molecular partners in the cell. On such landscapes, evolutionary algorithms that switch their objective from classic optimization to mapping are more informative of protein structure function relationships. While techniques for mapping energy landscapes have been developed in computational chemistry and physics, protein landscapes are more difficult for mapping due to their high dimensionality and multimodality. In this paper, we describe a memetic evolutionary algorithm that is capable of efficiently mapping complex landscapes. In conjunction with a hall of fame mechanism, the algorithm makes use of a novel, lineage- and neighborhood-aware local search procedure or better exploration and mapping of complex landscapes. We evaluate the algorithm on several benchmark problems and demonstrate the superiority of the novel local search mechanism. In addition, we illustrate its effectiveness in mapping the complex multimodal landscape of an intrinsically dynamic protein important to human health.

[1]  R. Nussinov,et al.  The role of dynamic conformational ensembles in biomolecular recognition. , 2009, Nature chemical biology.

[2]  J. Doye Network topology of a potential energy landscape: a static scale-free network. , 2002, Physical review letters.

[3]  Kwok Yip Szeto,et al.  Complex energy landscape mapping by histogram assisted genetic algorithm , 2010, GECCO '10.

[4]  Kwok Yip Szeto,et al.  Landscape Mapping by Multi-population Genetic Algorithm , 2008, NICSO.

[5]  Ruth Nussinov,et al.  Mapping the Conformation Space of Wildtype and Mutant H-Ras with a Memetic, Cellular, and Multiscale Evolutionary Algorithm , 2015, PLoS Comput. Biol..

[6]  Ruth Nussinov,et al.  Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics , 2016, PLoS Comput. Biol..

[7]  Jochen S. Hub,et al.  Detection of Functional Modes in Protein Dynamics , 2010 .

[8]  Kurt Binder,et al.  Supercooled Liquids and the Glass Transition , 2011 .

[9]  Kewei Tu,et al.  Mapping the Energy Landscape of Non-convex Optimization Problems , 2014, EMMCVPR.

[10]  Mark A. Miller,et al.  Archetypal energy landscapes , 1998, Nature.

[11]  Mark T. Oakley,et al.  Structures and Energy Landscapes of Hydrated Sulfate Clusters. , 2015, Journal of chemical theory and computation.

[12]  Didier Devaurs,et al.  Characterizing Energy Landscapes of Peptides Using a Combination of Stochastic Algorithms , 2015, IEEE Transactions on NanoBioscience.

[13]  Giorgio Parisi,et al.  Fractal free energy landscapes in structural glasses , 2014, Nature Communications.

[14]  Amarda Shehu,et al.  A multiscale hybrid evolutionary algorithm to obtain sample-based representations of multi-basin protein energy landscapes , 2014, BCB.

[15]  N. Hansen,et al.  Real-Parameter Black-Box Optimization Benchmarking: Experimental Setup , 2010 .

[16]  Kenneth A. De Jong,et al.  Evolutionary search strategies for efficient sample-based representations of multiple-basin protein energy landscapes , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[17]  M. Karplus,et al.  The topology of multidimensional potential energy surfaces: Theory and application to peptide structure and kinetics , 1997 .

[18]  K. Dejong,et al.  An analysis of the behavior of a class of genetic adaptive systems , 1975 .

[19]  Kenneth A. De Jong,et al.  Mapping Multiple Minima in Protein Energy Landscapes with Evolutionary Algorithms , 2015, GECCO.

[20]  D. Kern,et al.  Dynamic personalities of proteins , 2007, Nature.

[21]  D. Boehr,et al.  How Do Proteins Interact? , 2008, Science.

[22]  Amarda Shehu,et al.  A Data-Driven Evolutionary Algorithm for Mapping Multibasin Protein Energy Landscapes , 2015, J. Comput. Biol..