Protein design by multiobjective optimization: evolutionary and non-evolutionary approaches

Traditional simulation-based protein design considers energy minimization of candidate conformations as a singleobjective combinatorial optimization problem. In this paper we consider a challenging protein design problem, producing twelve protein species based on collagen that uniquely assort into four groups of three: a problem defined herein as a 4-level heterotrimer. We formulate a bi-objective combinatorial minimization problem that targets both stability and specificity of the 4-level heterotrimer. In order to approximate its Pareto frontier, we utilize both evolutionary and non-evolutionary approaches, operating in either Pareto or aggregation fashions. Our practical observations suggest that the SMS-EMOA with Evolution Strategies' operators is more effective than standard heuristics deployed in computational protein design, such as Simulated Annealing, Replica Exchange or the Canonical Genetic Algorithm. We investigate the attained Pareto optimal sets using Barrier Tree analysis, aiming to provide insights into the chemical search-space, as well as to explain the observed algorithmic trends. In particular, we identify Replica Exchange as a promising non-evolutionary technique for this problem class, due to its efficient exploration capabilities. Overall, a common high-level protocol for simultaneous landscape analysis of evolutionary and non-evolutionary search methodologies is put forward for the first time.

[1]  Andrew Leaver-Fay,et al.  A Generic Program for Multistate Protein Design , 2011, PloS one.

[2]  David Baker,et al.  Role of the Biomolecular Energy Gap in Protein Design, Structure, and Evolution , 2012, Cell.

[3]  J. Dennis,et al.  A closer look at drawbacks of minimizing weighted sums of objectives for Pareto set generation in multicriteria optimization problems , 1997 .

[4]  Thomas Bäck,et al.  Mixed-integer evolution strategy using multiobjective selection applied to warehouse design optimization , 2010, GECCO '10.

[5]  Vikas Nanda,et al.  Circular Permutation Directs Orthogonal Assembly in Complex Collagen Peptide Mixtures* , 2013, The Journal of Biological Chemistry.

[6]  David Baker,et al.  A Pareto-Optimal Refinement Method for Protein Design Scaffolds , 2013, PloS one.

[7]  Vikas Nanda,et al.  Designing artificial enzymes by intuition and computation. , 2010, Nature chemistry.

[8]  F. Calvo Non-genetic global optimization methods in molecular science: An overview , 2009 .

[9]  Vikas Nanda,et al.  Empirical estimation of local dielectric constants: Toward atomistic design of collagen mimetic peptides , 2015, Biopolymers.

[10]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[11]  Fei Xu,et al.  Computational design of a collagen A:B:C-type heterotrimer. , 2011, Journal of the American Chemical Society.

[12]  Kalyanmoy Deb,et al.  Multi-Objective Evolutionary Algorithms , 2015, Handbook of Computational Intelligence.

[13]  Michael W Deem,et al.  Parallel tempering: theory, applications, and new perspectives. , 2005, Physical chemistry chemical physics : PCCP.

[14]  Riccardo Poli,et al.  Limitations of the fitness-proportional negative slope coefficient as a difficulty measure , 2009, GECCO '09.

[15]  Yaohang Li,et al.  Parallel tempering in Rosetta Practice , 2005, Advances in Bioinformatics and Its Applications.

[16]  David E. Clark,et al.  Evolutionary Algorithms in Molecular Design , 1999 .

[17]  Bernd Hartke,et al.  Application of Evolutionary Algorithms to Global Cluster Geometry Optimization , 2004 .

[18]  Shira Warszawski,et al.  A “Fuzzy”-Logic Language for Encoding Multiple Physical Traits in Biomolecules , 2014, Journal of molecular biology.

[19]  H M Berman,et al.  Crystal and molecular structure of a collagen-like peptide at 1.9 A resolution. , 1994, Science.

[20]  Rolf Backofen,et al.  Exploring the lower part of discrete polymer model energy landscapes , 2006 .

[21]  Christopher M. Summa,et al.  De novo design and structural characterization of proteins and metalloproteins. , 1999, Annual review of biochemistry.

[22]  Nicola Beume,et al.  SMS-EMOA: Multiobjective selection based on dominated hypervolume , 2007, Eur. J. Oper. Res..

[23]  F M Richards,et al.  Optimal sequence selection in proteins of known structure by simulated evolution. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Mark A. Miller,et al.  Archetypal energy landscapes , 1998, Nature.

[25]  David B. Fogel,et al.  Evolutionary algorithms in theory and practice , 1997, Complex.

[26]  Xi Chen,et al.  Evolution Strategies as a Scalable Alternative to Reinforcement Learning , 2017, ArXiv.

[27]  J. Fallas,et al.  Computational design of self-assembling register-specific collagen heterotrimers , 2012, Nature Communications.

[28]  Thomas Bäck,et al.  Evolutionary algorithms in theory and practice - evolution strategies, evolutionary programming, genetic algorithms , 1996 .

[29]  J Machta,et al.  Strengths and weaknesses of parallel tempering. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Andries Petrus Engelbrecht,et al.  A survey of techniques for characterising fitness landscapes and some possible ways forward , 2013, Inf. Sci..

[31]  Witold Pedrycz,et al.  Springer Handbook of Computational Intelligence , 2015, Springer Handbook of Computational Intelligence.

[32]  Ernesto Benini,et al.  Genetic Diversity as an Objective in Multi-Objective Evolutionary Algorithms , 2003, Evolutionary Computation.

[33]  Michael Masin,et al.  Diversity Maximization Approach for Multiobjective Optimization , 2008, Oper. Res..

[34]  David E. Clark,et al.  Evolutionary Algorithms in Molecular Design: Clark/Evolutionary , 2000 .

[35]  Marco Laumanns,et al.  SPEA2: Improving the Strength Pareto Evolutionary Algorithm For Multiobjective Optimization , 2002 .

[36]  G. Parisi,et al.  Simulated tempering: a new Monte Carlo scheme , 1992, hep-lat/9205018.

[37]  D. Baker,et al.  The coming of age of de novo protein design , 2016, Nature.