Investigating vesicular selection: A selection operator in in vitro evolution

Abstract: Directed protein evolution has led to major advances in organic chemistry, enabling the development of highly optimised proteins. The SELEX method has also been highly effective in evolving ribose nucleic acid (RNA) or deoxy-ribose nucleic acid (DNA) molecules; variants have been proposed which allow SELEX to be used in protein evolution. All of these methods can be viewed as evolutionary algorithms implemented in chemistry. A number of methods rely on selection of natural cells, or of artificial bubbles. These methods result in a new form of selection mechanism, which we call vesicular selection (VS). It is not, prima facie, clear whether VS is an effective selection mechanism, or how its performance is affected by changes in vesicle size. It is difficult to investigate this in vitro, so we use in silico methods derived from evolutionary computation. The primary aim is to test whether this selection method hinders biochemical evolutionary search (in which case, it might be worth investing research effort in discovering alternative selection methods). An in silico implementation of this selection method, embedded in an otherwise-typical evolutionary computation system, shows reasonable ability to solve tough optimisation problems, together with an acceptable ability to concentrate the solutions found. We compare it with tournament selection (TS), a standard evolutionary computation method, which can be finely tuned for high selection pressure, but only coarsely tuned for low selection pressure. By contrast, the new selection mechanism VS is highly tunable at low selection pressures. It is thus particularly suited to problem domains where extensive exploration capabilities are required. Since there is very good reason to believe that protein search spaces require highly exploratory search, the selection mechanism is well matched to its application in combinatorial chemistry.

[1]  Ponnuthurai Nagaratnam Suganthan,et al.  Benchmark Functions for the CEC'2013 Special Session and Competition on Large-Scale Global Optimization , 2008 .

[2]  R L Jernigan,et al.  Consistencies of individual DNA base-amino acid interactions in structures and sequences. , 1995, Nucleic acids research.

[3]  Martin Pelikan Analysis of estimation of distribution algorithms and genetic algorithms on NK landscapes , 2008, GECCO '08.

[4]  Julian Francis Miller,et al.  Cartesian genetic programming , 2008, GECCO '08.

[5]  Janet Wiles,et al.  Maximally rugged NK landscapes contain the highest peaks , 2005, GECCO '05.

[6]  Takuya Ueda,et al.  Protein synthesis by pure translation systems. , 2005, Methods.

[7]  H. Hogrefe,et al.  Amplification efficiency of thermostable DNA polymerases. , 2003, Analytical biochemistry.

[8]  Kalyanmoy Deb,et al.  Multi-objective optimization using evolutionary algorithms , 2001, Wiley-Interscience series in systems and optimization.

[9]  Heinz Mühlenbein,et al.  The parallel genetic algorithm as function optimizer , 1991, Parallel Comput..

[10]  Martin J. Oates,et al.  Fitness Gains and Mutation Patterns: Deriving Mutation Rates by Exploiting Landscape Data , 2002, FOGA.

[11]  E. D. Weinberger,et al.  The NK model of rugged fitness landscapes and its application to maturation of the immune response. , 1989, Journal of theoretical biology.

[12]  C. Voigt,et al.  Rational evolutionary design: the theory of in vitro protein evolution. , 2000, Advances in protein chemistry.

[13]  Dan S. Tawfik,et al.  Man-made cell-like compartments for molecular evolution , 1998, Nature Biotechnology.

[14]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[15]  David Corne,et al.  Observed dynamics of Large Scale Parallel Evolutionary Algorithms with Implications for protein Engineering , 2002, SEAL.

[16]  Christophe Ampe,et al.  Reducing mutational bias in random protein libraries. , 2005, Analytical biochemistry.

[17]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[18]  W. Thilly,et al.  Specificity, efficiency, and fidelity of PCR. , 1993, PCR methods and applications.

[19]  A. E. Eiben,et al.  Introduction to Evolutionary Computing , 2003, Natural Computing Series.

[20]  Nguyen Xuan Hoai,et al.  Estimating the distribution and propagation of genetic programming building blocks through tree compression , 2009, GECCO.

[21]  Cara MacNish Towards unbiased benchmarking of evolutionary and hybrid algorithms for real-valued optimisation , 2007, Connect. Sci..

[22]  Anne Brindle,et al.  Genetic algorithms for function optimization , 1980 .

[23]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[24]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems (Genetic and Evolutionary Computation) , 2006 .

[25]  L. Darrell Whitley,et al.  Unbiased tournament selection , 2005, GECCO '05.

[26]  Kalyanmoy Deb,et al.  An Investigation of Niche and Species Formation in Genetic Function Optimization , 1989, ICGA.

[27]  A. Griewank Generalized descent for global optimization , 1981 .

[28]  J. David Schaffer,et al.  Proceedings of the third international conference on Genetic algorithms , 1989 .

[29]  Norman D. Cook,et al.  THE CASE FOR REVERSE TRANSLATION , 1975 .

[30]  Kenneth Alan De Jong,et al.  An analysis of the behavior of a class of genetic adaptive systems. , 1975 .

[31]  Günter Rudolph,et al.  Theory of Evolutionary Algorithms: A Bird's Eye View , 1999, Theor. Comput. Sci..

[32]  C. Darwin On the Origin of Species by Means of Natural Selection: Or, The Preservation of Favoured Races in the Struggle for Life , 2019 .

[33]  Aimo A. Törn,et al.  Global Optimization , 1999, Science.

[34]  Kenneth A. De Jong,et al.  Measurement of Population Diversity , 2001, Artificial Evolution.

[35]  Harri Savilahti,et al.  Critical evaluation of random mutagenesis by error-prone polymerase chain reaction protocols, Escherichia coli mutator strain, and hydroxylamine treatment. , 2009, Analytical biochemistry.

[36]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[37]  Nicolas Lachiche,et al.  Coarse grain parallelization of evolutionary algorithms on GPGPU cards with EASEA , 2009, GECCO.

[38]  K J Eilertsen,et al.  Identification and characterization of two huge protein components of the brush border cytoskeleton: evidence for a cellular isoform of titin , 1992, The Journal of cell biology.

[39]  Xiaodong Li,et al.  Benchmark Functions for the CEC'2010 Special Session and Competition on Large-Scale , 2009 .

[40]  R. Siegel,et al.  Generation of large libraries of random mutants in Bacillus subtilis by PCR-based plasmid multimerization. , 1997, BioTechniques.

[41]  K. O'Neil,et al.  Phage display: protein engineering by directed evolution. , 1995, Current opinion in structural biology.

[42]  E. Vermaas,et al.  Selection of single-stranded DNA molecules that bind and inhibit human thrombin , 1992, Nature.

[43]  F. Arnold,et al.  Directed evolution of biocatalysts. , 1999, Current opinion in chemical biology.

[44]  David J. Galas,et al.  A Mathematical Analysis ofin VitroMolecular Selection – Amplification , 1996 .

[45]  Leighton Pritchard,et al.  A general model of error-prone PCR. , 2005, Journal of theoretical biology.

[46]  Philip A. Romero,et al.  Exploring protein fitness landscapes by directed evolution , 2009, Nature Reviews Molecular Cell Biology.

[47]  Kenneth A. De Jong Evolutionary Computation Theory , 2006 .

[48]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[49]  Martin J. Oates,et al.  On Fitness Distributions and Expected Fitness Gain of Mutation Rates in Parallel Evolutionary Algorithms , 2002, PPSN.

[50]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[51]  H. H. Rosenbrock,et al.  An Automatic Method for Finding the Greatest or Least Value of a Function , 1960, Comput. J..

[52]  Lishan Kang,et al.  An Experimental Study of Some Control Parameters in Parallel Genetic Programming , 2003, Neural Parallel Sci. Comput..

[53]  Daniel A. Ashlock,et al.  An Updated Taxonomy of Evolutionary Computation Problems using Graph-based Evolutionary Algorithms , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[54]  J. A. Lozano,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[55]  Daniel A. Ashlock,et al.  Graph-based evolutionary algorithms , 2006, IEEE Transactions on Evolutionary Computation.

[56]  W. Gilbert Origin of life: The RNA world , 1986, Nature.

[57]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[58]  G. F. Joyce,et al.  Darwinian Evolution on a Chip , 2008, PLoS biology.

[59]  C. Langton Self-reproduction in cellular automata , 1984 .

[60]  Andrew D Griffiths,et al.  Directed evolution by in vitro compartmentalization , 2006, Nature Methods.

[61]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[62]  Andreas Plückthun,et al.  In-vitro protein evolution by ribosome display and mRNA display. , 2004, Journal of immunological methods.