A trade-off relationship between energetic cost and entropic cost for in vitro evolution

In this paper, we consider two complementary cost functions for the landscape exploring processes to obtain the global optimum sequence through in vitro evolution protocol: one is the entropic cost C(etp), which is based on the deviation from the uniformity of a mutant distribution in sequence space, and the other is the energetic cost C(eng), which is based on the total number of sequences to be generated and evaluated. Based on a prior knowledge about the structure of a given fitness landscapes, the conductor of the experiment can think up the efficient search algorithm (ESA), which requires the minimum number of points (=sequences) to be searched up to the global optimum. For five typical fitness landscapes, we considered their respective (putative) ESA, C(etp)(*) and C(eng)(*) based on the ESA. As a result, we found a trade-off relationship between C(etp)(*) and C(eng)(*) for every case, that is, C(eng)(*)+C(etp)(*) is approximately equal to the logarithm of the volume of the sequence space. C(etp)(*) and C(eng)(*) are interpreted in terms of the information-theoretic concepts.

[1]  M. Kasahara Major histocompatibility complex : evolution, structure, and function , 2000 .

[2]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[3]  Philip A. Romero,et al.  Exploring protein fitness landscapes by directed evolution , 2009, Nature Reviews Molecular Cell Biology.

[4]  G. Jung,et al.  Repertoire forecast of MHC class I binding peptides with peptide libraries , 2000 .

[5]  J. Wells,et al.  Additivity of mutational effects in proteins. , 1990, Biochemistry.

[6]  Y. Husimi,et al.  Fitness Landscape of a Biopolymer Participating in a Multi-step Reaction , 1998 .

[7]  C. Voigt,et al.  Rational evolutionary design: the theory of in vitro protein evolution. , 2000, Advances in protein chemistry.

[8]  R. Nussinov,et al.  In silico protein design by combinatorial assembly of protein building blocks , 2004, Protein science : a publication of the Protein Society.

[9]  K. Yura,et al.  Repeat of a helix-turn-helix module in DNA-binding proteins. , 1993, Protein engineering.

[10]  P. Stadler,et al.  Hamming chromatography , 2005, Molecular Diversity.

[11]  John Maynard Smith,et al.  Natural Selection and the Concept of a Protein Space , 1970, Nature.

[12]  Hong Qian,et al.  Open-system thermodynamic analysis of DNA polymerase fidelity , 2009, Physical biology.

[13]  Yasuhiko Shibanaka,et al.  Surveying a local fitness landscape of a protein with epistatic sites for the study of directed evolution. , 2002, Biopolymers.

[14]  A. Perelson,et al.  Protein evolution on partially correlated landscapes. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M. Eigen,et al.  Natural selection: a phase transition? , 2000, Biophysical chemistry.

[16]  R. Houghten,et al.  Generation and use of synthetic peptide combinatorial libraries for basic research and drug discovery , 1991, Nature.

[17]  Koichi Nishigaki,et al.  Construction of block-shuffled libraries of DNA for evolutionary protein engineering: Y-ligation-based block shuffling. , 2002, Protein engineering.

[18]  R. Houghten,et al.  Synthetic peptide combinatorial libraries (SPCLs): identification of the antigenic determinant of beta-endorphin recognized by monoclonal antibody 3E7. , 1993, Gene.

[19]  H. Yanagawa,et al.  Towards the creation of novel proteins by block shuffling. , 2006, Combinatorial chemistry & high throughput screening.

[20]  E. D. Weinberger,et al.  A theory of pragmatic information and its application to the quasi-species model of biological evolution. , 2001, Bio Systems.

[21]  Tsutomu Nakamura,et al.  Systematic circular permutation of an entire protein reveals essential folding elements , 2000, Nature Structural Biology.

[22]  Dorothea Heiss-Czedik,et al.  An Introduction to Genetic Algorithms. , 1997, Artificial Life.

[23]  Y. Iwasa,et al.  Evolutionarily stable mutation rate in a periodically changing environment. , 1989, Genetics.

[24]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Christopher A. Voigt,et al.  Protein building blocks preserved by recombination , 2002, Nature Structural Biology.

[26]  L. Brillouin,et al.  Science and information theory , 1956 .