An effective mutation-based measure for evaluating the suitability of parental sequences to undergo DNA shuffling experiments

The DNA shuffling process has been successfully used in many experiments of Directed Molecular Evolution. In a shuffling experiment genes are recombined by an iterative procedure of PCR cycles aiming at obtaining new genes, hopefully with some of the original functions being improved. The optimizations of the parameters involved in the process as well as the characteristics of the parental sequences are of extreme importance to guarantee the success of a shuffling experiment. This paper proposes a new measure, based on the number of bases between existing mutations in the parental sequences, suitable for evaluating the suitability of two sequences to be submitted to a DNA shuffling experiment. In order to investigate the usefulness of the proposed mutation-based measure versus two commonly used measures, a family of 37 DNA gene sequences codifying for snake venom metallopeptidases was used for evaluation purposes using the three measures. The parental sequences identified by each of the three measures were validated by simulating the DNA shuffling process using the software eShuffle. The eShuffle results illustrate on the benefits of the mutation-based measure proposed in this paper.

[1]  András Kocsor,et al.  Sequence analysis Application of compression-based distance measures to protein sequence classification : a methodological study , 2005 .

[2]  R. Keenan,et al.  DNA shuffling as a tool for protein crystallization. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Paul M. B. Vitányi,et al.  Clustering by compression , 2003, IEEE Transactions on Information Theory.

[4]  H. Kishino,et al.  Evaluation of the maximum likelihood estimate of the evolutionary tree topologies from DNA sequence data, and the branching order in hominoidea , 1989, Journal of Molecular Evolution.

[5]  M. Kimura A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences , 1980, Journal of Molecular Evolution.

[6]  Dong‐sheng Li,et al.  A novel high molecular weight metalloproteinase cleaves fragment F1 of activated human prothrombin. , 2004, Toxicon : official journal of the International Society on Toxinology.

[7]  W. You,et al.  A novel metalloprotease from Gloydius halys venom induces endothelial cell apoptosis through its protease and disintegrin-like domains. , 2003, Journal of biochemistry.

[8]  Wen-Bin Wu,et al.  Activation of MMP-2, cleavage of matrix proteins, and adherens junctions during a snake venom metalloproteinase-induced endothelial cell apoptosis. , 2003, Experimental cell research.

[9]  Jonas S. Almeida,et al.  Alignment-free sequence comparison-a review , 2003, Bioinform..

[10]  J. Seffernick,et al.  Novel enzyme activities and functional plasticity revealed by recombining highly homologous enzymes. , 2001, Chemistry & biology.

[11]  C D Maranas,et al.  Predicting crossover generation in DNA shuffling , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Xin Chen,et al.  An information-based sequence distance and its application to whole mitochondrial genome phylogeny , 2001, Bioinform..

[13]  Catherine Letondal,et al.  A Web interface generator for molecular biology programs in Unix , 2001, Bioinform..

[14]  F. Arnold,et al.  Methods for in vitro DNA recombination and random chimeragenesis. , 2000, Methods in enzymology.

[15]  Marc Ostermeier,et al.  Finding Cinderella's slipper—proteins that fit , 1999, Nature Biotechnology.

[16]  H. Suenaga,et al.  Enhanced degradation of polychlorinated biphenyls by directed evolution of biphenyl dioxygenase , 1998, Nature Biotechnology.

[17]  W. Stemmer,et al.  DNA shuffling of a family of genes from diverse species accelerates directed evolution , 1998, Nature.

[18]  F. Arnold,et al.  Functional and nonfunctional mutations distinguished by random recombination of homologous genes. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[19]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[20]  W. Stemmer DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[21]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[22]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[23]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.