Evolution of new protein topologies through multistep gene rearrangements

New protein folds have emerged throughout evolution, but it remains unclear how a protein fold can evolve while maintaining its function, particularly when fold changes require several sequential gene rearrangements. Here, we explored hypothetical evolutionary pathways linking different topological families of the DNA-methyltransferase superfamily. These pathways entail successive gene rearrangements through a series of intermediates, all of which should be sufficiently active to maintain the organism's fitness. By means of directed evolution, and starting from HaeIII methyltransferase (M.HaeIII), we selected all the required intermediates along these paths (a duplicated fused gene and duplicates partially truncated at their 5′ or 3′ coding regions) that maintained function in vivo. These intermediates led to new functional genes that resembled natural methyltransferases from three known classes or that belonged to a new class first seen in our evolution experiments and subsequently identified in natural genomes. Our findings show that new protein topologies can evolve gradually through multistep gene rearrangements and provide new insights regarding these processes.

[1]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[2]  G M Edelman,et al.  Favin versus concanavalin A: Circularly permuted amino acid sequences. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[3]  A. Kiss,et al.  Cloning the modification methylase gene of Bacillus sphaericus R in Escherichia coli. , 1980, Gene.

[4]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[5]  K. Luger,et al.  Correct folding of circularly permuted variants of a beta alpha barrel enzyme in vivo. , 1989, Science.

[6]  H. Kotani,et al.  The fokI restriction-modification system. I. Organization and nucleotide sequences of the restriction and modification genes. , 1989, The Journal of biological chemistry.

[7]  H. Sugisaki,et al.  The FokI restriction-modification system. II. Presence of two domains in FokI methylase responsible for modification of different DNA strands. , 1989, The Journal of biological chemistry.

[8]  A. Janulaitis,et al.  Sequence motifs characteristic of DNA[cytosine-N4]methyltransferases: similarity to adenine and cytosine-C5 DNA-methylases. , 1989, Nucleic acids research.

[9]  W. Lipscomb,et al.  The crystal structure of Haelll methyltransferase covalently complexed to DNA: An extrahelical cytosine and rearranged base pairing , 1995, Cell.

[10]  I. Kobayashi,et al.  Restriction-modification systems as genomic parasites in competition for specific sequences. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[11]  R B Russell,et al.  Swaposins: circular permutations within genes encoding saposin homologues. , 1995, Trends in biochemical sciences.

[12]  R. Blumenthal,et al.  Structure-guided analysis reveals nine sequence motifs conserved among DNA amino-methyltransferases, and suggests a catalytic mechanism for these enzymes. , 1995, Journal of molecular biology.

[13]  H. K. Schachman,et al.  Random circular permutation of genes and expressed polypeptide chains: application of the method to the catalytic chains of aspartate transcarbamoylase. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[14]  The Flavobacterium okeanokoites adenine-N6-specific DNA-methyltransferase M.FokI is a tandem enzyme of two independent domains with very different kinetic properties. , 1998, European journal of biochemistry.

[15]  Werner Braun,et al.  Exact and efficient analytical calculation of the accessible surface areas and their gradients for macromolecules , 1998, J. Comput. Chem..

[16]  Dan S. Tawfik,et al.  Man-made cell-like compartments for molecular evolution , 1998, Nature Biotechnology.

[17]  Marc Ostermeier,et al.  Finding Cinderella's slipper—proteins that fit , 1999, Nature Biotechnology.

[18]  R. Glockshuber,et al.  Random circular permutation of DsbA reveals segments that are essential for protein folding and stability. , 1999, Journal of molecular biology.

[19]  Marc Ostermeier,et al.  A combinatorial approach to hybrid enzymes independent of DNA homology , 1999, Nature Biotechnology.

[20]  J. H. Shim,et al.  Combinatorial protein engineering by incremental truncation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Albert Jeltsch,et al.  Circular Permutations in the Molecular Evolution of DNA Methyltransferases , 1999, Journal of Molecular Evolution.

[22]  Tsutomu Nakamura,et al.  Systematic circular permutation of an entire protein reveals essential folding elements , 2000, Nature Structural Biology.

[23]  Janusz M Bujnicki,et al.  Sequence permutations in the molecular evolution of DNA methyltransferases , 2002, BMC Evolutionary Biology.

[24]  N. Grishin Fold change in evolution of protein structures. , 2001, Journal of structural biology.

[25]  Jennifer L. Martin,et al.  SAM (dependent) I AM: the S-adenosylmethionine-dependent methyltransferase fold. , 2002, Current opinion in structural biology.

[26]  E. Koonin,et al.  Trends in protein evolution inferred from sequence and structure analysis. , 2002, Current opinion in structural biology.

[27]  E. Koonin,et al.  The structure of the protein universe and genome evolution , 2002, Nature.

[28]  Beyond Watson and Crick: DNA Methylation and Molecular Enzymology of DNA Methyltransferases , 2002 .

[29]  Dan S. Tawfik,et al.  Directed evolution of phosphotriesterase from Pseudomonas diminuta for heterologous expression in Escherichia coli results in stabilization of the metal-free state. , 2005, Protein engineering, design & selection : PEDS.

[30]  A. Wagner Robustness, evolvability, and neutrality , 2005, FEBS letters.

[31]  S. Chandrasegaran,et al.  Protein fragment complementation in M.HhaI DNA methyltransferase. , 2005, Biochemical and biophysical research communications.