The Reconstruction of Doubled Genomes

The genome can be modeled as a set of strings (chromosomes) of distinguished elements called genes. Genome duplication is an important source of new gene functions and novel physiological pathways. Originally (ancestrally), a duplicated genome contains two identical copies of each chromosome, but through the genomic rearrangement mutational processes of reciprocal translocation (prefix and/or suffix exchanges between chromosomes) and substring reversals, this simple doubled structure is disrupted. At the time of observation, each of the chromosomes resulting from the accumulation of rearrangements can be decomposed into a succession of conserved segments, such that each segment appears exactly twice in the genome. We present exact algorithms for reconstructing the ancestral doubled genome in linear time, minimizing the number of rearrangement mutations required to derive the observed order of genes along the present-day chromosomes. Somewhat different techniques are required for a translocations-only model, a translocations/reversals model, both of these in the multichromosomal context (eukaryotic nuclear genomes), and a reversals-only model for single chromosome prokaryotic and organellar genomes. We apply these methods to the yeast genome, which is thought to have doubled, and to the liverwort mitochondrial genome, whose duplicate genes are unlikely to have arisen by genome doubling.

[1]  B. Gaut,et al.  DNA sequence evidence for the segmental allotetraploid origin of maize. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[2]  K. Choo,et al.  Neocentromeres: role in human disease, evolution, and centromere study. , 2002, American journal of human genetics.

[3]  David Sankoff,et al.  Genome Halving , 1998, CPM.

[4]  David A. Bader,et al.  A Linear-Time Algorithm for Computing Inversion Distance between Signed Permutations with an Experimental Study , 2001, J. Comput. Biol..

[5]  J. T. Trevors,et al.  Evolution of bacterial genomes , 1997, Antonie van Leeuwenhoek.

[6]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[7]  Karsten Hokamp,et al.  Extensive genomic duplication during early chordate evolution , 2002, Nature Genetics.

[8]  D. Sredni,et al.  Differential regulation of neurogenesis by the two Xenopus GATA-1 genes , 1997, Molecular and cellular biology.

[9]  Austin L. Hughes,et al.  Phylogenies of Developmentally Important Proteins Do Not Support the Hypothesis of Two Rounds of Genome Duplication Early in Vertebrate History , 1999, Journal of Molecular Evolution.

[10]  T. Delmonte,et al.  Toward a unified genetic map of higher plants, transcending the monocot–dicot divergence , 1996, Nature Genetics.

[11]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[12]  A. Hughes,et al.  Pattern and timing of gene duplication in animal genomes. , 2001, Genome research.

[13]  Pavel A. Pevzner,et al.  Transforming men into mice (polynomial algorithm for genomic distance problem) , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[14]  Adam Siepel,et al.  An algorithm to find all sorting reversals , 2002 .

[15]  Dannie Durand,et al.  Vertebrate evolution: doubling and shuffling with a full deck. , 2003, Trends in genetics : TIG.

[16]  David Sankoff,et al.  Hybridization and Genome Rearrangement , 1999, CPM.

[17]  Haim Kaplan,et al.  A Faster and Simpler Algorithm for Sorting Signed Permutations by Reversals , 1999, SIAM J. Comput..

[18]  D. J. Lydiate,et al.  Desaturase multigene families of Brassica napus arose through genome duplication , 1997, Theoretical and Applied Genetics.

[19]  David Sankoff,et al.  Reconstructing the pre-doubling genome , 1999, RECOMB.

[20]  Alberto Caprara,et al.  Sorting by reversals is difficult , 1997, RECOMB '97.

[21]  R. Hinegardner Evolution of Cellular DNA Content in Teleost Fishes , 1968, The American Naturalist.

[22]  K. J. Fryxell,et al.  The coevolution of gene family trees. , 1996, Trends in genetics : TIG.

[23]  Gad M. Landau,et al.  Proceedings of the 12th Annual Symposium on Combinatorial Pattern Matching , 2001 .

[24]  F. Müller,et al.  Chromatin diminution in nematodes , 1996, BioEssays : news and reviews in molecular, cellular and developmental biology.

[25]  S. Ohno,et al.  Evolution from fish to mammals by gene duplication. , 2009, Hereditas.

[26]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[27]  Susumu Ohno,et al.  DNA values of four primitive chordates , 1967, Chromosoma.

[28]  G. Moore,et al.  Cereal Genome Evolution: Grasses, line up and form a circle , 1995, Current Biology.

[29]  K. H. Wolfe,et al.  Eukaryote genome duplication - where's the evidence? , 1998, Current opinion in genetics & development.

[30]  Sridhar Hannenhalli,et al.  Polynomial-time Algorithm for Computing Translocation Distance Between Genomes , 1995, Discret. Appl. Math..

[31]  Takashi Kunisawa,et al.  Identification and chromosomal distribution of DNA sequence segments conserved since divergence of Escherichia coli and Bacillus subtilis , 2004, Journal of Molecular Evolution.

[32]  R. Shoemaker,et al.  Genome duplication in soybean (Glycine subgenus soja). , 1996, Genetics.

[33]  K. Oda,et al.  Gene organization deduced from the complete sequence of liverwort Marchantia polymorpha mitochondrial DNA. A primitive form of plant mitochondrial genome. , 1992, Journal of molecular biology.

[34]  Anne Bergeron,et al.  A very elementary presentation of the Hannenhalli-Pevzner theory , 2005, Discret. Appl. Math..

[35]  Nadia El-Mabrouk,et al.  Reconstructing an ancestral genome using minimum segments duplications and reversals , 2002, J. Comput. Syst. Sci..

[36]  D. Sankoff,et al.  Comparable rates of gene loss and functional divergence after genome duplications early in vertebrate evolution. , 1997, Genetics.

[37]  S. Tanksley,et al.  Comparative linkage maps of the rice and maize genomes. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Margaret R. Thomson,et al.  Vertebrate genome evolution and the zebrafish gene map , 1998, Nature Genetics.

[39]  K. H. Wolfe,et al.  Extent of genomic rearrangement after genome duplication in yeast. , 1998, Proceedings of the National Academy of Sciences of the United States of America.