Phylogenetic detection of numerous gene duplications shared by animals, fungi and plants

BackgroundGene duplication is considered a major driving force for evolution of genetic novelty, thereby facilitating functional divergence and organismal diversity, including the process of speciation. Animals, fungi and plants are major eukaryotic kingdoms and the divergences between them are some of the most significant evolutionary events. Although gene duplications in each lineage have been studied extensively in various contexts, the extent of gene duplication prior to the split of plants and animals/fungi is not clear.ResultsHere, we have studied gene duplications in early eukaryotes by phylogenetic relative dating. We have reconstructed gene families (with one or more orthogroups) with members from both animals/fungi and plants by using two different clustering strategies. Extensive phylogenetic analyses of the gene families show that, among nearly 2,600 orthogroups identified, at least 300 of them still retain duplication that occurred before the divergence of the three kingdoms. We further found evidence that such duplications were also detected in some highly divergent protists, suggesting that these duplication events occurred in the ancestors of most major extant eukaryotic groups.ConclusionsOur phylogenetic analyses show that numerous gene duplications happened at the early stage of eukaryotic evolution, probably before the separation of known major eukaryotic lineages. We discuss the implication of our results in the contexts of different models of eukaryotic phylogeny. One possible explanation for the large number of gene duplication events is one or more large-scale duplications, possibly whole genome or segmental duplication(s), which provides a genomic basis for the successful radiation of early eukaryotes.

[1]  Hong Ma,et al.  Evolutionary history of histone demethylase families: distinct evolutionary patterns suggest functional divergence , 2008, BMC Evolutionary Biology.

[2]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[3]  L. Hug,et al.  Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups” , 2009, Proceedings of the National Academy of Sciences.

[4]  B. Birren,et al.  Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae , 2004, Nature.

[5]  M. Nei,et al.  The origins and early evolution of DNA mismatch repair genes—multiple horizontal gene transfers and co-evolution , 2007, Nucleic acids research.

[6]  Mitchell L Sogin Early evolution and the origin of eukaryotes , 1992, Current Biology.

[7]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[8]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[9]  Y Van de Peer,et al.  Genome duplication, divergent resolution and speciation. , 2001, Trends in genetics : TIG.

[10]  M. P. Cummings PHYLIP (Phylogeny Inference Package) , 2004 .

[11]  Boris G. Mirkin,et al.  Ancestral paralogs and pseudoparalogs and their role in the emergence of the eukaryotic cell , 2005, Nucleic acids research.

[12]  S Blair Hedges,et al.  BMC Evolutionary Biology BioMed Central , 2003 .

[13]  E. Eichler,et al.  The origins and impact of primate segmental duplications. , 2009, Trends in genetics : TIG.

[14]  T. Cavalier-smith,et al.  Myosin domain evolution and the primary divergence of eukaryotes , 2005, Nature.

[15]  A. Simpson,et al.  The real ‘kingdoms’ of eukaryotes , 2004, Current Biology.

[16]  C. Seoighe Turning the clock back on ancient genome duplication. , 2003, Current opinion in genetics & development.

[17]  T. Cavalier-smith,et al.  Rooting the Eukaryote Tree by Using a Derived Gene Fusion , 2002, Science.

[18]  B Franz Lang,et al.  The tree of eukaryotes. , 2005, Trends in ecology & evolution.

[19]  M. Nei,et al.  Concerted and birth-and-death evolution of multigene families. , 2005, Annual review of genetics.

[20]  S. Adl,et al.  The New Higher Level Classification of Eukaryotes with Emphasis on the Taxonomy of Protists , 2005, The Journal of eukaryotic microbiology.

[21]  Eugene V Koonin,et al.  The Biological Big Bang model for the major transitions in evolution , 2007, Biology Direct.

[22]  Paramvir S. Dehal,et al.  Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate , 2005, PLoS biology.

[23]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[24]  O. Gascuel,et al.  An improved general amino acid replacement matrix. , 2008, Molecular biology and evolution.

[25]  Fabien Burki,et al.  Phylogenomics reveals a new ‘megagroup’ including most photosynthetic eukaryotes , 2008, Biology Letters.

[26]  Sudhir Kumar,et al.  Genomic clocks and evolutionary timescales. , 2003, Trends in genetics : TIG.

[27]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[28]  Kevin P. Byrne,et al.  Multiple rounds of speciation associated with reciprocal gene loss in polyploid yeasts , 2006, Nature.

[29]  J. Bull,et al.  An Empirical Test of Bootstrapping as a Method for Assessing Confidence in Phylogenetic Analysis , 1993 .

[30]  Yves Van de Peer,et al.  Computational approaches to unveiling ancient genome duplications , 2009 .

[31]  Steven Maere,et al.  Genome duplication and the origin of angiosperms. , 2005, Trends in ecology & evolution.

[32]  Martin Vingron,et al.  Ontologizer 2.0 - a multifunctional tool for GO term enrichment analysis and data exploration , 2008, Bioinform..

[33]  Yasuko Takahashi,et al.  Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events , 2022 .

[34]  Nobutaka Hirokawa,et al.  Analysis of the kinesin superfamily: insights into structure and function. , 2005, Trends in cell biology.

[35]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[36]  Hong Ma,et al.  Long-term maintenance of stable copy number in the eukaryotic SMC family: origin of a vertebrate meiotic SMC1 and fate of recent segmental duplicates , 2008 .

[37]  G. Fischer,et al.  A prominent role for segmental duplications in modeling eukaryotic genomes. , 2009, Comptes rendus biologies.

[38]  A. Rokas The origins of multicellularity and the early history of the genetic toolkit for animal development. , 2008, Annual review of genetics.

[39]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[40]  Toni Gabaldón,et al.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses , 2009, Bioinform..

[41]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[42]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[43]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.

[44]  Rolf Apweiler,et al.  InterProScan - an integration platform for the signature-recognition methods in InterPro , 2001, Bioinform..

[45]  O. Gascuel,et al.  Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. , 2006, Systematic biology.

[46]  W. Martin,et al.  Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes , 2004, Nature Reviews Genetics.

[47]  P. Forterre,et al.  Universal tree of life , 1993, Nature.

[48]  Charles E. Chapple,et al.  Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype , 2004, Nature.

[49]  M. Nei,et al.  Evolution of F-box genes in plants: Different modes of sequence divergence and their relationships with functional diversification , 2009, Proceedings of the National Academy of Sciences.

[50]  Steven Maere,et al.  The gain and loss of genes during 600 million years of vertebrate evolution , 2006, Genome Biology.

[51]  Guillaume Blanc,et al.  Widespread Paleopolyploidy in Model Plant Species Inferred from Age Distributions of Duplicate Genes , 2004, The Plant Cell Online.

[52]  R. Guigó,et al.  Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia , 2006, Nature.

[53]  E. Álvarez-Buylla,et al.  An ancestral MADS-box gene duplication occurred before the divergence of plants and animals. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Yves Van de Peer,et al.  Computational approaches to unveiling ancient genome duplications , 2004, Nature Reviews Genetics.

[55]  Masami Hasegawa,et al.  Root of the Eukaryota tree as inferred from combined maximum likelihood analyses of multiple molecular sequence data. , 2005, Molecular biology and evolution.

[56]  Brian C. Thomas,et al.  Gene-balanced duplications, like tetraploidy, provide predictable drive to increase morphological complexity. , 2006, Genome research.

[57]  K. H. Wolfe,et al.  Molecular evidence for an ancient duplication of the entire yeast genome , 1997, Nature.

[58]  Marie Sémon,et al.  Consequences of genome duplication. , 2007, Current opinion in genetics & development.

[59]  E. Koonin,et al.  Analysis of Rare Genomic Changes Does Not Support the Unikont–Bikont Phylogeny and Suggests Cyanobacterial Symbiosis as the Point of Primary Radiation of Eukaryotes , 2009, Genome biology and evolution.

[60]  Gynheung An,et al.  Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[61]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[62]  Masatoshi Nei,et al.  Origins and evolution of the recA/RAD51 gene family: Evidence for ancient gene duplication and endosymbiotic gene transfer , 2006, Proceedings of the National Academy of Sciences.

[63]  Haibao Tang,et al.  Unraveling ancient hexaploidy through multiply-aligned angiosperm gene maps. , 2008, Genome research.

[64]  Sean B. Carroll,et al.  Gene duplication and the adaptive evolution of a classic genetic switch , 2007, Nature.

[65]  M. Sogin,et al.  Evolution of the protists and protistan parasites from the perspective of molecular systematics. , 1998, International journal for parasitology.