Strong functional patterns in the evolution of eukaryotic genomes revealed by the reconstruction of ancestral protein domain repertoires

Background
Genome size and complexity, as measured by the number of genes or protein domains, are remarkably similar in most extant eukaryotes and generally exhibit no correlation with their morphological complexity. Underlying trends in the evolution of the functional content and capabilities of different eukaryotic genomes might be hidden by simultaneous gains and losses of genes.

Results
We reconstructed the domain repertoires of putative ancestral species at major divergence points, including the last eukaryotic common ancestor (LECA). We show that, surprisingly, domain losses in general outnumber domain gains during eukaryotic evolution. Only at the base of the animal and the vertebrate sub-trees do domain gains outnumber domain losses. The observed gain/loss balance has a distinct functional bias, most strikingly seen during animal evolution, where most of the gains represent domains involved in regulation and most of the losses represent domains with metabolic functions. This trend is so consistent that clustering genomes according to their functional profiles results in an organization similar to the tree of life. Furthermore, our results indicate that metabolic functions lost during animal evolution are likely being replaced by the metabolic capabilities of symbiotic organisms such as gut microbes.

Conclusions
While protein domain gains and losses are common throughout eukaryote evolution, losses often outweigh gains and lead to significant differences in functional profiles. Results presented here provide additional arguments for a complex last eukaryotic common ancestor, but also show a general trend of losses in metabolic capabilities and gains in regulatory complexity during the rise of animals.
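As an illustration of how per-branch domain gains and losses can be tallied once ancestral repertoires are reconstructed, the following is a minimal Python sketch. It assumes Dollo parsimony (each domain arises once and can only be lost afterwards); the abstract does not state which reconstruction method was used, so this is an assumption, and the function name `dollo_reconstruct`, the toy tree, and the domain labels are all hypothetical.

```python
def dollo_reconstruct(children, root, leaf_domains):
    """Reconstruct ancestral domain sets under Dollo parsimony (assumed method).

    children: dict mapping each internal node to a list of its child nodes.
    leaf_domains: dict mapping each leaf to the set of domains it contains.
    Returns (present, gains, losses), each a dict keyed by node name.
    """
    # Pre-order traversal: every parent appears before its descendants.
    order, stack = [], [root]
    while stack:
        node = stack.pop()
        order.append(node)
        stack.extend(children.get(node, []))

    # Bottom-up pass: domains seen in at least one leaf below each node.
    below = {}
    for node in reversed(order):
        kids = children.get(node, [])
        if kids:
            below[node] = set().union(*(below[k] for k in kids))
        else:
            below[node] = set(leaf_domains.get(node, ()))

    # Top-down pass: a domain is present at an internal node if it is still
    # needed below AND either was inherited from the parent or occurs in two
    # or more child subtrees (so its single origin is at or above this node).
    parent = {k: n for n, kids in children.items() for k in kids}
    present, gains, losses = {}, {}, {}
    for node in order:
        inherited = present.get(parent.get(node), set())
        kids = children.get(node, [])
        if kids:
            present[node] = {
                d for d in below[node]
                if d in inherited or sum(d in below[k] for k in kids) >= 2
            }
        else:
            present[node] = set(below[node])
        gains[node] = present[node] - inherited
        losses[node] = inherited - present[node]
    return present, gains, losses


# Toy example: three genomes A, B, C with hypothetical domain labels.
tree = {"root": ["A", "n1"], "n1": ["B", "C"]}
leaves = {"A": {"kinase", "metab"}, "B": {"kinase", "reg"}, "C": {"reg"}}
present, gains, losses = dollo_reconstruct(tree, "root", leaves)
for node in ("root", "A", "n1", "B", "C"):
    print(node, "gains:", sorted(gains[node]), "losses:", sorted(losses[node]))
```

Comparing the sizes of `gains[node]` and `losses[node]` along each branch gives the kind of gain/loss balance described in the Results; grouping the gained and lost domains by functional annotation would then expose any functional bias.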
