Transcription factor evolution in eukaryotes and the assembly of the regulatory toolkit in multicellular lineages

Significance Independent transitions to multicellularity in eukaryotes involved the evolution of complex transcriptional regulation toolkits to control cell differentiation. By using comparative genomics, we show that plants and animals required richer transcriptional machineries compared with other eukaryotic multicellular lineages. We suggest this is due to their orchestrated embryonic development. Moreover, our analysis of transcription factor (TF) expression patterns during the development of animals reveal links between TF evolution, species ontogeny, and the phylotypic stage. Transcription factors (TFs) are the main players in transcriptional regulation in eukaryotes. However, it remains unclear what role TFs played in the origin of all of the different eukaryotic multicellular lineages. In this paper, we explore how the origin of TF repertoires shaped eukaryotic evolution and, in particular, their role into the emergence of multicellular lineages. We traced the origin and expansion of all known TFs through the eukaryotic tree of life, using the broadest possible taxon sampling and an updated phylogenetic background. Our results show that the most complex multicellular lineages (i.e., those with embryonic development, Metazoa and Embryophyta) have the most complex TF repertoires, and that these repertoires were assembled in a stepwise manner. We also show that a significant part of the metazoan and embryophyte TF toolkits evolved earlier, in their respective unicellular ancestors. To gain insights into the role of TFs in the development of both embryophytes and metazoans, we analyzed TF expression patterns throughout their ontogeny. The expression patterns observed in both groups recapitulate those of the whole transcriptome, but reveal some important differences. Our comparative genomics and expression data reshape our view on how TFs contributed to eukaryotic evolution and reveal the importance of TFs to the origins of multicellularity and embryonic development.

[1]  H. Hoekstra,et al.  Causes and consequences of the evolution of reproductive proteins. , 2008, The International journal of developmental biology.

[2]  Corinne Da Silva,et al.  Phylogenomics Revives Traditional Views on Deep Animal Relationships , 2009, Current Biology.

[3]  F. Berger,et al.  Position dependent control of cell fate in the Fucus embryo: role of intercellular communication. , 1998, Development.

[4]  H. Doddapaneni,et al.  Cyanophora paradoxa Genome Elucidates Origin of Photosynthesis in Algae and Plants , 2012, Science.

[5]  D. Tautz,et al.  A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns , 2010, Nature.

[6]  Diethard Tautz,et al.  Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa , 2010, BMC Biology.

[7]  F. Delsuc,et al.  Additional molecular support for the new chordate phylogeny , 2008, Genesis.

[8]  Todd H. Oakley,et al.  The Amphimedon queenslandica genome and the evolution of animal complexity , 2010, Nature.

[9]  L. Mirny,et al.  Different gene regulation strategies revealed by analysis of binding motifs. , 2009, Trends in genetics : TIG.

[10]  Richard Cowper-Sal·lari,et al.  microRNAs reveal the interrelationships of hagfish, lampreys, and gnathostomes and the nature of the ancestral vertebrate , 2010, Proceedings of the National Academy of Sciences.

[11]  Tomislav Domazet-Loso,et al.  A phylostratigraphy approach to uncover the genomic history of major adaptations in metazoan lineages. , 2007, Trends in genetics : TIG.

[12]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[13]  B. Lang,et al.  Phylogenetic relationships within the Opisthokonta based on phylogenomic analyses of conserved single-copy protein domains. , 2012, Molecular biology and evolution.

[14]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[15]  P. Keeling,et al.  The evolutionary history of haptophytes and cryptophytes: phylogenomic evidence for separate origins , 2012, Proceedings of the Royal Society B: Biological Sciences.

[16]  J. Derisi,et al.  Identification and characterization of a previously undescribed family of sequence-specific DNA-binding domains , 2013, Proceedings of the National Academy of Sciences.

[17]  N. King,et al.  The unicellular ancestry of animal development. , 2004, Developmental cell.

[18]  R. Guigó,et al.  Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia , 2006, Nature.

[19]  Wen-Hsiung Li,et al.  Transcription Factor Families Have Much Higher Expansion Rates in Plants than in Animals1 , 2005, Plant Physiology.

[20]  T. Eulgem Eukaryotic transcription factors , 2001, Genome Biology.

[21]  F. Delsuc,et al.  Tunicates and not cephalochordates are the closest living relatives of vertebrates , 2006, Nature.

[22]  D. Maddison,et al.  Mesquite: a modular system for evolutionary analysis. Version 2.6 , 2009 .

[23]  B Franz Lang,et al.  The origins of multicellularity: a multi-taxon genome initiative. , 2007, Trends in genetics : TIG.

[24]  L. Hug,et al.  Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups” , 2009, Proceedings of the National Academy of Sciences.

[25]  M. Van Montagu,et al.  Convergent gene loss following gene and genome duplications creates single-copy families in flowering plants , 2013, Proceedings of the National Academy of Sciences.

[26]  I. Grosse,et al.  A transcriptomic hourglass in plant embryogenesis , 2012, Nature.

[27]  M. Telford,et al.  Arthropods Are Monophyletic Arthropods Are Ecdysozoans the Origin and Evolution of Arthropods Insight Review , 2022 .

[28]  A. Rokas The origins of multicellularity and the early history of the genetic toolkit for animal development. , 2008, Annual review of genetics.

[29]  Susana M. Coelho,et al.  OUROBOROS is a master regulator of the gametophyte to sporophyte life cycle transition in the brown alga Ectocarpus , 2011, Proceedings of the National Academy of Sciences.

[30]  E. Shelest,et al.  Transcription factors in fungi. , 2008, FEMS microbiology letters.

[31]  J. W. Valentine,et al.  Evolutionary dynamics of plants and animals: a comparative approach. , 1991, Palaios.

[32]  Matthew W. Brown,et al.  Aggregative Multicellularity Evolved Independently in the Eukaryotic Supergroup Rhizaria , 2012, Current Biology.

[33]  W. Albertin,et al.  Polyploidy in fungi: evolution after whole-genome duplication , 2012, Proceedings of the Royal Society B: Biological Sciences.

[34]  B Franz Lang,et al.  Unexpected repertoire of metazoan transcription factors in the unicellular holozoan Capsaspora owczarzaki. , 2011, Molecular biology and evolution.

[35]  G. Richards,et al.  Early evolution of metazoan transcription factors. , 2009, Current opinion in genetics & development.

[36]  T. Hughes A Handbook of Transcription Factors , 2011 .

[37]  Elliot M Meyerowitz,et al.  Plants Compared to Animals: The Broadest Comparative Study of Development , 2002, Science.

[38]  D. Latchman Transcription factors: an overview. , 1997, The international journal of biochemistry & cell biology.

[39]  Naoki Irie,et al.  Comparative transcriptome analysis reveals vertebrate phylotypic period during organogenesis , 2011, Nature communications.

[40]  B. Morgenstern,et al.  Improved Phylogenomic Taxon Sampling Noticeably Affects Nonbilaterian Relationships , 2010, Molecular biology and evolution.

[41]  D. B. Burt Evolutionary stasis, constraint and other terminology describing evolutionary patterns☆ , 2001 .

[42]  C. Bowler,et al.  Transcription factor families inferred from genome sequences of photosynthetic stramenopiles. , 2010, The New phytologist.

[43]  B. Mueller‐Roeber,et al.  Genome-Wide Phylogenetic Comparative Analysis of Plant Transcriptional Regulation: A Timeline of Loss, Gain, Expansion, and Correlation with Complexity , 2010, Genome biology and evolution.

[44]  A. Papavassiliou Transcription factors in eukaryotes , 1997 .

[45]  Daniel Rios,et al.  Ensembl 2011 , 2010, Nucleic Acids Res..

[46]  Susana M. Coelho,et al.  Genome structure and metabolic features in the red seaweed Chondrus crispus shed light on evolution of the Archaeplastida , 2013, Proceedings of the National Academy of Sciences.

[47]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[48]  F. Gleason,et al.  Phylogenetic interpretations and ecological potentials of the Mesomycetozoea (Ichthyosporea) , 2013 .

[49]  Klaus O. Kopec,et al.  Genome of Acanthamoeba castellanii highlights extensive lateral gene transfer and early evolution of tyrosine kinase signaling , 2013, Genome Biology.

[50]  M. Cloutier,et al.  Genome-Wide Analysis Reveals Gene Expression and Metabolic Network Dynamics during Embryo Development in Arabidopsis1[W][OA] , 2011, Plant Physiology.

[51]  A. Sebé-Pedrós,et al.  Premetazoan origin of the hippo signaling pathway. , 2012, Cell reports.

[52]  S. Hedges,et al.  Molecular phylogeny and divergence times of deuterostome animals. , 2005, Molecular biology and evolution.

[53]  D. Tautz,et al.  The evolutionary origin of orphan genes , 2011, Nature Reviews Genetics.

[54]  Sarah A. Teichmann,et al.  Genomic repertoires of DNA-binding transcription factors across the tree of life , 2010, Nucleic acids research.

[55]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[56]  B. Lang,et al.  Rooting the eukaryotic tree with mitochondrial and bacterial proteins. , 2012, Molecular biology and evolution.

[57]  Corinne Da Silva,et al.  The Ectocarpus genome and the independent evolution of multicellularity in brown algae , 2010, Nature.

[58]  D Penny,et al.  Genomics and the Irreducible Nature of Eukaryote Cells , 2006, Science.

[59]  S. Brady,et al.  Comprehensive developmental profiles of gene activity in regions and subregions of the Arabidopsis seed , 2013, Proceedings of the National Academy of Sciences.

[60]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[61]  B. S. Baker,et al.  Gene Expression During the Life Cycle of Drosophila melanogaster , 2002, Science.

[62]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[63]  A. Meyer,et al.  The evolutionary significance of ancient genome duplications , 2009, Nature Reviews Genetics.

[64]  Susana M. Coelho,et al.  Life-cycle-generation-specific developmental processes are modified in the immediate upright mutant of the brown alga Ectocarpus siliculosus , 2008, Development.

[65]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[66]  M. Maldonado,et al.  Deep metazoan phylogeny: when different genes tell different stories. , 2013, Molecular phylogenetics and evolution.

[67]  César A. Hidalgo,et al.  Proto-genes and de novo gene birth , 2012, Nature.

[68]  Léon Personnaz,et al.  Enrichment or depletion of a GO category within a class of genes: which test? , 2007, Bioinform..

[69]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[70]  Dave T. Gerrard,et al.  Gene expression divergence recapitulates the developmental hourglass model , 2010, Nature.

[71]  Itai Yanai,et al.  Developmental milestones punctuate gene expression in the Caenorhabditis embryo. , 2012, Developmental cell.

[72]  Randor Radakovits,et al.  Draft genome sequence and genetic transformation of the oleaginous alga Nannochloropis gaditana , 2012, Nature Communications.

[73]  Sarah A. Teichmann,et al.  DBD––taxonomically broad transcription factor predictions: new content and functionality , 2007, Nucleic Acids Res..

[74]  Sarah J. Bourlat,et al.  The evolution of the Ecdysozoa , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.