Transcription Factor Families Have Much Higher Expansion Rates in Plants than in Animals1

Transcription factors (TFs), which are central to the regulation of gene expression, are usually members of multigene families. In plants, they are involved in diverse processes such as developmental control and elicitation of defense and stress responses. To investigate if differences exist in the expansion patterns of TF gene families between plants and other eukaryotes, we first used Arabidopsis (Arabidopsis thaliana) TFs to identify TF DNA-binding domains. These DNA-binding domains were then used to identify related sequences in 25 other eukaryotic genomes. Interestingly, among 19 families that are shared between animals and plants, more than 14 are larger in plants than in animals. After examining the lineage-specific expansion of TF families in two plants, eight animals, and two fungi, we found that TF families shared among these organisms have undergone much more dramatic expansion in plants than in other eukaryotes. Moreover, this elevated expansion rate of plant TF is not simply due to higher duplication rates of plant genomes but also to a higher degree of expansion compared to other plant genes. Further, in many Arabidopsis-rice (Oryza sativa) TF orthologous groups, the degree of lineage-specific expansion in Arabidopsis is correlated with that in rice. This pattern of parallel expansion is much more pronounced than the whole-genome trend in rice and Arabidopsis. The high rate of expansion among plant TF genes and their propensity for parallel expansion suggest frequent adaptive responses to selection pressure common among higher plants.

[1]  P. Huijser,et al.  A new family of DNA binding proteins includes putative transcriptional regulators of theAntirrhinum majus floral meristem identity geneSQUAMOSA , 1996, Molecular and General Genetics MGG.

[2]  Cathal Seoighe,et al.  Genome duplication led to highly selective expansion of the Arabidopsis thaliana proteome. , 2004, Trends in genetics : TIG.

[3]  Guillaume Blanc,et al.  Functional Divergence of Duplicated Genes Formed by Polyploidy during Arabidopsis Evolution , 2004, The Plant Cell Online.

[4]  Klaus F. X. Mayer,et al.  Comparative Analysis of the Receptor-Like Kinase Family in Arabidopsis and Rice , 2004, The Plant Cell Online.

[5]  K. Skriver,et al.  Structure of the conserved domain of ANAC, a member of the NAC family of transcription factors , 2004, EMBO reports.

[6]  O. D. C. E. Silva CG-1, a parsley light-induced DNA-binding protein , 1994, Plant Molecular Biology.

[7]  Jonathan F. Wendel,et al.  Genome evolution in polyploids , 2004, Plant Molecular Biology.

[8]  Wen-Hsiung Li,et al.  Dating the Monocot–Dicot Divergence and the Origin of Core Eudicots Using Whole Chloroplast Genomes , 2004, Journal of Molecular Evolution.

[9]  Richard W. Lusk,et al.  Organismal complexity, protein complexity, and gene duplicability , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  R. Durbin,et al.  The Genome Sequence of Caenorhabditis briggsae: A Platform for Comparative Genomics , 2003, PLoS biology.

[11]  Matthew W. Hahn,et al.  The evolution of transcriptional regulation in eukaryotes. , 2003, Molecular biology and evolution.

[12]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[13]  R. Tjian,et al.  Transcription regulation and animal diversity , 2003, Nature.

[14]  Ramana V. Davuluri,et al.  AGRIS: Arabidopsis Gene Regulatory Information Server, an information resource of Arabidopsis cis-regulatory elements and transcription factors , 2003, BMC Bioinformatics.

[15]  A. Hughes,et al.  Parallel evolution by gene duplication in the genomes of two unicellular fungi. , 2003, Genome research.

[16]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[17]  Peer Bork,et al.  Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster , 2002, Science.

[18]  M. Miles,et al.  An insect molecular clock dates the origin of the insects and accords with palaeontological and biogeographic landmarks. , 2002, Molecular biology and evolution.

[19]  Huanming Yang,et al.  A Draft Sequence of the Rice Genome (Oryza sativa L. ssp. japonica) , 2002, Science.

[20]  A. Oliphant,et al.  A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). , 2002, Science.

[21]  Y Van de Peer,et al.  Comparative genomics provides evidence for an ancient genome duplication event in fish. , 2001, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[22]  H. Pan,et al.  Regions of microsynteny in Magnaporthe grisea and Neurospora crassa. , 2001, Fungal genetics and biology : FG & B.

[23]  R. R. Samaha,et al.  Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. , 2000, Science.

[24]  D. G. Brown,et al.  The origins of genomic duplications in Arabidopsis. , 2000, Science.

[25]  M. Delseny,et al.  Extensive Duplication and Reshuffling in the Arabidopsis Genome , 2000, Plant Cell.

[26]  S. Carroll Endless Forms The Evolution of Gene Regulation and Morphological Diversity , 2000, Cell.

[27]  S. Dongen Graph clustering by flow simulation , 2000 .

[28]  T. Eulgem,et al.  The WRKY superfamily of plant transcription factors. , 2000, Trends in plant science.

[29]  E. Knaap,et al.  A novel gibberellin-induced gene from rice and its potential regulatory role in stem growth. , 2000, Plant physiology.

[30]  J. Bowman,et al.  The YABBY gene family and abaxial cell fate. , 2000, Current opinion in plant biology.

[31]  Peer Bork,et al.  SMART: a web-based tool for the study of genetically mobile domains , 2000, Nucleic Acids Res..

[32]  Leif Schauser,et al.  A plant regulator controlling development of symbiotic root nodules , 1999, Nature.

[33]  P. Benfey,et al.  The GRAS gene family in Arabidopsis: sequence characterization and basic expression analysis of the SCARECROW-LIKE genes. , 1999, The Plant journal : for cell and molecular biology.

[34]  E. Coen,et al.  The TCP domain: a motif found in proteins regulating plant growth and development. , 1999, The Plant journal : for cell and molecular biology.

[35]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[36]  E. Fraenkel,et al.  Structural basis of DNA recognition by the heterodimeric cell cycle transcription factor E2F-DP. , 1999, Genes & development.

[37]  J. Ecker,et al.  Nuclear events in ethylene signaling: a transcriptional cascade mediated by ETHYLENE-INSENSITIVE3 and ETHYLENE-RESPONSE-FACTOR1. , 1998, Genes & development.

[38]  R. Wickner,et al.  Mak21p of Saccharomyces cerevisiae, a Homolog of Human CAATT-binding Protein, Is Essential for 60 S Ribosomal Subunit Biogenesis* , 1998, The Journal of Biological Chemistry.

[39]  M. Esaka,et al.  Functional analyses of the Dof domain, a zinc finger DNA‐binding domain, in a pumpkin DNA‐binding protein AOBP , 1998, FEBS letters.

[40]  J. Doebley,et al.  Transcriptional Regulators and the Evolution of Plant Form , 1998, Plant Cell.

[41]  K. Borden RING fingers and B-boxes: zinc-binding protein-protein interaction domains. , 1998, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[42]  Sean R. Eddy,et al.  Pfam: multiple sequence alignments and HMM-profiles of protein domains , 1998, Nucleic Acids Res..

[43]  G. Hagen,et al.  ARF1, a transcription factor that binds to auxin response elements. , 1997, Science.

[44]  C. Kao,et al.  The conserved B3 domain of VIVIPAROUS1 has a cooperative DNA binding activity. , 1997, The Plant cell.

[45]  P. Freemont,et al.  The RING finger domain: a recent example of a sequence-structure family. , 1996, Current opinion in structural biology.

[46]  O. Hobert,et al.  Interaction of Vav with ENX-1, a putative transcriptional regulator of homeobox gene expression , 1996, Molecular and cellular biology.

[47]  R. Reisz,et al.  Archerpeton anthracos from the Joggins Formation of Nova Scotia: a microsaur, not a reptile , 1996 .

[48]  C. Ebeling,et al.  Identification and Characterization of the Mouse Obesity Gene tubby: A Member of a Novel Gene Family , 1996, Cell.

[49]  R. Scheuermann,et al.  The immunoglobulin heavy-chain matrix-associating regions are bound by Bright: a B cell-specific trans-activator that describes a new DNA-binding protein family. , 1995, Genes & development.

[50]  Song Tan,et al.  Structure of serum response factor core bound to DNA , 1995, Nature.

[51]  D. Weigel The APETALA2 domain is related to a novel type of DNA binding domain. , 1995, The Plant cell.

[52]  T. Gibson,et al.  The PHD finger: implications for chromatin-mediated transcriptional regulation. , 1995, Trends in biochemical sciences.

[53]  M. Ohme-Takagi,et al.  Ethylene-inducible DNA binding proteins that interact with an ethylene-responsive element. , 1995, The Plant cell.

[54]  Littlewood Td,et al.  Transcription factors 2: helix-loop-helix. , 1995, Protein profile.

[55]  G. Evan,et al.  Transcription factors 2: helix-loop-helix. , 1995, Protein profile.

[56]  A M Gronenborn,et al.  NMR structure of a specific DNA complex of Zn-containing DNA binding domain of GATA-1. , 1993, Science.

[57]  D. Weigel,et al.  LEAFY controls floral meristem identity in Arabidopsis , 1992, Cell.

[58]  C. Benoist,et al.  Evolutionary variation of the CCAAT-binding transcription factor NF-Y. , 1992, Nucleic acids research.

[59]  M. Nissen,et al.  The A.T-DNA-binding domain of mammalian high mobility group I chromosomal proteins. A novel peptide motif for recognizing DNA structure. , 1990, The Journal of biological chemistry.

[60]  S. Kuhara,et al.  Domains of the SFL1 protein of yeasts are homologous to Myc oncoproteins or yeast heat-shock transcription factor. , 1989, Gene.

[61]  M. Scott,et al.  The structure and function of the homeodomain. , 1989, Biochimica et biophysica acta.

[62]  S. McKnight,et al.  The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins. , 1988, Science.

[63]  A. E. Sippel,et al.  The highly conserved amino‐terminal region of the protein encoded by the v‐myb oncogene functions as a DNA‐binding domain. , 1987, The EMBO journal.

[64]  A Klug,et al.  Zinc fingers: a novel protein fold for nucleic acid recognition. , 1987, Cold Spring Harbor symposia on quantitative biology.