Five-Vertebrate ChIP-seq Reveals the Evolutionary Dynamics of Transcription Factor Binding

Subtle Variation Despite vast phenotypic differences, vertebrates have many readily recognizable specific cell types, like liver hepatocytes. The gene expression that defines specific cells depends on evolutionarily conserved orthologous transcription factors. Schmidt et al. (p. 1036, published online 8 April) studied the conservation and divergence in the genome-wide binding of two such transcription factors, CEBPA and HNF4A, in livers from human, dog, mouse, short-tailed opossum, and chicken. Although the sequence bound by orthologous transcription factors was similar, the vast majority of binding events were unique to each species. Binding of two liver-specific transcription factors in several vertebrate species reveals complex regulatory evolution. Transcription factors (TFs) direct gene expression by binding to DNA regulatory regions. To explore the evolution of gene regulation, we used chromatin immunoprecipitation with high-throughput sequencing (ChIP-seq) to determine experimentally the genome-wide occupancy of two TFs, CCAAT/enhancer-binding protein alpha and hepatocyte nuclear factor 4 alpha, in the livers of five vertebrates. Although each TF displays highly conserved DNA binding preferences, most binding is species-specific, and aligned binding events present in all five species are rare. Regions near genes with expression levels that are dependent on a TF are often bound by the TF in multiple species yet show no enhanced DNA sequence constraint. Binding divergence between species can be largely explained by sequence changes to the bound motifs. Among the binding events lost in one lineage, only half are recovered by another binding event within 10 kilobases. Our results reveal large interspecies differences in transcriptional regulation and provide insight into regulatory evolution.

[1]  F. Gonzalez,et al.  Disruption of the c/ebp alpha gene in adult mouse liver , 1997, Molecular and cellular biology.

[2]  A. Clark,et al.  Evolution of transcription factor binding sites in Mammalian gene regulatory regions: conservation and turnover. , 2002, Molecular biology and evolution.

[3]  Jon D. McAuliffe,et al.  Phylogenetic Shadowing of Primate Sequences to Find Functional Regions of the Human Genome , 2003, Science.

[4]  A. Sandelin,et al.  Identification of conserved regulatory elements by comparative genome analysis , 2003, Journal of biology.

[5]  D. Haussler,et al.  Ultraconserved Elements in the Human Genome , 2004, Science.

[6]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[7]  Jean L. Chang,et al.  An initial strategy for the systematic identification of functional elements in the human genome by low-redundancy comparative sequencing. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[8]  I. Talianidis,et al.  Mitogen-Activated Protein Kinase-Mediated Disruption of Enhancer-Promoter Communication Inhibits Hepatocyte Nuclear Factor 4α Expression , 2006, Molecular and Cellular Biology.

[9]  F. Robert,et al.  Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression , 2006 .

[10]  Francesca Chiaromonte,et al.  ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements. , 2006, Genome research.

[11]  D. Gifford,et al.  Tissue-specific transcriptional regulation has diverged significantly between human and mouse , 2007, Nature Genetics.

[12]  Daniel J. Blankenberg,et al.  28-way vertebrate alignment and conservation track in the UCSC Genome Browser. , 2007, Genome research.

[13]  T. Mikkelsen,et al.  Systematic discovery of regulatory motifs in conserved regions of the human genome, including thousands of CTCF insulator sites , 2007, Proceedings of the National Academy of Sciences.

[14]  Magdalena I. Swanson,et al.  PAZAR: a framework for collection and dissemination of cis-regulatory sequence annotation , 2007, Genome Biology.

[15]  Colin N. Dewey,et al.  Analyses of deep mammalian sequence alignments and constraint predictions for 1% of the human genome. , 2007, Genome research.

[16]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[17]  Bronwen L. Aken,et al.  Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences , 2007, Nature.

[18]  Mark Gerstein,et al.  Divergence of transcription factor binding sites across related yeast species. , 2007, Science.

[19]  G. Wray The evolutionary significance of cis-regulatory mutations , 2007, Nature Reviews Genetics.

[20]  Michael D. Wilson,et al.  Species-Specific Transcription in Mice Carrying Human Chromosome 21 , 2008, Science.

[21]  James B. Brown,et al.  Developmental roles of 21 Drosophila transcription factors are determined by quantitative differences in binding to an overlapping set of thousands of genomic regions , 2009, Genome Biology.

[22]  Roderic Guigo,et al.  Functional Targets of the Monogenic Diabetes Transcription Factors HNF-1α and HNF-4α Are Highly Conserved Between Mice and Humans , 2009, Diabetes.

[23]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[24]  Juan M. Vaquerizas,et al.  A census of human transcription factors: function, expression and evolution , 2009, Nature Reviews Genetics.

[25]  Esther T. Chan,et al.  Conservation of core gene expression in vertebrate tissues , 2009, Journal of biology.

[26]  Daniel E. Newburger,et al.  Diversity and Complexity in DNA Recognition by Transcription Factors , 2009, Science.

[27]  Alexander J. Hartemink,et al.  Finding regulatory DNA motifs using alignment-free evolutionary conservation information , 2010, Nucleic acids research.