Widespread contribution of transposable elements to the innovation of gene regulatory networks

Transposable elements (TEs) have been shown to contain functional binding sites for certain transcription factors (TFs). However, the extent to which TEs contribute to the evolution of TF binding sites is not well known. We comprehensively mapped binding sites for 26 pairs of orthologous TFs in two pairs of human and mouse cell lines (representing two cell lineages), along with epigenomic profiles, including DNA methylation and six histone modifications. Overall, we found that 20% of binding sites were embedded within TEs. This number varied across different TFs, ranging from 2% to 40%. We further identified 710 TF–TE relationships in which genomic copies of a TE subfamily contributed a significant number of binding peaks for a TF, and we found that LTR elements dominated these relationships in human. Importantly, TE-derived binding peaks were strongly associated with open and active chromatin signatures, including reduced DNA methylation and increased enhancer-associated histone marks. On average, 66% of TE-derived binding events were cell type-specific with a cell type-specific epigenetic landscape. Most of the binding sites contributed by TEs were species-specific, but we also identified binding sites conserved between human and mouse, the functional relevance of which was supported by a signature of purifying selection on DNA sequences of these TEs. Interestingly, several TFs had significantly expanded binding site landscapes only in one species, which were linked to species-specific gene functions, suggesting that TEs are an important driving force for regulatory innovation. Taken together, our data suggest that TEs have significantly and continuously shaped gene regulatory networks during mammalian evolution.

[1]  Ross C Hardison,et al.  Divergent functions of hematopoietic transcription factors in lineage priming and differentiation during erythro-megakaryopoiesis , 2014, Genome research.

[2]  Z. Weng,et al.  Principles of regulatory information conservation between mouse and human , 2014, Nature.

[3]  Shane J. Neph,et al.  A comparative encyclopedia of DNA elements in the mouse genome , 2014, Nature.

[4]  Richard A. Moore,et al.  Functional DNA methylation differences between tissues, cell types, and across individuals discovered using the M&M algorithm , 2013, Genome research.

[5]  Keith L. Ligon,et al.  DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape , 2013, Nature Genetics.

[6]  G. Bourque,et al.  The Majority of Primate-Specific Regulatory Sequences Are Derived from Transposable Elements , 2013, PLoS genetics.

[7]  Jeffrey B. Cheng,et al.  Estimating absolute methylation levels at single-CpG resolution from methylation enrichment and restriction enzyme sequencing methods , 2013, RECOMB.

[8]  Zev N. Kronenberg,et al.  Transposable Elements Are Major Contributors to the Origin, Diversification, and Regulation of Vertebrate Long Noncoding RNAs , 2013, PLoS genetics.

[9]  D. Odom,et al.  CTCF and Cohesin: Linking Gene Regulatory Elements with Their Targets , 2013, Cell.

[10]  M. Rubinstein,et al.  Exaptation of Transposable Elements into Novel Cis-Regulatory Elements: Is the Evidence Always Strong? , 2013, Molecular biology and evolution.

[11]  J. Baker,et al.  Endogenous retroviruses function as species-specific enhancer elements in the placenta , 2013, Nature Genetics.

[12]  D. Mager,et al.  Transposable elements: an abundant and natural source of regulatory sequences for host genes. , 2012, Annual review of genetics.

[13]  D. Mager,et al.  Epigenetic interplay between mouse endogenous retroviruses and host genes , 2012, Genome Biology.

[14]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[15]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[16]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[17]  Hana Kim,et al.  Retrotransposons as a major source of epigenetic variations in the mammalian genome , 2012, Epigenetics.

[18]  C. Feschotte,et al.  Endogenous viruses: insights into viral evolution and impact on host biology , 2012, Nature Reviews Genetics.

[19]  Michael D. Wilson,et al.  Waves of Retrotransposon Expansion Remodel Genome Organization and CTCF Binding in Multiple Mammalian Lineages , 2012, Cell.

[20]  M. Batzer,et al.  Repetitive Elements May Comprise Over Two-Thirds of the Human Genome , 2011, PLoS genetics.

[21]  Vincent J. Lynch,et al.  Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals , 2011, Nature Genetics.

[22]  D. Higgins,et al.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.

[23]  Keith R. Oliver,et al.  Mobile DNA and the TE-Thrust hypothesis: supporting evidence from the primates , 2011, Mobile DNA.

[24]  Herbert Schulz,et al.  RAD21 Cooperates with Pluripotency Transcription Factors in the Maintenance of Embryonic Stem Cell Identity , 2011, PloS one.

[25]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[26]  Allen D. Delaney,et al.  Conserved Role of Intragenic DNA Methylation in Regulating Alternative Promoters , 2010, Nature.

[27]  Ali Eroglu,et al.  Long-range function of an intergenic retrotransposon , 2010, Proceedings of the National Academy of Sciences.

[28]  G. Bourque,et al.  Transposable elements have rewired the core regulatory network of human embryonic stem cells , 2010, Nature Genetics.

[29]  Chad A. Cowan,et al.  Rewirable gene regulatory networks in the preimplantation embryonic development of three mammalian species. , 2010, Genome research.

[30]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[31]  Cory Y. McLean,et al.  GREAT improves functional interpretation of cis-regulatory regions , 2010, Nature Biotechnology.

[32]  M. Gerstein,et al.  Variation in Transcription Factor Binding Among Humans , 2010, Science.

[33]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[34]  D. Mager,et al.  Endogenous retroviral LTRs as promoters for human genes: a critical assessment. , 2009, Gene.

[35]  Guillaume Bourque,et al.  Transposable elements in gene regulation and in the evolution of vertebrate genomes. , 2009, Current opinion in genetics & development.

[36]  V. Corces,et al.  CTCF: Master Weaver of the Genome , 2009, Cell.

[37]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[38]  A. Visel,et al.  ChIP-seq accurately predicts tissue-specific activity of enhancers , 2009, Nature.

[39]  E. Liu,et al.  Evolution of the mammalian transcription factor binding repertoire via transposable elements. , 2008, Genome research.

[40]  P. Park,et al.  Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.

[41]  D. Landsman,et al.  Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA , 2008, BMC Genomics.

[42]  C. Feschotte Transposable elements and the evolution of regulatory networks , 2008, Nature Reviews Genetics.

[43]  N. Saitou,et al.  Possible involvement of SINEs in mammalian-specific brain formation , 2008, Proceedings of the National Academy of Sciences.

[44]  H. Aburatani,et al.  Cohesin mediates transcriptional insulation by CCCTC-binding factor , 2008, Nature.

[45]  P. Fernández-Salguero,et al.  Genome-wide B1 retrotransposon binds the transcription factors dioxin receptor and Slug and regulates gene expression in vivo , 2008, Proceedings of the National Academy of Sciences.

[46]  D. Haussler,et al.  Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53 , 2007, Proceedings of the National Academy of Sciences.

[47]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[48]  R. Martienssen,et al.  Transposable elements and the epigenetic regulation of the genome , 2007, Nature Reviews Genetics.

[49]  D. Haussler,et al.  A distal enhancer and an ultraconserved exon are derived from a novel retroposon , 2006, Nature.

[50]  Apoorva Mandavilli,et al.  Of mice and men , 2006, Nature Medicine.

[51]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[52]  J. Jurka,et al.  Repbase Update, a database of eukaryotic repetitive elements , 2005, Cytogenetic and Genome Research.

[53]  Louise Fairall,et al.  How the human telomeric proteins TRF1 and TRF2 recognize telomeric DNA: a view from high‐resolution crystal structures , 2005, EMBO reports.

[54]  Doree Sitkoff,et al.  models homology modeling : From sequence alignments to structural A comparative study of available software for high-accuracy , 2005 .

[55]  C. Hughes,et al.  Of Mice and Not Men: Differences between Mouse and Human Immunology , 2004, The Journal of Immunology.

[56]  Dixie L Mager,et al.  An endogenous retroviral long terminal repeat is the dominant promoter for human β1,3-galactosyltransferase 5 in the colon , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[57]  G. Glazko,et al.  Origin of a substantial fraction of human regulatory sequences from transposable elements. , 2003, Trends in genetics : TIG.

[58]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[59]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[60]  S. LeGrice,et al.  Long Terminal Repeats , 2002 .

[61]  A. Bird DNA methylation patterns and epigenetic memory. , 2002, Genes & development.

[62]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[63]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[64]  J. Landry,et al.  Long Terminal Repeats Are Used as Alternative Promoters for the Endothelin B Receptor and Apolipoprotein C-I Genes in Humans* , 2001, The Journal of Biological Chemistry.

[65]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[66]  Stephen M. Mount,et al.  The genome sequence of Drosophila melanogaster. , 2000, Science.

[67]  David I. K. Martin,et al.  Epigenetic inheritance at the agouti locus in the mouse , 1999, Nature Genetics.

[68]  S. Hainsworth,et al.  A CRITICAL ASSESSMENT , 2014 .

[69]  B. Ames,et al.  Oxidants, antioxidants, and the degenerative diseases of aging. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[70]  M. Meisler,et al.  Retroviral and pseudogene insertion sites reveal the lineage of human salivary and pancreatic amylase genes from a single gene during primate evolution , 1990, Molecular and cellular biology.

[71]  W. Doolittle,et al.  Selfish genes, the phenotype paradigm and genome evolution , 1980, Nature.

[72]  R. Britten,et al.  Gene regulation for higher cells: a theory. , 1969, Science.

[73]  B. Mcclintock,et al.  Controlling elements and the gene. , 1956, Cold Spring Harbor symposia on quantitative biology.

[74]  B. Mcclintock The origin and behavior of mutable loci in maize , 1950, Proceedings of the National Academy of Sciences.

[75]  R. Browne,et al.  A comparative. , 1950, The British journal of ophthalmology.