Transposable elements are the primary source of novelty in primate gene regulation

Gene regulation shapes the evolution of phenotypic diversity. We investigated the evolution of liver promoters and enhancers in six primate species using ChIP-seq (H3K27ac and H3K4me1) to profile cis-regulatory elements (CREs) and using RNA-seq to characterize gene expression in the same individuals. To quantify regulatory divergence, we compared CRE activity across species by testing differential ChIP-seq read depths directly measured for orthologous sequences. We show that the primate regulatory landscape is largely conserved across the lineage, with 63% of the tested human liver CREs showing similar activity across species. Conserved CRE function is associated with sequence conservation, proximity to coding genes, cell-type specificity, and transcription factor binding. Newly evolved CREs are enriched in immune response and neurodevelopmental functions. We further demonstrate that conserved CREs bind master regulators, suggesting that while CREs contribute to species adaptation to the environment, core functions remain intact. Newly evolved CREs are enriched in young transposable elements (TEs), including Long-Terminal-Repeats (LTRs) and SINE-VNTR-Alus (SVAs), that significantly affect gene expression. Conversely, only 16% of conserved CREs overlap TEs. We tested the cis-regulatory activity of 69 TE subfamilies by luciferase reporter assays, spanning all major TE classes, and showed that 95.6% of tested TEs can function as either transcriptional activators or repressors. In conclusion, we demonstrated the critical role of TEs in primate gene regulation and illustrated potential mechanisms underlying evolutionary divergence among the primate species through the noncoding genome.

[1]  Mohammad M. Karimi,et al.  Silencing of transposable elements may not be a major driver of regulatory evolution in primate iPSCs , 2018, eLife.

[2]  D. Odom,et al.  Complexity and conservation of regulatory landscapes underlie evolutionary resilience of mammalian gene expression , 2017, bioRxiv.

[3]  James J. Lewis,et al.  ChIP-Seq-Annotated Heliconius erato Genome Highlights Patterns of cis-Regulatory Evolution in Lepidoptera. , 2016, Cell reports.

[4]  N. A. Rayan,et al.  Massive contribution of transposable elements to mammalian regulatory sequences. , 2016, Seminars in cell & developmental biology.

[5]  C. Laukaitis,et al.  The Role of Retrotransposons in Gene Family Expansions in the Human and Mouse Genomes , 2016, Genome biology and evolution.

[6]  N. Shubin,et al.  Regulatory evolution of Tbx5 and the origin of paired appendages , 2016, Proceedings of the National Academy of Sciences.

[7]  Dustin E. Schones,et al.  Chromatin variation associated with liver metabolism is mediated by transposable elements , 2016, Epigenetics & Chromatin.

[8]  Weidong Tian,et al.  Genome-Wide Identification of Regulatory Sequences Undergoing Accelerated Evolution in the Human Genome , 2016, Molecular biology and evolution.

[9]  C. Meiklejohn,et al.  Sex Chromosome-wide Transcriptional Suppression and Compensatory Cis-Regulatory Evolution Mediate Gene Expression in the Drosophila Male Germline , 2016, PLoS biology.

[10]  R. D. Reed,et al.  Genome editing in butterflies reveals that spalt promotes and Distal-less represses eyespot colour patterns , 2016, Nature Communications.

[11]  D. Page,et al.  Parallel evolution of male germline epigenetic poising and somatic development in animals , 2016, Nature Genetics.

[12]  J. Noonan,et al.  Origin and evolution of developmental enhancers in the mammalian neocortex , 2016, Proceedings of the National Academy of Sciences.

[13]  C. Feschotte,et al.  Regulatory evolution of innate immunity through co-option of endogenous retroviruses , 2016, Science.

[14]  K. Pollard,et al.  Accelerated Evolution of Enhancer Hotspots in the Mammal Ancestor , 2015, Molecular biology and evolution.

[15]  Robert D. Finn,et al.  The Dfam database of repetitive DNA families , 2015, Nucleic Acids Res..

[16]  I. Dubchak,et al.  Functionally conserved enhancers with divergent sequences in distant vertebrates , 2015, BMC Genomics.

[17]  F. Gage,et al.  Enhancer Divergence and cis-Regulatory Evolution in the Human and Chimp Neural Crest , 2015, Cell.

[18]  William Stafford Noble,et al.  Comprehensive identification and analysis of human accelerated regulatory DNA , 2015, Genome research.

[19]  O. Delaneau,et al.  Population Variation and Genetic Control of Modular Chromatin Architecture in Humans , 2015, Cell.

[20]  B. Wertheim Genomic basis of evolutionary change: evolving immunity , 2015, Front. Genet..

[21]  Pietro Liò,et al.  The BioMart community portal: an innovative alternative to large, centralized data repositories , 2015, Nucleic Acids Res..

[22]  M. Clerici,et al.  Evolutionary insights into host–pathogen interactions from mammalian sequence data , 2015, Nature Reviews Genetics.

[23]  Jing Leng,et al.  Evolutionary changes in promoter and enhancer activity during human corticogenesis , 2015, Science.

[24]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[25]  J. T. Erichsen,et al.  Enhancer Evolution across 20 Mammalian Species , 2015, Cell.

[26]  Cédric Feschotte,et al.  Ancient Transposable Elements Transformed the Uterine Regulatory Landscape and Transcriptome during the Evolution of Mammalian Pregnancy , 2015, Cell reports.

[27]  Christopher D. Brown,et al.  Coordinated Regulatory Variation Associated with Gestational Hyperglycemia Regulates Expression of the Novel Hexokinase HKDC1 , 2014, Nature Communications.

[28]  Alessandro Vullo,et al.  The Ensembl REST API: Ensembl Data for Any Language , 2014, Bioinform..

[29]  Yeisoo Yu,et al.  Uncovering the novel characteristics of Asian honey bee, Apis cerana, by whole genome sequencing , 2015, BMC Genomics.

[30]  Zhihai Ma,et al.  Widespread contribution of transposable elements to the innovation of gene regulatory networks , 2014, Genome research.

[31]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[32]  Arnaud Martin,et al.  Wnt signaling underlies evolution and development of the butterfly wing pattern symmetry systems. , 2014, Developmental biology.

[33]  Michael D. Wilson,et al.  Multi-species, multi-transcription factor binding highlights conserved control of tissue-specific biological pathways , 2014, eLife.

[34]  N. A. Rayan,et al.  Noncoding origins of anthropoid traits and a new null model of transposon functionalization , 2014, Genome research.

[35]  Peter J. Bickel,et al.  Comparative analysis of regulatory information and circuits across distant species , 2014, Nature.

[36]  P. Provero,et al.  Genome-wide identification and characterization of fixed human-specific regulatory regions. , 2014, American journal of human genetics.

[37]  B. Lazzaro,et al.  The Genetics of Immunity , 2014, G3: Genes, Genomes, Genetics.

[38]  P. Wittkopp,et al.  Tempo and mode of regulatory evolution in Drosophila , 2014, Genome research.

[39]  Daniel E. Zak,et al.  Systems-level analysis of innate immunity. , 2014, Annual review of immunology.

[40]  Xiang Zhou,et al.  Epigenetic modifications are associated with inter-species gene expression variation in primates , 2014, bioRxiv.

[41]  I. Jamieson,et al.  Episodic Positive Selection in the Evolution of Avian Toll-Like Receptor Innate Immunity Genes , 2014, PloS one.

[42]  D. Reinberg,et al.  Interactions between JARID2 and noncoding RNAs regulate PRC2 recruitment to chromatin. , 2014, Molecular cell.

[43]  B. Wertheim,et al.  Evolution of a Cellular Immune Response in Drosophila: A Phenotypic and Genomic Comparative Analysis , 2014, Genome biology and evolution.

[44]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[45]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[46]  P. Cui,et al.  Dynamic regulation of genome-wide pre-mRNA splicing and stress tolerance by the Sm-like protein LSm5 in Arabidopsis , 2014, Genome Biology.

[47]  Laura E. DeMare,et al.  The Evolution of Lineage-Specific Regulatory Activities in the Human Embryonic Limb , 2013, Cell.

[48]  Keith L. Ligon,et al.  DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape , 2013, Nature Genetics.

[49]  G. Bourque,et al.  The Majority of Primate-Specific Regulatory Sequences Are Derived from Transposable Elements , 2013, PLoS genetics.

[50]  Zev N. Kronenberg,et al.  Transposable Elements Are Major Contributors to the Origin, Diversification, and Regulation of Vertebrate Long Noncoding RNAs , 2013, PLoS genetics.

[51]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[52]  S. Kanba,et al.  A Population-Specific Uncommon Variant in GRIN3A Associated with Schizophrenia , 2013, Biological Psychiatry.

[53]  J. Baker,et al.  Endogenous retroviruses function as species-specific enhancer elements in the placenta , 2013, Nature Genetics.

[54]  Christopher D. Brown,et al.  Integrative Modeling of eQTLs and Cis-Regulatory Elements Suggests Mechanisms Underlying Cell Type Specificity of eQTLs , 2012, PLoS genetics.

[55]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[56]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[57]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[58]  R. I. Hill,et al.  Diversification of complex butterfly wing patterns by repeated regulatory evolution of a Wnt ligand , 2012, Proceedings of the National Academy of Sciences.

[59]  Z. Yakhini,et al.  Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters , 2012, Nature Biotechnology.

[60]  Joseph B Hiatt,et al.  Massively parallel functional dissection of mammalian enhancers in vivo , 2012, Nature Biotechnology.

[61]  T. Mikkelsen,et al.  Rapid dissection and model-based optimization of inducible enhancers in human cells using a massively parallel reporter assay , 2012, Nature biotechnology.

[62]  Michael D. Wilson,et al.  Waves of Retrotransposon Expansion Remodel Genome Organization and CTCF Binding in Multiple Mammalian Lineages , 2012, Cell.

[63]  Vincent J. Lynch,et al.  Transposon-mediated rewiring of gene regulatory networks contributed to the evolution of pregnancy in mammals , 2011, Nature Genetics.

[64]  Genevieve Konopka,et al.  HNF4A is essential for specification of hepatic progenitors from human pluripotent stem cells , 2011, Development.

[65]  Philip Machanick,et al.  MEME-ChIP: motif analysis of large DNA datasets , 2011, Bioinform..

[66]  John C. Marioni,et al.  Gene Expression Differences Among Primates Are Associated With Changes in a Histone Epigenetic Modification , 2011, Genetics.

[67]  S. O’Brien,et al.  A Molecular Phylogeny of Living Primates , 2011, PLoS genetics.

[68]  Cedric E. Ginestet ggplot2: Elegant Graphics for Data Analysis , 2011 .

[69]  H. Kazazian,et al.  SVA retrotransposons: Evolution and genetic instability. , 2010, Seminars in cancer biology.

[70]  G. Bourque,et al.  Transposable elements have rewired the core regulatory network of human embryonic stem cells , 2010, Nature Genetics.

[71]  Aviv Regev,et al.  The Role of Nucleosome Positioning in the Evolution of Gene Regulation , 2010, PLoS biology.

[72]  Michael D. Wilson,et al.  Five-Vertebrate ChIP-seq Reveals the Evolutionary Dynamics of Transcription Factor Binding , 2010, Science.

[73]  Aaron R. Quinlan,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[74]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[75]  Ahsan Huda,et al.  Epigenetic histone modifications of human transposable elements: genome defense versus exaptation , 2010, Mobile DNA.

[76]  D. Freyssenet,et al.  A New Role for Sterol Regulatory Element Binding Protein 1 Transcription Factors in the Regulation of Muscle Mass and Muscle Cell Differentiation , 2009, Molecular and Cellular Biology.

[77]  D. Mager,et al.  Endogenous retroviral LTRs as promoters for human genes: a critical assessment. , 2009, Gene.

[78]  Jacob D. Jaffe,et al.  ZBED6, a Novel Transcription Factor Derived from a Domesticated DNA Transposon Regulates IGF2 Expression and Muscle Growth , 2009, PLoS biology.

[79]  Esther Bae,et al.  Functional Evolution of cis-Regulatory Modules at a Homeotic Gene in Drosophila , 2009, PLoS genetics.

[80]  Jan Komorowski,et al.  Molecular interactions between HNF4a, FOXA2 and GABP identified at regulatory DNA elements through ChIP-sequencing , 2009, Nucleic acids research.

[81]  J. Welch,et al.  Explorer Quantifying Adaptive Evolution in the Drosophila Immune System , 2017 .

[82]  E. Schadt Molecular networks as sensors and drivers of common human diseases , 2009, Nature.

[83]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[84]  E. Liu,et al.  Evolution of the mammalian transcription factor binding repertoire via transposable elements. , 2008, Genome research.

[85]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[86]  A. Visel,et al.  Response to Comment on "Human-Specific Gain of Function in a Developmental Enhancer" , 2009, Science.

[87]  N. Saitou,et al.  Possible involvement of SINEs in mammalian-specific brain formation , 2008, Proceedings of the National Academy of Sciences.

[88]  Christopher D. Brown,et al.  Qualifying the relationship between sequence conservation and molecular function. , 2008, Genome research.

[89]  A. Rolink,et al.  PAX5/TEL acts as a transcriptional repressor causing down-modulation of CD19, enhances migration to CXCL12, and confers survival advantage in pre-BI cells. , 2008, Cancer research.

[90]  Jonathan E. Allen,et al.  Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments , 2007, Genome Biology.

[91]  Timothy B Sackton,et al.  Mutations in smooth muscle α-actin (ACTA2) lead to thoracic aortic aneurysms and dissections , 2007, Nature Genetics.

[92]  D. Haussler,et al.  Species-specific endogenous retroviruses shape the transcriptional network of the human tumor suppressor protein p53 , 2007, Proceedings of the National Academy of Sciences.

[93]  Christopher D. Brown,et al.  Supporting Online Material Materials and Methods Figs. S1 to S6 Table S1 References Functional Architecture and Evolution of Transcriptional Elements That Drive Gene Coexpression , 2022 .

[94]  G. Wray The evolutionary significance of cis-regulatory mutations , 2007, Nature Reviews Genetics.

[95]  E. Davidson The Regulatory Genome: Gene Regulatory Networks In Development And Evolution , 2006 .

[96]  D. Haussler,et al.  A distal enhancer and an ultraconserved exon are derived from a novel retroposon , 2006, Nature.

[97]  Eric H. Davidson,et al.  Gene Regulatory Networks for Development: What They Are, How They Work, and What They Mean , 2006 .

[98]  E. Davidson Genomic Regulatory Systems: Development and Evolution , 2005 .

[99]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[100]  Bruce T Lahn,et al.  Positive selection on the human genome. , 2004, Human molecular genetics.

[101]  J. Baert,et al.  FEV acts as a transcriptional repressor through its DNA-binding ETS domain and alanine-rich domain , 2003, Oncogene.

[102]  P. Tucker,et al.  Tests for Positive Selection on Immune and Reproductive Genes in Closely Related Species of the Murine Genus Mus , 2003, Journal of Molecular Evolution.

[103]  G. Glazko,et al.  Origin of a substantial fraction of human regulatory sequences from transposable elements. , 2003, Trends in genetics : TIG.

[104]  B. Mcclintock,et al.  The significance of responses of the genome to challenge. , 1984, Science.

[105]  R. Britten,et al.  Regulation of gene expression: possible role of repetitive sequences. , 1979, Science.

[106]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[107]  R. Britten,et al.  Gene regulation for higher cells: a theory. , 1969, Science.

[108]  B. Mcclintock The origin and behavior of mutable loci in maize , 1950, Proceedings of the National Academy of Sciences.