RNA polymerase mapping in plants identifies intergenic regulatory elements enriched in causal variants

Abstract Control of gene expression is fundamental at every level of cell function. Promoter-proximal pausing and divergent transcription at promoters and enhancers, which are prominent features in animals, have only been studied in a handful of research experiments in plants. PRO-Seq analysis in cassava (Manihot esculenta) identified peaks of transcriptionally engaged RNA polymerase at both the 5′ and 3′ end of genes, consistent with paused or slowly moving Polymerase. In addition, we identified divergent transcription at intergenic sites. A full genome search for bi-directional transcription using an algorithm for enhancer detection developed in mammals (dREG) identified many intergenic regulatory element (IRE) candidates. These sites showed distinct patterns of methylation and nucleotide conservation based on genomic evolutionary rate profiling (GERP). SNPs within these IRE candidates explained significantly more variation in fitness and root composition than SNPs in chromosomal segments randomly ascertained from the same intergenic distribution, strongly suggesting a functional importance of these sites. Maize GRO-Seq data showed RNA polymerase occupancy at IREs consistent with patterns in cassava. Furthermore, these IREs in maize significantly overlapped with sites previously identified on the basis of open chromatin, histone marks, and methylation, and were enriched for reported eQTL. Our results suggest that bidirectional transcription can identify intergenic genomic regions in plants that play an important role in transcription regulation and whose identification has the potential to aid crop improvement.

[1]  A. Sandelin,et al.  Characterization of Arabidopsis thaliana Promoter Bidirectionality and Antisense RNAs by Inactivation of Nuclear RNA Decay Pathways , 2020, Plant Cell.

[2]  A. Siepel,et al.  An inferred fitness consequence map of the rice genome , 2020, Nature Plants.

[3]  Robert J. Schmitz,et al.  The prevalence, evolution and chromatin signatures of plant regulatory elements , 2019, Nature Plants.

[4]  Robert J. Schmitz,et al.  Widespread Long-range Cis-Regulatory Elements in the Maize Genome , 2019, Nature Plants.

[5]  Jiafu Zhu,et al.  RNA polymerase II activity revealed by GRO-seq and pNET-seq in Arabidopsis , 2018, Nature Plants.

[6]  R. Andersson,et al.  Transcription start site analysis reveals widespread divergent transcription in D. melanogaster and core promoter-encoded enhancer activities , 2017, bioRxiv.

[7]  Peter J. Bradbury,et al.  Dysregulation of expression correlates with rare-allele burden and fitness loss in maize , 2018, Nature.

[8]  Gregory T. Booth,et al.  Cdk9 regulates a promoter-proximal checkpoint to modulate RNA polymerase II elongation rate in fission yeast , 2018, Nature Communications.

[9]  Y. Qi,et al.  Arabidopsis ARGONAUTE 1 Binds Chromatin to Promote Gene Transcription in Response to Hormones and Stresses. , 2017, Developmental cell.

[10]  Huan Meng,et al.  Emerging roles of transcriptional enhancers in chromatin looping and promoter-proximal pausing of RNA polymerase II , 2017, The Journal of Biological Chemistry.

[11]  R. Lozano,et al.  Leveraging Transcriptomics Data for Genomic Prediction Models in Cassava , 2017, bioRxiv.

[12]  Nathan M. Springer,et al.  Genome-wide mapping of transcriptional enhancer candidates using DNA and chromatin features in maize , 2017, Genome Biology.

[13]  E. Buckler,et al.  Cassava haplotype map highlights fixation of deleterious mutations during clonal propagation , 2017, Nature Genetics.

[14]  D. Akdemir,et al.  Accuracies of univariate and multivariate genomic prediction models in African cassava , 2017, bioRxiv.

[15]  Jianbing Yan,et al.  Distant eQTLs and Non-coding Sequences Play Critical Roles in Regulating Gene Expression and Quantitative Trait Variation in Maize. , 2017, Molecular plant.

[16]  D. P. Carpio,et al.  Prospects for Genomic Selection in Cassava Breeding , 2017, bioRxiv.

[17]  J. Wysocka,et al.  Ever-Changing Landscapes: Transcriptional Enhancers in Development and Evolution , 2016, Cell.

[18]  Rurika Oka,et al.  Plant Enhancers: A Call for Discovery. , 2016, Trends in plant science.

[19]  C. Benner,et al.  Nascent RNA sequencing reveals distinct features in plant transcription , 2016, Proceedings of the National Academy of Sciences.

[20]  Doug Speed,et al.  Re-evaluation of SNP heritability in complex human traits , 2016, Nature Genetics.

[21]  Marnin D. Wolfe,et al.  Marker-Based Estimates Reveal Significant Nonadditive Effects in Clonally Propagated Cassava (Manihot esculenta): Implications for the Prediction of Total Genetic Value and the Selection of Varieties , 2016, G3: Genes, Genomes, Genetics.

[22]  Leighton J. Core,et al.  Base-pair-resolution genome-wide mapping of active RNA polymerases using precision nuclear run-on (PRO-seq) , 2016, Nature Protocols.

[23]  D. P. Carpio,et al.  Genome‐Wide Association and Prediction Reveals Genetic Architecture of Cassava Mosaic Disease Resistance and Prospects for Rapid Genetic Improvement , 2016, The plant genome.

[24]  Daniel L. Vera,et al.  Open chromatin reveals the functional maize genome , 2016, Proceedings of the National Academy of Sciences.

[25]  Marnin D. Wolfe,et al.  Marker-based estimates reveal significant non-additive effects in clonally propagated cassava (Manihot esculenta): implications for the prediction of total genetic value and the selection of varieties , 2015, bioRxiv.

[26]  N. Hannett,et al.  Transcription factor trapping by RNA in gene regulatory elements , 2015, Science.

[27]  S. Jacobsen,et al.  CG gene body DNA methylation changes and evolution of duplicated genes in cassava , 2015, Proceedings of the National Academy of Sciences.

[28]  Bao Liu,et al.  Genome-Wide Prediction and Validation of Intergenic Enhancers in Arabidopsis Using Open Chromatin Signatures[OPEN] , 2015, Plant Cell.

[29]  R. Shiekhattar,et al.  Architectural and Functional Commonalities between Enhancers and Promoters , 2015, Cell.

[30]  Ashley R. Woodfin,et al.  PAF1, a Molecular Regulator of Promoter-Proximal Pausing by RNA Polymerase II , 2015, Cell.

[31]  D. Fargo,et al.  Pausing of RNA polymerase II regulates mammalian developmental potential through control of signaling networks. , 2015, Molecular cell.

[32]  Stephanie L. Hyland,et al.  Identification of active transcriptional regulatory elements with GRO-seq , 2015, Nature Methods.

[33]  K. Erhard,et al.  Nascent Transcription Affected by RNA Polymerase IV in Zea mays , 2015, Genetics.

[34]  J. T. Erichsen,et al.  Enhancer Evolution across 20 Mammalian Species , 2015, Cell.

[35]  André L. Martins,et al.  Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers , 2014, Nature Genetics.

[36]  Han Xu,et al.  Partitioning heritability of regulatory and cell-type-specific variants across 11 common diseases. , 2014, American journal of human genetics.

[37]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[38]  A. Quinlan BEDTools: The Swiss‐Army Tool for Genome Feature Analysis , 2014, Current protocols in bioinformatics.

[39]  Jing Wang,et al.  CrossMap: a versatile tool for coordinate conversion between genome assemblies , 2014, Bioinform..

[40]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[41]  Robert J. Elshire,et al.  TASSEL-GBS: A High Capacity Genotyping by Sequencing Analysis Pipeline , 2014, PloS one.

[42]  C. Glass,et al.  Functional roles of enhancer RNAs for oestrogen-dependent transcriptional activation , 2013, Nature.

[43]  B. Jennings,et al.  Pausing for thought: Disrupting the early transcription elongation checkpoint leads to developmental defects and tumourigenesis , 2013, BioEssays : news and reviews in molecular, cellular and developmental biology.

[44]  Leighton J. Core,et al.  Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing , 2013, Science.

[45]  Albert E. Almada,et al.  Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells , 2013, Proceedings of the National Academy of Sciences.

[46]  John T. Lis,et al.  Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans , 2012, Nature Reviews Genetics.

[47]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[48]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[49]  J. Marchini,et al.  Genotype Imputation with Thousands of Genomes , 2011, G3: Genes | Genomes | Genetics.

[50]  W. G. Hill,et al.  Genome partitioning of genetic variation for complex traits using common SNPs , 2011, Nature Genetics.

[51]  Robert J. Elshire,et al.  A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species , 2011, PloS one.

[52]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[53]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[54]  G. Kreiman,et al.  Widespread transcription at neuronal activity-regulated enhancers , 2010, Nature.

[55]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[56]  Leighton J. Core,et al.  Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters , 2008, Science.

[57]  Jonathan D. G. Jones,et al.  The plant immune system , 2006, Nature.

[58]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[59]  K. Yano,et al.  Human Transcription Elongation Factor NELF: Identification of Novel Subunits and Reconstitution of the Functionally Active Complex , 2003, Molecular and Cellular Biology.

[60]  D. Tuan,et al.  Transcription of the hypersensitive site HS2 enhancer in erythroid cells. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[61]  S. Shannon,et al.  A Mutation in the Arabidopsis TFL1 Gene Affects Inflorescence Meristem Development. , 1991, The Plant cell.

[62]  M. Van Montagu,et al.  Light‐inducible and tissue‐specific expression of a chimaeric gene under control of the 5′‐flanking sequence of a pea chlorophyll a/b‐binding protein gene , 1985, The EMBO journal.

[63]  Michelle C. Stitzer,et al.  Adaptation in plant genomes: Bigger is different. , 2018, American journal of botany.

[64]  M. Hemberg,et al.  Enhancer RNAs: a class of long noncoding RNAs synthesized at enhancers. , 2015, Cold Spring Harbor perspectives in biology.

[65]  Susan Shannon,et al.  A Mutation in the Arabidopsis TFL 1 Gene Affects Inf lorescence Meristem Development , 2002 .