Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays

Significance Gene transcription is a highly regulated dynamic process. Delays in transcription have important consequences on dynamics of gene expression and consequently on downstream biological function. We model temporal dynamics of transcription using genome-wide time course data measuring transcriptional activity and mRNA concentration. We find a significant number of genes exhibit a long RNA processing delay between transcription termination and mRNA production. These long processing delays are more common for short genes, which would otherwise be expected to transcribe most rapidly. The distribution of intronic reads suggests that these delays are required for splicing to be completed. Understanding such delays is essential for understanding how a rapid cellular response is regulated. Genes with similar transcriptional activation kinetics can display very different temporal mRNA profiles because of differences in transcription time, degradation rate, and RNA-processing kinetics. Recent studies have shown that a splicing-associated RNA production delay can be significant. To investigate this issue more generally, it is useful to develop methods applicable to genome-wide datasets. We introduce a joint model of transcriptional activation and mRNA accumulation that can be used for inference of transcription rate, RNA production delay, and degradation rate given data from high-throughput sequencing time course experiments. We combine a mechanistic differential equation model with a nonparametric statistical modeling approach allowing us to capture a broad range of activation kinetics, and we use Bayesian parameter estimation to quantify the uncertainty in estimates of the kinetic parameters. We apply the model to data from estrogen receptor α activation in the MCF-7 breast cancer cell line. We use RNA polymerase II ChIP-Seq time course data to characterize transcriptional activation and mRNA-Seq time course data to quantify mature transcripts. We find that 11% of genes with a good signal in the data display a delay of more than 20 min between completing transcription and mature mRNA production. The genes displaying these long delays are significantly more likely to be short. We also find a statistical association between high delay and late intron retention in pre-mRNA data, indicating significant splicing-associated production delays in many genes.

[1]  S. Duane,et al.  Hybrid Monte Carlo , 1987 .

[2]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[3]  N. Monk Oscillatory Expression of Hes1, p53, and NF-κB Driven by Transcriptional Time Delays , 2003, Current Biology.

[4]  U. Alon,et al.  Just-in-time transcription program in metabolic pathways , 2004, Nature Genetics.

[5]  Neil D. Lawrence,et al.  Modelling transcriptional regulation using Gaussian Processes , 2006, NIPS.

[6]  Neil D. Lawrence,et al.  Gaussian process modelling of latent chemical species: applications to inferring transcription factor activities , 2008, ECCB.

[7]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[8]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[9]  D. Baltimore,et al.  The stability of mRNA influences the temporal order of the induction of genes encoding inflammatory molecules , 2009, Nature Immunology.

[10]  Antti Honkela,et al.  Model-based method for transcription factor target identification with limited data , 2010, Proceedings of the National Academy of Sciences.

[11]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[12]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[13]  Leighton J. Core,et al.  A Rapid, Extensive, and Transient Transcriptional Response to Estrogen Signaling in Breast Cancer Cells , 2011, Cell.

[14]  Eytan Domany,et al.  Coupled pre-mRNA and mRNA dynamics unveil operational strategies underlying transcriptional responses to stimuli , 2013 .

[15]  N. Friedman,et al.  Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells , 2011, Nature Biotechnology.

[16]  M. Rosbash,et al.  Cotranscriptional splicing efficiency differs dramatically between Drosophila and mouse. , 2012, RNA.

[17]  Antti Honkela,et al.  Identifying differentially expressed transcripts from RNA-seq data with biological variation , 2011, Bioinform..

[18]  N. Guex,et al.  Genome-Wide RNA Polymerase II Profiles and RNA Accumulation Reveal Kinetics of Transcription and Associated Epigenetic Changes During Diurnal Cycles , 2012, PLoS biology.

[19]  Maria Carmo-Fonseca,et al.  Dynamic transitions in RNA polymerase II density profiles during transcription termination , 2012, Genome research.

[20]  D. Black,et al.  Splicing kinetics and transcript release from the chromatin compartment limit the rate of Lipid A-induced gene expression. , 2013, RNA.

[21]  M. Lynch,et al.  A simple model to explain evolutionary trends of eukaryotic gene architecture and expression: how competition between splicing and cleavage/polyadenylation factors may affect gene expression and splice-site recognition in eukaryotes. , 2013, BioEssays : news and reviews in molecular, cellular and developmental biology.

[22]  André L. Martins,et al.  Signaling pathways differentially affect RNA polymerase II initiation, pausing, and elongation rate in cells. , 2013, Molecular cell.

[23]  D. Baltimore,et al.  RNA splicing regulates the temporal order of TNF-induced gene expression , 2012, Proceedings of the National Academy of Sciences.

[24]  Neil D. Lawrence,et al.  Inference of RNA Polymerase II Transcription Dynamics from Chromatin Immunoprecipitation Time Course Data , 2013, PLoS Comput. Biol..

[25]  Andreas Krause,et al.  Advances in Neural Information Processing Systems (NIPS) , 2014 .

[26]  D. Bentley Coupling mRNA processing with transcription in time and space , 2014, Nature Reviews Genetics.