Scalable inference of transcriptional kinetic parameters from MS2 time series data

Motivation The MS2-McP (MS2 coat protein) live imaging system allows for visualisation of transcription dynamics through the introduction of hairpin stem-loop sequences into a gene. A fluorescent signal at the site of nascent transcription in the nucleus quantifies mRNA production. computational modelling can be used to infer the promoter states along with the kinetic parameters governing transcription, such as promoter switching frequency and polymerase loading rate. However, modelling of the fluorescent trace presents a challenge due its persistence; the observed fluorescence at a given time point depends on both current and previous promoter states. A memory-adjusted Hidden Markov Model (mHMM) was recently introduced to allow inference of promoter activity from MS2-McP data. However, the computational time for inference scales exponentially with gene length and the mHMM is therefore not currently practical for application to many eukaryotic genes. Results We present a scalable implementation of the mHMM for fast inference of promoter activity and transcriptional kinetic parameters. This new method can model genes of arbitrary length through the use of a time-adaptive truncated compound state space. The truncated state space provides a good approximation to the full state space by retaining the most likely set of states at each time during the forward pass of the algorithm. Testing on MS2-MCP fluorescent data collected from early Drosophila melanogaster embryos indicates that the method provides accurate inference of kinetic parameters within a computationally feasible timeframe. The inferred promoter traces generated by the model can also be used to infer single-cell transcriptional parameters. Availability Python implementation available at https://github.com/ManchesterBioinference/burstInfer, along with code to reproduce the examples presented here.

[1]  Hernan G. Garcia,et al.  The embryo as a laboratory: quantifying transcription in Drosophila. , 2014, Trends in genetics : TIG.

[2]  A. Coulon,et al.  Eukaryotic transcriptional dynamics: from single molecules to cell populations , 2013, Nature Reviews Genetics.

[3]  R. Segev,et al.  GENERAL PROPERTIES OF THE TRANSCRIPTIONAL TIME-SERIES IN ESCHERICHIA COLI , 2011, Nature Genetics.

[4]  Michael L. Simpson,et al.  Transcriptional burst frequency and burst size are equally modulated across the human genome , 2012, Proceedings of the National Academy of Sciences.

[5]  D. Tranchina,et al.  Stochastic mRNA Synthesis in Mammalian Cells , 2006, PLoS biology.

[6]  A. Oudenaarden,et al.  Nature, Nurture, or Chance: Stochastic Gene Expression and Its Consequences , 2008, Cell.

[7]  Nicolas E. Buchler,et al.  BayFish: Bayesian inference of transcription dynamics from population snapshots of single-molecule RNA FISH in single cells , 2017, Genome Biology.

[8]  S. Itzkovitz,et al.  Bursty gene expression in the intact mammalian liver. , 2015, Molecular cell.

[9]  P. Dayan,et al.  A mathematical model explains saturating axon guidance responses to molecular gradients , 2016, eLife.

[10]  Adam M Corrigan,et al.  A continuum model of transcriptional bursting , 2016, eLife.

[11]  M. Elowitz,et al.  Functional roles for noise in genetic circuits , 2010, Nature.

[12]  David M. Umulis,et al.  Organism-scale modeling of early Drosophila patterning via bone morphogenetic proteins. , 2010, Developmental cell.

[13]  Edouard Bertrand,et al.  A Growing Toolbox to Image Gene Expression in Single Cells: Sensitive Approaches for Demanding Challenges. , 2018, Molecular cell.

[14]  M. Levine,et al.  Dpp signaling thresholds in the dorsal ectoderm of the Drosophila embryo. , 2000, Development.

[15]  Aleksandra M. Walczak,et al.  Live Imaging of Bicoid-Dependent Transcription in Drosophila Embryos , 2013, Current Biology.

[16]  E. Cox,et al.  Real-Time Kinetics of Gene Activity in Individual Bacteria , 2005, Cell.

[17]  Michael B. Eisen,et al.  Kinetic sculpting of the seven stripes of the Drosophila even-skipped gene , 2018, bioRxiv.

[18]  Michael Levine,et al.  Enhancer Control of Transcriptional Bursting , 2016, Cell.

[19]  T. Höfer,et al.  Frequency Modulation of Transcriptional Bursting Enables Sensitive and Rapid Gene Regulation. , 2018, Cell systems.

[20]  N. Barkai,et al.  Robustness of the BMP morphogen gradient in Drosophila embryonic patterning , 2022 .

[21]  Hernan G. Garcia,et al.  Multimodal transcriptional control of pattern formation in embryonic development , 2019, Proceedings of the National Academy of Sciences.

[22]  Hernan G. Garcia,et al.  Dynamic regulation of eve stripe 2 expression reveals transcriptional bursts in living Drosophila embryos , 2014, Proceedings of the National Academy of Sciences.

[23]  Shawn C. Little,et al.  Diverse Spatial Expression Patterns Emerge from Unified Kinetics of Transcriptional Bursting , 2018, Cell.

[24]  Huimin Chen,et al.  Transcription Dynamics in Living Cells. , 2016, Annual review of biophysics.

[25]  E. D. De Robertis,et al.  BMP gradients: A paradigm for morphogen-mediated developmental patterning , 2015, Science.

[26]  Hernan G. Garcia,et al.  Quantitative Imaging of Transcription in Living Drosophila Embryos Links Polymerase Activity to Patterning , 2013, Current Biology.

[27]  R. Singer,et al.  Transcriptional Pulsing of a Developmental Gene , 2006, Current Biology.

[28]  M. Rattray,et al.  Modulation of promoter occupancy dictates the transcriptional response to graded BMP signalling levels in the Drosophila embryo , 2019, bioRxiv.

[29]  H. Ashe,et al.  Regulation of the BMP Signaling-Responsive Transcriptional Network in the Drosophila Embryo , 2016, PLoS genetics.

[30]  Christophe Zimmer,et al.  FISH-quant: automatic counting of transcripts in 3D FISH images , 2013, Nature Methods.

[31]  R. Singer,et al.  Localization of ASH1 mRNA particles in living yeast. , 1998, Molecular cell.

[32]  Christophe Zimmer,et al.  A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting , 2016, Nature Communications.