Robust reconstruction of gene expression profiles from reporter gene data using linear inversion

Motivation: Time-series observations from reporter gene experiments are commonly used for inferring and analyzing dynamical models of regulatory networks. The robust estimation of promoter activities and protein concentrations from primary data is a difficult problem due to measurement noise and the indirect relation between the measurements and quantities of biological interest. Results: We propose a general approach based on regularized linear inversion to solve a range of estimation problems in the analysis of reporter gene data, notably the inference of growth rate, promoter activity, and protein concentration profiles. We evaluate the validity of the approach using in silico simulation studies, and observe that the methods are more robust and less biased than indirect approaches usually encountered in the experimental literature based on smoothing and subsequent processing of the primary data. We apply the methods to the analysis of fluorescent reporter gene data acquired in kinetic experiments with Escherichia coli. The methods are capable of reliably reconstructing time-course profiles of growth rate, promoter activity and protein concentration from weak and noisy signals at low population volumes. Moreover, they capture critical features of those profiles, notably rapid changes in gene expression during growth transitions. Availability and implementation: The methods described in this article are made available as a Python package (LGPL license) and also accessible through a web interface. For more information, see https://team.inria.fr/ibis/wellinverter. Contact: Hidde.de-Jong@inria.fr Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Andrew Travers,et al.  DNA supercoiling — a global transcriptional regulator for enterobacterial growth? , 2005, Nature Reviews Microbiology.

[2]  Terence Hwa,et al.  Combinatorial transcriptional control of the lactose operon of Escherichia coli , 2007, Proceedings of the National Academy of Sciences.

[3]  R. Milo,et al.  Promoters maintain their relative activity levels under different growth conditions , 2013, Molecular systems biology.

[4]  R. Tsien,et al.  The Fluorescent Toolbox for Assessing Protein Location and Function , 2006, Science.

[5]  U. Alon,et al.  Assigning numbers to the arrows: Parameterizing a gene regulation network by using accurate expression kinetics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[6]  John Lygeros,et al.  Structural identification of unate-like genetic network models from time-lapse protein concentration measurements , 2010, 49th IEEE Conference on Decision and Control (CDC).

[7]  Eric Jones,et al.  SciPy: Open Source Scientific Tools for Python , 2001 .

[8]  Timothy C Elston,et al.  Mathematical analysis and quantification of fluorescent proteins as transcriptional reporters. , 2008, Biophysical journal.

[9]  A. S. Morse,et al.  Introduction to linear system theory , 1972 .

[10]  Per Christian Hansen,et al.  Analysis of Discrete Ill-Posed Problems by Means of the L-Curve , 1992, SIAM Rev..

[11]  Julio R. Banga,et al.  Reverse engineering and identification in systems biology: strategies, perspectives and challenges , 2014, Journal of The Royal Society Interface.

[12]  Riet De Smet,et al.  Advantages and limitations of current network inference methods , 2010, Nature Reviews Microbiology.

[13]  J. Geiselmann,et al.  A genome-wide screen for identifying all regulators of a target gene , 2013, Nucleic acids research.

[14]  R. Hengge-aronis,et al.  Signal Transduction and Regulatory Mechanisms Involved in Control of the σS (RpoS) Subunit of RNA Polymerase , 2002, Microbiology and Molecular Biology Reviews.

[15]  M. Ehrenberg,et al.  Control of rRNA Synthesis in Escherichia coli: a Systems Biology Approach , 2004, Microbiology and Molecular Biology Reviews.

[16]  Milton H. Saier,et al.  Transcriptome Analysis of Crp-Dependent Catabolite Control of Gene Expression in Escherichia coli , 2004, Journal of bacteriology.

[17]  Frédéric Boyer,et al.  WellReader: a MATLAB program for the analysis of fluorescence and luminescence reporter gene data , 2010, Bioinform..

[18]  Vincent Fromion,et al.  BasyLiCA: a tool for automatic processing of a Bacterial Live Cell Array , 2012, Bioinform..

[19]  S. Ueda,et al.  Growth Phase-Dependent Variation in Protein Composition of the Escherichia coli Nucleoid , 1999, Journal of bacteriology.

[20]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[21]  S. Lukyanov,et al.  Fluorescent proteins and their applications in imaging living cells and tissues. , 2010, Physiological reviews.

[22]  Giuseppe De Nicolao,et al.  Nonparametric input estimation in physiological systems: Problems, methods, and case studies , 1997, Autom..

[23]  G. Williams,et al.  The relative rates of protein synthesis and degradation in a growing culture of Escherichia coli. , 1980, The Journal of biological chemistry.

[24]  J. Geiselmann,et al.  Shared control of gene expression in bacteria by transcription factors and global physiology of the cell , 2013, Molecular systems biology.

[25]  Steven E. Lindow,et al.  Predictive and Interpretive Simulation of Green Fluorescent Protein Expression in Reporter Bacteria , 2001, Journal of bacteriology.

[26]  Ivan B. N. Clark,et al.  Unmixing of fluorescence spectra to resolve quantitative time-series measurements of gene expression in plate readers , 2014, BMC Biotechnology.

[27]  U. Sauer,et al.  Dissecting specific and global transcriptional regulation of bacterial gene expression , 2013, Molecular systems biology.

[28]  James C Liao,et al.  A Global Regulatory Role of Gluconeogenic Genes in Escherichia coli Revealed by Transcriptome Network Analysis* , 2005, Journal of Biological Chemistry.

[29]  Brice Enjalbert,et al.  Physiological and Molecular Timing of the Glucose to Acetate Transition in Escherichia coli , 2013, Metabolites.

[30]  Hidde de Jong,et al.  Experimental and computational validation of models of fluorescent and luminescent reporter genes in bacteria , 2010, BMC Systems Biology.

[31]  M. Bertero Linear Inverse and III-Posed Problems , 1989 .

[32]  Eugenio Cinquemani,et al.  Inference of Quantitative Models of Bacterial Promoters from Time-Series Reporter Gene Data , 2015, PLoS Comput. Biol..

[33]  A. D. de Koning,et al.  Effects of Fis on Escherichia coli gene expression during different growth stages. , 2007, Microbiology.

[34]  Carl D. Laird,et al.  Determining transcription factor profiles from fluorescent reporter systems involving regularization of inverse problems , 2012, 2012 American Control Conference (ACC).

[35]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[36]  A. Wolfe The Acetate Switch , 2005, Microbiology and Molecular Biology Reviews.

[37]  G. Wahba Spline models for observational data , 1990 .

[38]  Andrew J. Millar,et al.  Reconstruction of transcriptional dynamics from gene reporter data using differential equations , 2008, Bioinform..

[39]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.