Global parameter estimation methods for stochastic biochemical systems

BackgroundThe importance of stochasticity in cellular processes having low number of molecules has resulted in the development of stochastic models such as chemical master equation. As in other modelling frameworks, the accompanying rate constants are important for the end-applications like analyzing system properties (e.g. robustness) or predicting the effects of genetic perturbations. Prior knowledge of kinetic constants is usually limited and the model identification routine typically includes parameter estimation from experimental data. Although the subject of parameter estimation is well-established for deterministic models, it is not yet routine for the chemical master equation. In addition, recent advances in measurement technology have made the quantification of genetic substrates possible to single molecular levels. Thus, the purpose of this work is to develop practical and effective methods for estimating kinetic model parameters in the chemical master equation and other stochastic models from single cell and cell population experimental data.ResultsThree parameter estimation methods are proposed based on the maximum likelihood and density function distance, including probability and cumulative density functions. Since stochastic models such as chemical master equations are typically solved using a Monte Carlo approach in which only a finite number of Monte Carlo realizations are computationally practical, specific considerations are given to account for the effect of finite sampling in the histogram binning of the state density functions. Applications to three practical case studies showed that while maximum likelihood method can effectively handle low replicate measurements, the density function distance methods, particularly the cumulative density function distance estimation, are more robust in estimating the parameters with consistently higher accuracy, even for systems showing multimodality.ConclusionsThe parameter estimation methodologies described in this work have provided an effective and practical approach in the estimation of kinetic parameters of stochastic systems from either sparse or dense cell population data. Nevertheless, similar to kinetic parameter estimation in other modelling frameworks, not all parameters can be estimated accurately, which is a common problem arising from the lack of complete parameter identifiability from the available data.

[1]  U. Bhalla,et al.  Emergent properties of networks of biological signaling pathways. , 1999, Science.

[2]  K. Burrage,et al.  Stochastic chemical kinetics and the total quasi-steady-state assumption: application to the stochastic simulation algorithm and chemical master equation. , 2008, The Journal of chemical physics.

[3]  C. Pesce,et al.  Regulated cell-to-cell variation in a cell-fate decision system , 2005, Nature.

[4]  Eduardo Sontag,et al.  Building a cell cycle oscillator: hysteresis and bistability in the activation of Cdc2 , 2003, Nature Cell Biology.

[5]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[6]  D. Vlachos,et al.  Binomial distribution based tau-leap accelerated stochastic simulation. , 2005, The Journal of chemical physics.

[7]  A. Arkin,et al.  Stochastic kinetic analysis of developmental pathway bifurcation in phage lambda-infected Escherichia coli cells. , 1998, Genetics.

[8]  K. Burrage,et al.  Binomial leap methods for simulating stochastic chemical kinetics. , 2004, The Journal of chemical physics.

[9]  Adam P. Arkin,et al.  Efficient stochastic sensitivity analysis of discrete event systems , 2007, J. Comput. Phys..

[10]  Rainer Laur,et al.  Stopping Criteria for Single-Objective Optimization , 2005 .

[11]  Melvin Alexander Applied Statistics and Probability for Engineers , 1995 .

[12]  Michael A. Gibson,et al.  Efficient Exact Stochastic Simulation of Chemical Systems with Many Species and Many Channels , 2000 .

[13]  D. Gillespie Exact Stochastic Simulation of Coupled Chemical Reactions , 1977 .

[14]  Mads Kærn,et al.  Noise in eukaryotic gene expression , 2003, Nature.

[15]  D. Gillespie Markov Processes: An Introduction for Physical Scientists , 1991 .

[16]  X. Xie,et al.  Probing Gene Expression in Live Cells, One Protein Molecule at a Time , 2006, Science.

[17]  J. Rawlings,et al.  Approximate simulation of coupled fast and slow reactions for stochastic chemical kinetics , 2002 .

[18]  Darren J. Wilkinson,et al.  Bayesian Sequential Inference for Stochastic Kinetic Biochemical Network Models , 2006, J. Comput. Biol..

[19]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[20]  Dan ie l T. Gil lespie A rigorous derivation of the chemical master equation , 1992 .

[21]  M. Magnasco,et al.  Decay rates of human mRNAs: correlation with functional characteristics and sequence attributes. , 2003, Genome research.

[22]  Darren J. Wilkinson,et al.  Bayesian inference for a discretely observed stochastic kinetic model , 2008, Stat. Comput..

[23]  I. E. Nikerel,et al.  Model reduction and a priori kinetic parameter identifiability analysis using metabolome time series for metabolic reaction networks with linlog kinetics. , 2009, Metabolic engineering.

[24]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[25]  D. W. Scott,et al.  Multivariate Density Estimation, Theory, Practice and Visualization , 1992 .

[26]  A. Arkin,et al.  Stochastic amplification and signaling in enzymatic futile cycles through noise-induced bistability with oscillations. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[27]  R. Weiss,et al.  Artificial cell-cell communication in yeast Saccharomyces cerevisiae using signaling elements from Arabidopsis thaliana , 2005, Nature Biotechnology.

[28]  David Fange,et al.  Noise-Induced Min Phenotypes in E. coli , 2006, PLoS Comput. Biol..

[29]  A. Arkin,et al.  It's a noisy business! Genetic regulation at the nanomolar scale. , 1999, Trends in genetics : TIG.

[30]  M. Elowitz,et al.  A synthetic oscillatory network of transcriptional regulators , 2000, Nature.

[31]  M. Khammash,et al.  The finite state projection algorithm for the solution of the chemical master equation. , 2006, The Journal of chemical physics.

[32]  J. Collins,et al.  Construction of a genetic toggle switch in Escherichia coli , 2000, Nature.

[33]  Roger B. Sidje,et al.  Multiscale Modeling of Chemical Kinetics via the Master Equation , 2008, Multiscale Model. Simul..

[34]  E. Cox,et al.  Real-Time Kinetics of Gene Activity in Individual Bacteria , 2005, Cell.

[35]  Junbin Gao,et al.  Simulated maximum likelihood method for estimating kinetic rates in gene expression , 2007, Bioinform..

[36]  J. Gregg,et al.  Allele-specific Holliday junction formation: a new mechanism of allelic discrimination for SNP scoring. , 2003, Genome research.

[37]  Ertugrul M. Ozbudak,et al.  Multistability in the lactose utilization network of Escherichia coli , 2004, Nature.

[38]  Linda R Petzold,et al.  Efficient step size selection for the tau-leaping simulation method. , 2006, The Journal of chemical physics.

[39]  Pierre L'Ecuyer,et al.  An Object-Oriented Random-Number Package with Many Long Streams and Substreams , 2002, Oper. Res..

[40]  J Timmer,et al.  Parameter estimation in stochastic biochemical reactions. , 2006, Systems biology.

[41]  Yang Cao,et al.  Sensitivity analysis of discrete stochastic systems. , 2005, Biophysical journal.

[42]  I. Chou,et al.  Recent developments in parameter estimation and structure identification of biochemical and genomic systems. , 2009, Mathematical biosciences.

[43]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[44]  Rudiyanto Gunawan,et al.  Iterative approach to model identification of biological networks , 2005, BMC Bioinformatics.