An application of the Krylov-FSP-SSA method to parameter fitting with maximum likelihood

Monte Carlo methods such as the stochastic simulation algorithm (SSA) have traditionally been employed in gene regulation problems. However, there has been increasing interest to directly obtain the probability distribution of the molecules involved by solving the chemical master equation (CME). This requires addressing the curse of dimensionality that is inherent in most gene regulation problems. The finite state projection (FSP) seeks to address the challenge and there have been variants that further reduce the size of the projection or that accelerate the resulting matrix exponential. The Krylov-FSP-SSA variant has proved numerically efficient by combining, on one hand, the SSA to adaptively drive the FSP, and on the other hand, adaptive Krylov techniques to evaluate the matrix exponential. Here we apply this Krylov-FSP-SSA to a mutual inhibitory gene network synthetically engineered in Saccharomyces cerevisiae, in which bimodality arises. We show numerically that the approach can efficiently approximate the transient probability distribution, and this has important implications for parameter fitting, where the CME has to be solved for many different parameter sets. The fitting scheme amounts to an optimization problem of finding the parameter set so that the transient probability distributions fit the observations with maximum likelihood. We compare five optimization schemes for this difficult problem, thereby providing further insights into this approach of parameter estimation that is often applied to models in systems biology where there is a need to calibrate free parameters.

[1]  Brian Munsky,et al.  Finite state projection based bounds to compare chemical master equation models using single-cell data. , 2016, The Journal of chemical physics.

[2]  K. Burrage,et al.  A Krylov-based finite state projection algorithm for solving the chemical master equation arising in the discrete modelling of biological systems , 2006 .

[3]  S. Dreyfus,et al.  Thermodynamical Approach to the Traveling Salesman Problem : An Efficient Simulation Algorithm , 2004 .

[4]  Linda R Petzold,et al.  Efficient step size selection for the tau-leaping simulation method. , 2006, The Journal of chemical physics.

[5]  E. Cox,et al.  Real-Time Kinetics of Gene Activity in Individual Bacteria , 2005, Cell.

[6]  M. Khammash,et al.  A reduced model solution for the chemical master equation arising in stochastic analyses of biological networks , 2006, Proceedings of the 45th IEEE Conference on Decision and Control.

[7]  András Horváth,et al.  Parameter Estimation of Kinetic Rates in Stochastic Reaction Networks by the EM Method , 2008, 2008 International Conference on BioMedical Engineering and Informatics.

[8]  M. Magnasco,et al.  Decay rates of human mRNAs: correlation with functional characteristics and sequence attributes. , 2003, Genome research.

[9]  M. J. D. Powell,et al.  Least Frobenius norm updating of quadratic models that satisfy interpolation conditions , 2004, Math. Program..

[10]  D. Gillespie A General Method for Numerically Simulating the Stochastic Time Evolution of Coupled Chemical Reactions , 1976 .

[11]  X. Xie,et al.  Probing Gene Expression in Live Cells, One Protein Molecule at a Time , 2006, Science.

[12]  Xiaohui Xie,et al.  Parameter inference for discretely observed stochastic kinetic models using stochastic gradient descent , 2010, BMC Systems Biology.

[13]  I. Chou,et al.  Recent developments in parameter estimation and structure identification of biochemical and genomic systems. , 2009, Mathematical biosciences.

[14]  Cleve B. Moler,et al.  Nineteen Dubious Ways to Compute the Exponential of a Matrix, Twenty-Five Years Later , 1978, SIAM Rev..

[15]  Linda R Petzold,et al.  The slow-scale stochastic simulation algorithm. , 2005, The Journal of chemical physics.

[16]  Brian Munsky,et al.  A multiple time interval finite state projection algorithm for the solution to the chemical master equation , 2007, J. Comput. Phys..

[17]  Ido Golding,et al.  Stochastic Kinetics of Nascent RNA. , 2016, Physical review letters.

[18]  William L. Goffe SIMANN: A Global Optimization Algorithm using Simulated Annealing , 1996 .

[19]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[20]  Roger B Sidje,et al.  Understanding the finite state projection and related methods for solving the chemical master equation , 2016, Physical biology.

[21]  Ivo F Sbalzarini,et al.  A new class of highly efficient exact stochastic simulation algorithms for chemical reaction networks. , 2009, The Journal of chemical physics.

[22]  A. Arkin,et al.  Stochastic amplification and signaling in enzymatic futile cycles through noise-induced bistability with oscillations. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Linda R. Petzold,et al.  Accelerated maximum likelihood parameter estimation for stochastic biochemical systems , 2012, BMC Bioinformatics.

[24]  M. Khammash,et al.  Systematic Identification of Signal-Activated Stochastic Gene Regulation , 2013, Science.

[25]  M. Khammash,et al.  Identification from stochastic cell-to-cell variation: a genetic switch case study. , 2010, IET systems biology.

[26]  M. Khammash,et al.  The finite state projection algorithm for the solution of the chemical master equation. , 2006, The Journal of chemical physics.

[27]  Suresh Kumar Poovathingal,et al.  Global parameter estimation methods for stochastic biochemical systems , 2010, BMC Bioinformatics.

[28]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[29]  A. Khachaturyan,et al.  The thermodynamic approach to the structure analysis of crystals , 1981 .

[30]  Leen Stougie,et al.  Global optimization : a stochastic approach , 1980 .

[31]  Roger B. Sidje,et al.  Expokit: a software package for computing matrix exponentials , 1998, TOMS.

[32]  D. Vlachos,et al.  Binomial distribution based tau-leap accelerated stochastic simulation. , 2005, The Journal of chemical physics.

[33]  Michael A. Gibson,et al.  Efficient Exact Stochastic Simulation of Chemical Systems with Many Species and Many Channels , 2000 .

[34]  Brian Munsky,et al.  Reduction and solution of the chemical master equation using time scale separation and finite state projection. , 2006, The Journal of chemical physics.

[35]  Fred W. Glover,et al.  A Template for Scatter Search and Path Relinking , 1997, Artificial Evolution.

[36]  C. Pesce,et al.  Regulated cell-to-cell variation in a cell-fate decision system , 2005, Nature.

[37]  Alexander H. G. Rinnooy Kan,et al.  A stochastic method for global optimization , 1982, Math. Program..

[38]  M. Khammash,et al.  Stochastic analysis of gene regulatory networks using finite state projections and singular perturbation , 2007, 2007 American Control Conference.

[39]  Thomas A. Henzinger,et al.  Solving the chemical master equation using sliding windows , 2010, BMC Systems Biology.

[40]  Darren J. Wilkinson,et al.  Bayesian Sequential Inference for Stochastic Kinetic Biochemical Network Models , 2006, J. Comput. Biol..

[41]  Tibor Csendes,et al.  Nonlinear Parameter Estimation by Global Optimization - Efficiency and reliability , 1989, Acta Cybern..

[42]  Aidan P Thompson,et al.  A constant-time kinetic Monte Carlo algorithm for simulation of large biochemical reaction networks. , 2008, The Journal of chemical physics.

[43]  David Fange,et al.  Noise-Induced Min Phenotypes in E. coli , 2006, PLoS Comput. Biol..

[44]  A. Khachaturyan,et al.  Statistical mechanics approach to the structure determination of a crystal , 1985 .

[45]  A. Arkin,et al.  It's a noisy business! Genetic regulation at the nanomolar scale. , 1999, Trends in genetics : TIG.

[46]  M. Elowitz,et al.  A synthetic oscillatory network of transcriptional regulators , 2000, Nature.

[47]  Hong Li,et al.  Efficient formulation of the stochastic simulation algorithm for chemically reacting systems. , 2004, The Journal of chemical physics.

[48]  D. Wilkinson Stochastic modelling for quantitative description of heterogeneous biological systems , 2009, Nature Reviews Genetics.

[49]  J. Gregg,et al.  Allele-specific Holliday junction formation: a new mechanism of allelic discrimination for SNP scoring. , 2003, Genome research.

[50]  Y. Lai,et al.  Engineering of regulated stochastic cell fate determination , 2013, Proceedings of the National Academy of Sciences.

[51]  J. Collins,et al.  DIVERSITY-BASED, MODEL-GUIDED CONSTRUCTION OF SYNTHETIC GENE NETWORKS WITH PREDICTED FUNCTIONS , 2009, Nature Biotechnology.

[52]  Fred W. Glover,et al.  Scatter Search and Local Nlp Solvers: A Multistart Framework for Global Optimization , 2006, INFORMS J. Comput..

[53]  Roberto Zunino,et al.  Tree-based search for stochastic simulation algorithm , 2012, SAC '12.

[54]  J. Collins,et al.  Construction of a genetic toggle switch in Escherichia coli , 2000, Nature.

[55]  J Timmer,et al.  Parameter estimation in stochastic biochemical reactions. , 2006, Systems biology.

[56]  Carmen G. Moles,et al.  Parameter estimation in biochemical pathways: a comparison of global optimization methods. , 2003, Genome research.

[57]  D. Gillespie,et al.  Avoiding negative populations in explicit Poisson tau-leaping. , 2005, The Journal of chemical physics.

[58]  Tim P. Schulze,et al.  Efficient kinetic Monte Carlo simulation , 2008, J. Comput. Phys..

[59]  Dan ie l T. Gil lespie A rigorous derivation of the chemical master equation , 1992 .

[60]  Darren J. Wilkinson,et al.  Bayesian inference for a discretely observed stochastic kinetic model , 2008, Stat. Comput..

[61]  Nagiza F. Samatova,et al.  The sorting direct method for stochastic simulation of biochemical systems with varying reaction execution behavior , 2006, Comput. Biol. Chem..

[62]  Roger B Sidje,et al.  An adaptive solution to the chemical master equation using tensors. , 2017, The Journal of chemical physics.

[63]  Markus Hegland,et al.  An optimal Finite State Projection Method , 2010, ICCS.

[64]  D. Gillespie Approximate accelerated stochastic simulation of chemically reacting systems , 2001 .

[65]  Junbin Gao,et al.  Simulated maximum likelihood method for estimating kinetic rates in gene expression , 2007, Bioinform..

[66]  R. Brent Algorithms for finding zeros and extrema of functions without calculating derivatives , 1971 .

[67]  R. O'Neill Algorithm AS 47: Function Minimization Using a Simplex Procedure , 1971 .

[68]  Nikolaos V. Sahinidis,et al.  Derivative-free optimization: a review of algorithms and comparison of software implementations , 2013, J. Glob. Optim..

[69]  William L. Goffe,et al.  SIMANN: FORTRAN module to perform Global Optimization of Statistical Functions with Simulated Annealing , 1992 .

[70]  R B Sidje,et al.  Solving the chemical master equation by a fast adaptive finite state projection based on the stochastic simulation algorithm. , 2015, Mathematical biosciences.

[71]  M. Powell The NEWUOA software for unconstrained optimization without derivatives , 2006 .