A Monte Carlo method to estimate cell population heterogeneity from cell snapshot data.

Variation is characteristic of all living systems. Laboratory techniques such as flow cytometry can probe individual cells, and, after decades of experimentation, it is clear that even members of genetically identical cell populations can exhibit differences. To understand whether variation is biologically meaningful, it is essential to discern its source. Mathematical models of biological systems are tools that can be used to investigate causes of cell-to-cell variation. From mathematical analysis and simulation of these models, biological hypotheses can be posed and investigated, then parameter inference can determine which of these is compatible with experimental data. Data from laboratory experiments often consist of "snapshots" representing distributions of cellular properties at different points in time, rather than individual cell trajectories. These data are not straightforward to fit using hierarchical Bayesian methods, which require the number of cell population clusters to be chosen a priori. Nor are they amenable to standard nonlinear mixed effect methods, since a single observation per cell is typically too few to estimate parameter variability. Here, we introduce a computational sampling method named "Contour Monte Carlo" (CMC) for estimating mathematical model parameters from snapshot distributions, which is straightforward to implement and does not require cells be assigned to predefined categories. The CMC algorithm fits to snapshot probability distributions rather than raw data, which means its computational burden does not, like existing approaches, increase with the number of cells observed. Our method is appropriate for underdetermined systems, where there are fewer distinct types of observations than parameters to be determined, and where observed variation is mostly due to variability in cellular processes rather than experimental measurement error. This may be the case for many systems due to continued improvements in resolution of laboratory techniques. In this paper, we apply our method to quantify cellular variation for three biological systems of interest and provide Julia code enabling others to use this method.

[1]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[2]  P. Swain,et al.  Stochastic Gene Expression in a Single Cell , 2002, Science.

[3]  Philipp M. Altrock,et al.  The mathematics of cancer: integrating quantitative models , 2015, Nature Reviews Cancer.

[4]  A. Schmid,et al.  Single-cell analysis in biotechnology, systems biology, and biocatalysis. , 2012, Annual review of chemical and biomolecular engineering.

[5]  Carsten Marr,et al.  Software tools for single-cell tracking and quantification of cellular and molecular properties , 2016, Nature Biotechnology.

[6]  Franz Kappel,et al.  Comparison of optimal design methods in inverse problems , 2011, Inverse problems.

[7]  Albert Tarantola,et al.  Monte Carlo sampling of solutions to inverse problems , 1995 .

[8]  Tim Wildey,et al.  Combining Push-Forward Measures and Bayes' Rule to Construct Consistent Solutions to Stochastic Inverse Problems , 2018, SIAM J. Sci. Comput..

[9]  Mads Kærn,et al.  A chance at survival: gene expression noise and phenotypic diversification strategies , 2009, Molecular microbiology.

[10]  Jukka Intosalmi,et al.  A subpopulation model to analyze heterogeneous cell differentiation dynamics , 2016, Bioinform..

[11]  Amy E. Herr,et al.  Single-cell western blotting , 2014, Nature Methods.

[12]  Jan Hasenauer,et al.  A Hierarchical, Data-Driven Approach to Modeling Single-Cell Populations Predicts Latent Causes of Cell-To-Cell Variability. , 2018, Cell systems.

[13]  D. Rubin,et al.  Inference from Iterative Simulation Using Multiple Sequences , 1992 .

[14]  Frank Delvigne,et al.  Metabolic variability in bioprocessing: implications of microbial phenotypic heterogeneity. , 2014, Trends in biotechnology.

[15]  Hannah H. Chang,et al.  Transcriptome-wide noise controls lineage choice in mammalian progenitor cells , 2008, Nature.

[16]  T. Vukicevic,et al.  Analysis of the Impact of Model Nonlinearities in Inverse Problem Solving , 2008 .

[17]  Lani F. Wu,et al.  Cellular Heterogeneity: Do Differences Make a Difference? , 2010, Cell.

[18]  Claudia Czado,et al.  Evading the curse of dimensionality in nonparametric density estimation with simplified vine copulas , 2015, J. Multivar. Anal..

[19]  Frank Allgöwer,et al.  Identification of models of heterogeneous cell populations from population snapshot data , 2011, BMC Bioinformatics.

[20]  Xiufen Zou,et al.  Single-cell data-driven mathematical model reveals possible molecular mechanisms of embryonic stem-cell differentiation. , 2019, Mathematical biosciences and engineering : MBE.

[21]  William G Telford,et al.  Flow cytometry of fluorescent proteins. , 2012, Methods.

[22]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[23]  Ajay Jasra,et al.  On population-based simulation for static inference , 2007, Stat. Comput..

[24]  M. Peter,et al.  Scalable inference of heterogeneous reaction kinetics from pooled single-cell recordings , 2013, Nature Methods.

[25]  Gunnar Cedersund,et al.  Nonlinear mixed-effects modelling for single cell estimation: when, why, and how to use it , 2015, BMC Systems Biology.

[26]  Steffen Waldherr,et al.  Estimation methods for heterogeneous cell population models in systems biology , 2018, Journal of The Royal Society Interface.

[27]  Fabian J. Theis,et al.  ODE Constrained Mixture Modelling: A Method for Unraveling Subpopulation Structures and Dynamics , 2014, PLoS Comput. Biol..

[28]  D. Ramkrishna,et al.  Population balance modeling: current status and future prospects. , 2014, Annual review of chemical and biomolecular engineering.

[29]  Joerg Stelling,et al.  A Simple and Flexible Computational Framework for Inferring Sources of Heterogeneity from Single-Cell Dynamics. , 2019, Cell systems.

[30]  M. Ridley The Red Queen: Sex and the Evolution of Human Nature , 1993 .

[31]  Frank Allgöwer,et al.  Bistable Biological Systems: A Characterization Through Local Compact Input-to-State Stability , 2008, IEEE Transactions on Automatic Control.

[32]  P. Maini,et al.  Cellular adaptations to hypoxia and acidosis during somatic evolution of breast cancer , 2007, British Journal of Cancer.

[33]  Pras Pathmanathan,et al.  Uncertainty and variability in models of the cardiac action potential: Can we build trustworthy models? , 2016, Journal of molecular and cellular cardiology.