Detecting differential growth of microbial populations with Gaussian process regression

Microbial growth curves are used to study differential effects of media, genetics, and stress on microbial population growth. Consequently, many modeling frameworks exist to capture microbial population growth measurements. However, current models are designed to quantify growth under conditions that produce a specific functional form. Extensions to these models are required to quantify the effects of perturbations, which often exhibit non-standard growth curves. Rather than fix expected functional forms of different experimental perturbations, we developed a general and robust model of microbial population growth curves using Gaussian process (GP) regression. GP regression modeling of high resolution time-series growth data enables accurate quantification of population growth, and can be extended to identify differential growth phenotypes due to genetic background or stress. Additionally, confounding effects due to experimental variation can be controlled explicitly. Our framework substantially outperforms commonly used microbial population growth models, particularly when modeling growth data from environmentally stressed populations. We apply the GP growth model to a collection of growth measurements for seven transcription factor knockout strains of a model archaeal organism, Halobacterium salinarum. Using these models fitted to growth data, two statistical tests were developed to quantify the differential effects of genetic and environmental perturbations on microbial growth. These statistical tests accurately identify known regulators and implicate novel regulators of growth under standard and stress conditions. Furthermore, the fitted GP regression models are interpretable, recapitulating biological knowledge of growth response while providing new insights into the relevant parameters affecting microbial population growth.

[1]  P. Zuber Management of oxidative stress in Bacillus. , 2009, Annual review of microbiology.

[2]  T. Egli Growth Kinetics, Bacterial , 2009 .

[3]  Jennifer Abrams,et al.  Biology of the Heat Shock Response and Protein Chaperones: Budding Yeast (Saccharomyces cerevisiae) as a Model System , 2012, Microbiology and Molecular Reviews.

[4]  M. Facciotti,et al.  Regulatory Multidimensionality of Gas Vesicle Biogenesis in Halobacterium salinarum NRC-1 , 2011, Archaea.

[5]  Min Pan,et al.  Evolution of context dependent regulation by expansion of feast/famine regulatory proteins , 2014, BMC Systems Biology.

[6]  Amy K. Schmid,et al.  A single transcription factor regulates evolutionarily diverse but functionally linked metabolic pathways in response to nutrient availability , 2009, Molecular systems biology.

[7]  Fernando di Sciascio,et al.  Biomass estimation in batch biotechnological processes by Bayesian Gaussian process regression , 2008, Comput. Chem. Eng..

[8]  Stephen C. Ekker,et al.  Mojo Hand, a TALEN design tool for genome editing applications , 2013, BMC Bioinformatics.

[9]  D. Botstein,et al.  Genomic expression programs in the response of yeast cells to environmental changes. , 2000, Molecular biology of the cell.

[10]  Sunil Kumar,et al.  Regulation of Iron Metabolism by Pyrococcus furiosus , 2013, Journal of bacteriology.

[11]  A. Oren Microbial life at high salt concentrations: phylogenetic and metabolic diversity , 2008, Saline systems.

[12]  Amy K. Schmid,et al.  A Regulatory Hierarchy Controls the Dynamic Transcriptional Response to Extreme Oxidative Stress in Archaea , 2015, PLoS genetics.

[13]  D W Stephen,et al.  The role of the YAP1 and YAP2 genes in the regulation of the adaptive oxidative stress responses of Saccharomyces cerevisiae , 1995, Molecular microbiology.

[14]  Min Pan,et al.  A systems level predictive model for global gene regulation of methanogenesis in a hydrogenotrophic methanogen , 2013, Genome research.

[15]  Jon T. Schnute,et al.  A Versatile Growth Model with Statistically Stable Parameters , 1981 .

[16]  J. Herrou,et al.  General Stress Signaling in the Alphaproteobacteria. , 2015, Annual review of genetics.

[17]  Edda Klipp,et al.  Zooming in on yeast osmoadaptation. , 2012, Advances in experimental medicine and biology.

[18]  Jennifer Listgarten,et al.  Flexible Modelling of Genetic Effects on Function-Valued Traits , 2016, RECOMB.

[19]  A. Matin,et al.  Starvation-induced cross protection against heat or H2O2 challenge in Escherichia coli , 1988, Journal of bacteriology.

[20]  Matthew J. Brauer,et al.  Slow Growth Induces Heat-shock Resistance in Normal and Respiratory-deficient Yeast , 2022 .

[21]  J. Imlay,et al.  Pathways of oxidative damage. , 2003, Annual review of microbiology.

[22]  D. McCorquodale,et al.  Patterns of protein synthesis in Escherichia coli infected by amber mutants in the first-step-transfer DNA of T5. , 1970, Journal of molecular biology.

[23]  S. DasSarma,et al.  Homologous gene knockout in the archaeon Halobacterium salinarum with ura3 as a counterselectable marker , 2000, Molecular microbiology.

[24]  Min Pan,et al.  A systems view of haloarchaeal strategies to withstand stress from transition metals. , 2006, Genome research.

[25]  Xuewen Lu,et al.  Modeling Microbial Responses in Food , 2003 .

[26]  F. J. Richards A Flexible Growth Function for Empirical Use , 1959 .

[27]  B. Palsson,et al.  Constraining the metabolic genotype–phenotype relationship using a phylogeny of in silico methods , 2012, Nature Reviews Microbiology.

[28]  Neil D. Lawrence,et al.  Hierarchical Bayesian modelling of gene expression time series across irregularly sampled replicates and clusters , 2013, BMC Bioinformatics.

[29]  Ricardo Cao,et al.  A random effect multiplicative heteroscedastic model for bacterial growth , 2010, BMC Bioinformatics.

[30]  F. Rombouts,et al.  Modeling of the Bacterial Growth Curve , 1990, Applied and environmental microbiology.

[31]  David J. C. MacKay,et al.  Bayesian Interpolation , 1992, Neural Computation.

[32]  J Baranyi,et al.  Mathematics of predictive food microbiology. , 1995, International journal of food microbiology.

[33]  M. Peleg,et al.  On Models of the Temperature Effect on the Rate of Chemical Reactions and Biological Processes in Foods , 2012, Food Engineering Reviews.

[34]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[35]  T. Hwa,et al.  Coordination of bacterial proteome with metabolism by cyclic AMP signalling , 2013, Nature.

[36]  Francesca Mangili,et al.  Gaussian Processes for Bayesian hypothesis tests on regression functions , 2015, AISTATS.

[37]  Carl E. Rasmussen,et al.  Derivative Observations in Gaussian Process Models of Dynamic Systems , 2002, NIPS.

[38]  J. G. Kuenen,et al.  Biochemical limits to microbial growth yields: An analysis of mixed substrate utilization , 1988, Biotechnology and bioengineering.

[39]  Amy K. Schmid,et al.  The RosR transcription factor is required for gene expression dynamics in response to extreme oxidative stress in a hypersaline-adapted archaeon , 2012, BMC Genomics.

[40]  Amy K. Schmid,et al.  Two transcription factors are necessary for iron homeostasis in a salt-dwelling archaeon , 2010, Nucleic acids research.

[41]  M. Wiper,et al.  Bayesian modeling of bacterial growth for multiple populations , 2014, 1411.5780.

[42]  Amy K. Schmid,et al.  Dynamic Metabolite Profiling in an Archaeon Connects Transcriptional Regulation to Metabolic Consequences , 2015, PloS one.

[43]  V. Thorsson,et al.  Genome sequence of Halobacterium species NRC-1. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Min Pan,et al.  Parallel evolution of transcriptome architecture during genome reorganization. , 2011, Genome research.

[45]  Paul Shannon,et al.  Comprehensive de novo structure prediction in a systems-biology context for the archaea Halobacterium sp. NRC-1 , 2004, Genome Biology.

[46]  Amy K. Schmid,et al.  A transcription factor links growth rate and metabolism in the hypersaline adapted archaeon Halobacterium salinarum , 2014, Molecular microbiology.

[47]  K. Jarrod Millman,et al.  Python for Scientists and Engineers , 2011, Comput. Sci. Eng..

[48]  Antoine Allard,et al.  A system-level model for the microbial regulatory genome , 2014, Molecular systems biology.

[49]  Moselio Schaechter,et al.  Encyclopedia of microbiology , 2009 .

[50]  Geoffrey E. Hinton,et al.  Bayesian Learning for Neural Networks , 1995 .

[51]  George Eastman House,et al.  Sparse Bayesian Learning and the Relevan e Ve tor Ma hine , 2001 .

[52]  Amy K. Schmid,et al.  Protein–DNA binding dynamics predict transcriptional response to nutrients in archaea , 2013, Nucleic acids research.

[53]  Christopher D. Brown,et al.  A statin-dependent QTL for GATM expression is associated with statin-induced myopathy , 2013, Nature.

[54]  N. Krogan,et al.  Phenotypic Landscape of a Bacterial Cell , 2011, Cell.

[55]  J. Monod The Growth of Bacterial Cultures , 1949 .

[56]  Amy K. Schmid,et al.  A Predictive Model for Transcriptional Control of Physiology in a Free Living Cell , 2007, Cell.

[57]  F. Neidhardt,et al.  Patterns of protein synthesis in E. coli: a catalog of the amount of 140 individual proteins at different growth rates , 1978, Cell.

[58]  E. Skjerve,et al.  Growth comparison of several Escherichia coli strains exposed to various concentrations of lactoferrin using linear spline regression , 2012, Microbial Informatics and Experimentation.

[59]  M. Peleg,et al.  Microbial Growth Curves: What the Models Tell Us and What They Cannot , 2011, Critical reviews in food science and nutrition.