Workflow for generating competing hypothesis from models with parameter uncertainty

Mathematical models are increasingly used in life sciences. However, contrary to other disciplines, biological models are typically over-parametrized and loosely constrained by scarce experimental data and prior knowledge. Recent efforts on analysis of complex models have focused on isolated aspects without considering an integrated approach—ranging from model building to derivation of predictive experiments and refutation or validation of robust model behaviours. Here, we develop such an integrative workflow, a sequence of actions expanding upon current efforts with the purpose of setting the stage for a methodology facilitating an extraction of core behaviours and competing mechanistic hypothesis residing within underdetermined models. To this end, we make use of optimization search algorithms, statistical (machine-learning) classification techniques and cluster-based analysis of the state variables' dynamics and their corresponding parameter sets. We apply the workflow to a mathematical model of fat accumulation in the arterial wall (atherogenesis), a complex phenomena with limited quantitative understanding, thus leading to a model plagued with inherent uncertainty. We find that the mathematical atherogenesis model can still be understood in terms of a few key behaviours despite the large number of parameters. This result enabled us to derive distinct mechanistic predictions from the model despite the lack of confidence in the model parameters. We conclude that building integrative workflows enable investigators to embrace modelling of complex biological processes despite uncertainty in parameters.

[1]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[2]  Richard Peto,et al.  General Discussion: II , 1993 .

[3]  Carmen G. Moles,et al.  Parameter estimation in biochemical pathways: a comparison of global optimization methods. , 2003, Genome research.

[4]  Denis Noble,et al.  Differential and integral views of genetics in computational systems biology , 2011, Interface Focus.

[5]  Gunnar Cedersund,et al.  Mass and Information Feedbacks through Receptor Endocytosis Govern Insulin Signaling as Revealed Using a Parameter-free Modeling Framework* , 2010, The Journal of Biological Chemistry.

[6]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[7]  K. S. Brown,et al.  Statistical mechanical approaches to models with many poorly known parameters. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  E. Marder,et al.  Similar network activity from disparate circuit parameters , 2004, Nature Neuroscience.

[9]  David R. Gilbert,et al.  A Model Checking Approach to the Parameter Estimation of Biochemical Pathways , 2008, CMSB.

[10]  Mark K Transtrum,et al.  Why are nonlinear fits to data so challenging? , 2009, Physical review letters.

[11]  Michael Hucka,et al.  The Systems Biology Markup Language (SBML) Level 2 Version 2 , 2007 .

[12]  Paul Geladi,et al.  Principal Component Analysis (PCA): Concept, geometrical intrepretation, mathematical background, algorithms, history , 2009 .

[13]  Xiao-Jing Wang,et al.  Reconciling Coherent Oscillation with Modulationof Irregular Spiking Activity in Selective Attention:Gamma-Range Synchronization between Sensoryand Executive Cortical Areas , 2010, The Journal of Neuroscience.

[14]  J. Arnold,et al.  An ensemble method for identifying regulatory circuits with special reference to the qa gene cluster of Neurospora crassa , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Axel Kowald,et al.  Systems Biology in Practice: Concepts, Implementation and Application , 2005 .

[16]  J. Stelling,et al.  Ensemble modeling for analysis of cell signaling dynamics , 2007, Nature Biotechnology.

[17]  El-Ghazali Talbi,et al.  Metaheuristics - From Design to Implementation , 2009 .

[18]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[19]  Jacob Roll,et al.  Systems biology: model based evaluation and comparison of potential explanations for given biological data , 2009, The FEBS journal.

[20]  Jörg Stelling,et al.  Systems analysis of cellular networks under uncertainty , 2009, FEBS letters.

[21]  Pablo A. Parrilo,et al.  Efficient classification of complete parameter regions based on semidefinite programming , 2007, BMC Bioinformatics.

[22]  Herschel Rabitz,et al.  A general analysis of exact lumping in chemical kinetics , 1989 .

[23]  Christopher R. Myers,et al.  Universally Sloppy Parameter Sensitivities in Systems Biology Models , 2007, PLoS Comput. Biol..

[24]  S. Moore,et al.  Pathogenesis of atherosclerosis. , 1985, Metabolism: clinical and experimental.

[25]  Mauro Birattari,et al.  Tuning Metaheuristics - A Machine Learning Perspective , 2009, Studies in Computational Intelligence.

[26]  A. Tedgui,et al.  Regulatory T‐cell immunity and its relevance to atherosclerosis , 2008, Journal of internal medicine.

[27]  Y. Ho,et al.  Simple Explanation of the No-Free-Lunch Theorem and Its Implications , 2002 .

[28]  Hiroaki Kitano,et al.  The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models , 2003, Bioinform..

[29]  J. Tegnér,et al.  Transcriptional Profiling Uncovers a Network of Cholesterol-Responsive Atherosclerosis Target Genes , 2008, PLoS genetics.

[30]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[31]  Axel Legay,et al.  Statistical Model Checking in BioLab: Applications to the Automated Analysis of T-Cell Receptor Signaling Pathway , 2008, CMSB.

[32]  J. Tegnér,et al.  Deconstructing the core dynamics from a complex time-lagged regulatory biological circuit. , 2009, IET systems biology.

[33]  D A Rand,et al.  Mapping global sensitivity of cellular network dynamics: sensitivity heat maps and a global summation law , 2008, Journal of The Royal Society Interface.

[34]  Armin Biere,et al.  Bounded model checking , 2003, Adv. Comput..

[35]  P. Goldman-Rakic,et al.  Synaptic mechanisms and network dynamics underlying spatial working memory in a cortical network model. , 2000, Cerebral cortex.

[36]  Nava Rubin,et al.  Dynamical characteristics common to neuronal competition models. , 2007, Journal of neurophysiology.

[37]  Jens Timmer,et al.  Data-based identifiability analysis of non-linear dynamical models , 2007, Bioinform..

[38]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  S Waldherr,et al.  Parameter identification, experimental design and model falsification for biological network models using semidefinite programming. , 2010, IET systems biology.

[40]  M. Crowther,et al.  Pathogenesis of atherosclerosis. , 2005, Hematology. American Society of Hematology. Education Program.

[41]  Guillermo A. Cecchi,et al.  When the Optimal Is Not the Best: Parameter Estimation in Complex Biological Models , 2010, PloS one.

[42]  Aldons J. Lusis,et al.  Atherosclerosis : Vascular biology , 2000 .

[43]  Hanna M. Härdin,et al.  Simplified yet highly accurate enzyme kinetics for cases of low substrate concentrations , 2009, The FEBS journal.

[44]  Gilles Clermont,et al.  Computational disease modeling – fact or fiction? , 2009, BMC Systems Biology.

[45]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[46]  Erik De Schutter,et al.  Complex Parameter Landscape for a Complex Neuron Model , 2006, PLoS Comput. Biol..

[47]  Leon Aarons,et al.  A method for robust model order reduction in pharmacokinetics , 2009, Journal of Pharmacokinetics and Pharmacodynamics.

[48]  Jacob Roll,et al.  Model-Based Hypothesis Testing of Key Mechanisms in Initial Phase of Insulin Signaling , 2008, PLoS Comput. Biol..

[49]  Z. Kutalik,et al.  Estimating parameters for generalized mass action models using constraint propagation. , 2007, Mathematical biosciences.

[50]  M. Mavrovouniotis,et al.  Simplification of Mathematical Models of Chemical Reaction Systems. , 1998, Chemical reviews.