Automatic validation of computational models using pseudo-3D spatio-temporal model checking

Background: Computational models play an increasingly important role in systems biology for generating predictions and in synthetic biology as executable prototypes/designs. For real-life (clinical) applications there is a need to scale up and build more complex spatio-temporal multiscale models; these could enable investigating how changes at small scales are reflected at large scales and vice versa. Results generated by computational models can be applied to real-life applications only if the models have been validated first. Traditional in silico model checking techniques only capture how non-dimensional properties (e.g. concentrations) evolve over time and are suitable for small-scale systems (e.g. metabolic pathways). The validation of larger-scale systems (e.g. multicellular populations) additionally requires capturing how spatial patterns and their properties change over time, which traditional non-spatial approaches do not consider.

Results: We developed and implemented a methodology for the automatic validation of computational models with respect to both their spatial and temporal properties. Stochastic biological systems are represented by abstract models which assume a linear structure of time and a pseudo-3D representation of space (2D space plus a density measure). Time series data generated by such models is provided as input to parameterised image processing modules which automatically detect and analyse spatial patterns (e.g. cells) and clusters of such patterns (e.g. cellular populations). To capture how spatial and numeric properties change over time, the Probabilistic Bounded Linear Spatial Temporal Logic is introduced. Given a collection of time series data and a formal spatio-temporal specification, the model checker Mudi (http://mudi.modelchecking.org) determines probabilistically whether the formal specification holds for the computational model. Mudi is an approximate probabilistic model checking platform which enables users to choose among frequentist and Bayesian validation approaches based on either estimation or statistical hypothesis testing. We illustrate the expressivity and efficiency of our approach using two biological case studies: phase variation patterning in bacterial colony growth and the chemotactic aggregation of cells.

Conclusions: The formal methodology implemented in Mudi enables the validation of computational models against spatio-temporal logic properties and is a precursor to the development and validation of more complex multidimensional and multiscale models.
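The estimate-based validation idea described above can be illustrated with a minimal Python sketch. It is not Mudi's implementation or API. Every function name, the toy "cluster area" model and the example property are assumptions introduced here for illustration only. The frequentist part uses the standard Chernoff-Hoeffding bound to choose how many simulation traces to check; the Bayesian part computes the posterior mean under an assumed Beta prior.

```python
import math
import random


def hoeffding_sample_size(epsilon, delta):
    """Traces needed so that |estimate - p| <= epsilon with probability >= 1 - delta
    (Chernoff-Hoeffding bound: n >= ln(2 / delta) / (2 * epsilon^2))."""
    return math.ceil(math.log(2.0 / delta) / (2.0 * epsilon ** 2))


def estimate_probability(sample_holds, epsilon, delta):
    """Frequentist estimate: fraction of sampled traces that satisfy the property.
    `sample_holds` is a hypothetical zero-argument callable returning True/False."""
    n = hoeffding_sample_size(epsilon, delta)
    successes = sum(1 for _ in range(n) if sample_holds())
    return successes / n, n


def bayesian_posterior_mean(successes, n, alpha=1.0, beta=1.0):
    """Posterior mean of the satisfaction probability under a Beta(alpha, beta) prior
    (the uniform prior alpha = beta = 1 is an assumption of this sketch)."""
    return (successes + alpha) / (n + alpha + beta)


def simulate_trace(rng, steps=20):
    """Toy stand-in for one simulation run: a noisy, growing 'cluster area' series."""
    area, trace = 1.0, []
    for _ in range(steps):
        area = max(0.0, area + rng.gauss(0.5, 1.0))
        trace.append(area)
    return trace


def eventually_exceeds(trace, threshold, bound):
    """Bounded 'eventually': the value exceeds `threshold` within the first `bound` steps."""
    return any(value > threshold for value in trace[:bound])


if __name__ == "__main__":
    rng = random.Random(42)

    def holds():
        # Simulate one trace and evaluate the toy property on it.
        return eventually_exceeds(simulate_trace(rng), threshold=8.0, bound=15)

    p_hat, n = estimate_probability(holds, epsilon=0.05, delta=0.05)
    print(f"traces checked: {n}, frequentist estimate: {p_hat:.3f}")
    print(f"Bayesian posterior mean: {bayesian_posterior_mean(round(p_hat * n), n):.3f}")
```

In a real setting the toy `simulate_trace` would be replaced by time series produced by the computational model, and `eventually_exceeds` by the evaluation of a spatio-temporal logic property on the detected patterns and clusters.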
