Over the last decade, executable models of biological behaviors have repeatedly provided new scientific discoveries, uncovered novel insights, and directed new experimental avenues. These models are computer programs whose execution mechanistically simulates aspects of the cell’s behaviors. If the observed behavior of the program agrees with the observed biological behavior, then the program explains the phenomena. This approach has proven beneficial for gaining new biological insights and directing new experimental avenues. One advantage of this approach is that techniques for analysis of computer programs can be applied to the analysis of executable models. For example, one can confirm that a model agrees with experiments for all possible executions of the model (corresponding to all environmental conditions), even if there are a huge number of executions. Various formal methods have been adapted for this context, for example, model checking or symbolic analysis of state spaces. To avoid manual construction of executable models, one can apply synthesis, a method to produce programs automatically from high-level specifications. In the context of biological modeling, synthesis would correspond to extracting executable models from experimental data. We survey recent results about the usage of the techniques underlying synthesis of computer programs for the inference of biological models from experimental data. We describe synthesis of biological models from curated mutation experiment data, inferring network connectivity models from phosphoproteomic data, and synthesis of Boolean networks from gene expression data. While much work has been done on automated analysis of similar datasets using machine learning and artificial intelligence, using synthesis techniques provides new opportunities such as efficient computation of disambiguating experiments, as well as the ability to produce different kinds of models automatically from biological data.
[1]
Thomas A. Henzinger,et al.
Predictive Modeling of Signaling Crosstalk during C. elegans Vulval Development
,
2007,
PLoS Comput. Biol..
[2]
Emina Torlak,et al.
A lightweight symbolic virtual machine for solver-aided host languages
,
2014,
PLDI.
[3]
Julio Saez-Rodriguez,et al.
Exhaustively characterizing feasible logic models of a signaling network using Answer Set Programming
,
2013,
Bioinform..
[4]
Martin Fränzle,et al.
Stochastic Satisfiability Modulo Theory: A Novel Technique for the Analysis of Probabilistic Hybrid Systems
,
2008,
HSCC.
[5]
T. Henzinger,et al.
Executable cell biology
,
2007,
Nature Biotechnology.
[6]
Moshe Y. Vardi.
From Verification to Synthesis
,
2008,
VSTTE.
[7]
S. Kauffman.
Metabolic stability and epigenesis in randomly constructed genetic nets.
,
1969,
Journal of theoretical biology.
[8]
Jonathan M. Garibaldi,et al.
Parameter Estimation Using Metaheuristics in Systems Biology: A Comprehensive Review
,
2012,
IEEE/ACM Transactions on Computational Biology and Bioinformatics.
[9]
S. Emmott,et al.
Defining an essential transcription factor program for naïve pluripotency
,
2014,
Science.
[10]
Rastislav Bodík,et al.
Synthesis of biological models from mutation experiments
,
2013,
POPL.
[11]
Roded Sharan,et al.
Reconstructing Boolean Models of Signaling
,
2012,
RECOMB.
[12]
Ioannis Xenarios,et al.
Hard-wired heterogeneity in blood stem cells revealed using a dynamic regulatory network model
,
2013,
Bioinform..
[13]
David Harel,et al.
Emergent Dynamics of Thymocyte Development and Lineage Determination
,
2006,
PLoS Comput. Biol..
[14]
Orna Kupferman,et al.
Recent Challenges and Ideas in Temporal Synthesis
,
2012,
SOFSEM.
[15]
Stephan Merz,et al.
Model Checking
,
2000
.