Experimental design for system identification of Boolean Control Networks in biology

This study is primarily motivated by biological applications and focuses on the identification of Boolean networks from scarce and noisy data. We consider two Bayesian experimental design scenarios: selection of the observations under a budget, and input design. The goal is to maximize the mutual information between models and data, that is the ultimate statistical upper bound on the identifiability of a system from empirical data. First, we introduce a method to select which components of the state variable to measure under a budget constraint, and at which time points. Our greedy approach takes advantage of the submodularity of the mutual information, and hence requires only a polynomial number of evaluations of the objective to achieve near-optimal designs. Second, we consider the computationally harder task of designing sequences of input interventions, and propose a likelihood-free approximation method. Exact and approximate design solutions are verified with predictive models of genetic regulatory interaction networks in embryonic development.

[1]  Daizhan Cheng,et al.  Analysis and Control of Boolean Networks , 2011 .

[2]  Daizhan Cheng,et al.  A Linear Representation of Dynamics of Boolean Networks , 2010, IEEE Transactions on Automatic Control.

[3]  Ettore Fornasini,et al.  Observability and reconstructibility of Boolean control networks , 2012, 2012 IEEE 51st IEEE Conference on Decision and Control (CDC).

[4]  R. Baierlein Probability Theory: The Logic of Science , 2004 .

[5]  Ettore Fornasini,et al.  Finite-horizon optimal control of Boolean control networks , 2013, 52nd IEEE Conference on Decision and Control.

[6]  Andreas Krause,et al.  Near-optimal Observation Selection using Submodular Functions , 2007, AAAI.

[7]  Ettore Fornasini,et al.  Observability, Reconstructibility and State Observers of Boolean Control Networks , 2013, IEEE Transactions on Automatic Control.

[8]  Andreas Krause,et al.  Near-optimal Nonmyopic Value of Information in Graphical Models , 2005, UAI.

[9]  Joachim M. Buhmann,et al.  Optimized expected information gain for nonlinear dynamical systems , 2009, ICML '09.

[10]  Pierre Baldi,et al.  Of bits and wows: A Bayesian theory of surprise with applications to attention , 2010, Neural Networks.

[11]  A. Wagner,et al.  Automatic Generation of Predictive Dynamic Models Reveals Nuclear Phosphorylation as the Key Msn2 Control Mechanism , 2013, Science Signaling.

[12]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[13]  Satoru Miyano,et al.  Algorithms for Identifying Boolean Networks and Related Biological Networks Based on Matrix Multiplication and Fingerprint Function , 2000, J. Comput. Biol..

[14]  Jukka Corander,et al.  Approximate Bayesian Computation , 2013, PLoS Comput. Biol..

[15]  H. Othmer,et al.  The topology of the regulatory interactions predicts the expression pattern of the segment polarity genes in Drosophila melanogaster. , 2003, Journal of theoretical biology.

[16]  S. Kauffman Metabolic stability and epigenesis in randomly constructed genetic nets. , 1969, Journal of theoretical biology.

[17]  Stefan Bornholdt,et al.  Boolean network models of cellular regulation: prospects and limitations , 2008, Journal of The Royal Society Interface.

[18]  S. Bornholdt,et al.  Boolean Network Model Predicts Cell Cycle Sequence of Fission Yeast , 2007, PloS one.

[19]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[20]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[21]  John Lygeros,et al.  Near-optimal selection of parallel inputs in Bayesian experimental design for systems biology , 2014, 2014 European Control Conference (ECC).

[22]  Joachim M. Buhmann,et al.  Near-optimal experimental design for model selection in systems biology , 2013, Bioinform..

[23]  M. Margaliot Controllability and observability of Boolean control networks , 2012 .

[24]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[25]  Joachim M. Buhmann,et al.  Stable Bayesian Parameter Estimation for Biological Dynamical Systems , 2009, 2009 International Conference on Computational Science and Engineering.