Inference of complex biological networks: distinguishability issues and optimization-based solutions

Background
The inference of biological networks from high-throughput data has received considerable attention during the last decade and can be considered an important problem class in systems biology. However, it has been recognized that reliable network inference remains an unsolved problem. Most authors have identified lack of data and deficiencies in the inference algorithms as the main reasons for this situation.

Results
We claim that another major difficulty in solving these inference problems is the frequent lack of uniqueness of many of these networks, especially when prior assumptions have not been properly taken into account. Our contributions aid the distinguishability analysis of chemical reaction network (CRN) models with mass action dynamics. The novel methods are based on linear programming (LP) and therefore allow the efficient analysis of CRNs containing several hundred complexes and reactions. Using these new tools, together with previously published ones, to obtain the network structure of biological systems from the literature, we find that a unique topology often cannot be determined, even if the structure of the corresponding mathematical model is assumed to be known and all dynamical variables are measurable. In other words, certain mechanisms may remain undetected (or may be falsely detected) while the inferred model is fully consistent with the measured data. It is also shown that sparsity-enforcing approaches for determining 'true' reaction structures are generally not enough without additional prior information.

Conclusions
The inference of biological networks can be an extremely challenging problem even in the utopian case of perfect experimental information. Unfortunately, the practical situation is often more complex than that, since the measurements are typically incomplete, noisy and sometimes not dynamically rich enough, introducing further obstacles to the structure/parameter estimation process. In this paper, we show how the structural uniqueness and identifiability of the models can be guaranteed by carefully adding extra constraints, and that these important properties can be checked through appropriate computational methods.
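
The lack of uniqueness described above can be illustrated with a toy example. The sketch below is not taken from the paper: the single species X, the complexes 0, X and 2X, the rate constants and all function names are hypothetical choices made only for illustration. It builds the Kirchhoff matrix of two different mass action networks and checks directly that both give the same differential equation, so the extra mechanism X -> 2X in the first network could not be detected from measurements of x(t). This is a direct equality check, not the LP-based method referred to in the abstract.

```python
# Toy illustration (assumed example, not from the paper): two different
# mass action networks over one species X that yield the same ODE.

# Stoichiometric coefficient of X in each complex: "0", "X", "2X".
COMPLEXES = [0, 1, 2]


def kirchhoff(reactions, m):
    """Build the m x m Kirchhoff matrix from {(source, target): rate}."""
    A = [[0.0] * m for _ in range(m)]
    for (src, dst), k in reactions.items():
        A[dst][src] += k  # inflow into the product complex
        A[src][src] -= k  # outflow from the source complex
    return A


def monomial_coefficients(reactions):
    """Return the row vector Y * A_k: one coefficient per monomial x**y_j."""
    m = len(COMPLEXES)
    A = kirchhoff(reactions, m)
    return [sum(COMPLEXES[i] * A[i][j] for i in range(m)) for j in range(m)]


def rhs(reactions, x):
    """Evaluate dx/dt = Y * A_k * psi(x) at concentration x."""
    coeffs = monomial_coefficients(reactions)
    return sum(c * x ** y for c, y in zip(coeffs, COMPLEXES))


# Complex indices: 0 is "0", 1 is "X", 2 is "2X".
network_a = {(1, 0): 2.0, (1, 2): 1.0}  # X -> 0 (rate 2) and X -> 2X (rate 1)
network_b = {(1, 0): 1.0}               # X -> 0 (rate 1) only

if __name__ == "__main__":
    print("network A:", monomial_coefficients(network_a))
    print("network B:", monomial_coefficients(network_b))
    print("same dynamics:",
          monomial_coefficients(network_a) == monomial_coefficients(network_b))
```

Both networks reduce to dx/dt = -x, although their reaction graphs differ. In this toy case the sparser network happens to be unique, but in general additional prior constraints (for example, excluding certain reactions) are what single out one structure among the dynamically equivalent ones.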
