Initial state perturbations as a validation method for data-driven fuzzy models of cellular networks

Background: Data-driven methods that automatically learn relations between attributes from given data are a popular tool for building mathematical models in computational biology. Since measurements are prone to errors, approaches that deal with uncertain data are especially suitable for this task. Fuzzy models are one such approach, but they contain a large number of parameters and are thus susceptible to over-fitting. Validation methods that help detect over-fitting are therefore needed to eliminate inaccurate models.

Results: We propose a method to enlarge the validation datasets on which a fuzzy dynamic model of a cellular network can be tested. We apply our method to two data-driven dynamic models of the MAPK signalling pathway and two models of the mammalian circadian clock. We show that random initial state perturbations can drastically increase the mean prediction error of an inaccurate computational model, while keeping the prediction errors of accurate models small.

Conclusions: With the improvement of validation methods, fuzzy models are becoming more accurate and are thus likely to gain new applications. This field of research is promising not only because fuzzy models can cope with uncertainty, but also because their run time is short compared to that of the conventional modelling methods currently used in systems biology.
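The validation idea described in the Results can be illustrated with a minimal sketch. It is not the authors' implementation and uses no fuzzy model. Every name in it (`simulate`, `perturbed_mean_error`, `reference_step`, `accurate_step`, `overfit_step`) is hypothetical, the one-variable decay system is a toy stand-in for a cellular network, and the noise level, trial count and error measure are assumptions. The sketch also assumes that a trusted reference model is available to produce trajectories from the perturbed initial states.

```python
import random

def simulate(step, x0, n_steps):
    """Iterate a one-step model from initial state x0 and return the trajectory."""
    traj = [list(x0)]
    for _ in range(n_steps):
        traj.append(step(traj[-1]))
    return traj

def mean_abs_error(traj_a, traj_b):
    """Mean absolute difference over all time points and state variables."""
    diffs = [abs(a - b) for sa, sb in zip(traj_a, traj_b) for a, b in zip(sa, sb)]
    return sum(diffs) / len(diffs)

def perturbed_mean_error(reference_step, candidate_step, x0,
                         n_steps, n_trials, rel_noise, seed=0):
    """Average candidate-vs-reference error over randomly perturbed initial states."""
    rng = random.Random(seed)
    errors = []
    for _ in range(n_trials):
        # Scale each component by a random factor in [1 - rel_noise, 1 + rel_noise].
        x0_p = [x * (1.0 + rng.uniform(-rel_noise, rel_noise)) for x in x0]
        ref = simulate(reference_step, x0_p, n_steps)
        cand = simulate(candidate_step, x0_p, n_steps)
        errors.append(mean_abs_error(ref, cand))
    return sum(errors) / len(errors)

# Toy reference system (assumption): simple exponential decay.
def reference_step(state):
    return [0.9 * x for x in state]

# Candidate that captures the dynamics with a slightly wrong rate.
def accurate_step(state):
    return [0.899 * x for x in state]

X0, N_STEPS = [1.0], 30
TRAINING = simulate(reference_step, X0, N_STEPS)

# Over-fitted candidate: memorises the single training trajectory and
# returns the stored successor of the nearest training state.
def overfit_step(state):
    i = min(range(len(TRAINING) - 1),
            key=lambda k: abs(TRAINING[k][0] - state[0]))
    return list(TRAINING[i + 1])

# Error of the over-fitted model on the unperturbed (training) initial state.
nominal_err = mean_abs_error(simulate(reference_step, X0, N_STEPS),
                             simulate(overfit_step, X0, N_STEPS))
accurate_err = perturbed_mean_error(reference_step, accurate_step, X0, N_STEPS, 200, 0.2)
overfit_err = perturbed_mean_error(reference_step, overfit_step, X0, N_STEPS, 200, 0.2)
print(nominal_err, accurate_err, overfit_err)
```

In this toy setting the memorising model reproduces the training trajectory exactly, so validation on the original initial state alone would not flag it. Its error only becomes visible once the initial state is perturbed, while the model that captured the underlying dynamics keeps a small error.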
