Identification of biochemical networks by S-tree based genetic programming

MOTIVATION Most previous approaches to model biochemical networks have focused either on the characterization of a network structure with a number of components or on the estimation of kinetic parameters of a network with a relatively small number of components. For system-level understanding, however, we should examine both the interactions among the components and the dynamic behaviors of the components. A key obstacle to this simultaneous identification of the structure and parameters is the lack of data compared with the relatively large number of parameters to be estimated. Hence, there are many plausible networks for the given data, but most of them are not likely to exist in the real system. RESULTS We propose a new representation named S-trees for both the structural and dynamical modeling of a biochemical network within a unified scheme. We further present S-tree based genetic programming to identify the structure of a biochemical network and to estimate the corresponding parameter values at the same time. While other evolutionary algorithms require additional techniques for sparse structure identification, our approach can automatically assemble the sparse primitives of a biochemical network in an efficient way. We evaluate our algorithm on the dynamic profiles of an artificial genetic network. In 20 trials for four settings, we obtain the true structure and their relative squared errors are <5% regardless of releasing constraints about structural sparseness. In addition, we confirm that the proposed algorithm is robust within +/-10% noise ratio. Furthermore, the proposed approach ensures a reasonable estimate of a real yeast fermentation pathway. The comparatively less important connections with non-zero parameters can be detected even though their orders are below 10(-2). To demonstrate the usefulness of the proposed algorithm for real experimental biological data, we provide an additional example on the transcriptional network of SOS response to DNA damage in Escherichia coli. We confirm that the proposed algorithm can successfully identify the true structure except only one relation.

[1]  G. Walker,et al.  The SOS response: recent insights into umuDC-dependent mutagenesis and DNA damage tolerance. , 2000, Annual review of genetics.

[2]  C. Rao,et al.  Control, exploitation and tolerance of intracellular noise , 2002, Nature.

[3]  Masaru Tomita,et al.  Dynamic modeling of genetic networks using genetic algorithm and S-system , 2003, Bioinform..

[4]  J. W. Atmar,et al.  Comparing genetic operators with gaussian mutations in simulated evolutionary processes using linear systems , 1990, Biological Cybernetics.

[5]  Jason A. Papin,et al.  Metabolic pathways in the post-genome era. , 2003, Trends in biochemical sciences.

[6]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[7]  Isao Ono,et al.  Finding multiple solutions based on an evolutionary algorithm for inference of genetic networks by S-system , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[8]  U. Alon,et al.  Assigning numbers to the arrows: Parameterizing a gene regulation network by using accurate expression kinetics , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Kwang-Hyun Cho,et al.  Unravelling the functional interaction structure of a cellular network from temporal slope information of experimental data , 2005, The FEBS journal.

[10]  Andreas Zell,et al.  Optimizing Topology and Parameters of Gene Regulatory Network Models from Time-Series Experiments , 2004, GECCO.

[11]  Jonas S. Almeida,et al.  Decoupling dynamical systems for pathway identification from metabolic profiles , 2004, Bioinform..

[12]  Olaf Wolkenhauer,et al.  A unified framework for unraveling the functional interaction structure of a biomolecular network based on stimulus‐response experimental data , 2005, FEBS letters.

[13]  Tughrul Arslan,et al.  Proceedings Of The 2000 Congress On Evolutionary Computation , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[14]  Shuhei Kimura,et al.  Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm , 2005, Bioinform..

[15]  Kwang-Hyun Cho,et al.  Systems biology: Looking at opportunities and challenges in applying systems theory to molecular and cell biology , 2003 .

[16]  Kwang-Hyun Cho,et al.  Identification of small scale biochemical networks based on general type system perturbations , 2005, The FEBS journal.

[17]  Hitoshi Iba,et al.  Evolutionary modeling and inference of gene network , 2002, Inf. Sci..

[18]  Markus J. Herrgård,et al.  Reconstruction of microbial transcriptional regulatory networks. , 2004, Current opinion in biotechnology.

[19]  H. D. Jong,et al.  Qualitative simulation of genetic regulatory networks using piecewise-linear models , 2004, Bulletin of mathematical biology.

[20]  David B. Fogel,et al.  System Identification Through Simulated Evolution: A Machine Learning Approach to Modeling , 1991 .

[21]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[22]  Xin Yao,et al.  Stochastic ranking for constrained evolutionary optimization , 2000, IEEE Trans. Evol. Comput..

[23]  Takanori Ueda,et al.  Inference of Genetic Network Using the Expression Profile Time Course Data of Mouse P19 Cells , 2002 .

[24]  Araceli M. Huerta,et al.  From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. , 1998, BioEssays : news and reviews in molecular, cellular and developmental biology.

[25]  KonagayaAkihiko,et al.  Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm , 2005 .

[26]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[27]  S. Rafii,et al.  Splitting vessels: Keeping lymph apart from blood , 2003, Nature Medicine.

[28]  B O Palsson,et al.  Metabolic modeling of microbial strains in silico. , 2001, Trends in biochemical sciences.

[29]  Georges R. Harik,et al.  Finding Multimodal Solutions Using Restricted Tournament Selection , 1995, ICGA.

[30]  Byoung-Tak Zhang,et al.  Balancing Accuracy and Parsimony in Genetic Programming , 1995, Evolutionary Computation.

[31]  Marta Cascante,et al.  Multicriteria optimization of biochemical systems by linear programming: application to production of ethanol by Saccharomyces cerevisiae. , 2003, Biotechnology and bioengineering.

[32]  J. Stelling Mathematical models in microbial systems biology. , 2004, Current opinion in microbiology.

[33]  Feng-Sheng Wang,et al.  Evolutionary optimization with data collocation for reverse engineering of biological networks , 2005, Bioinform..

[34]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[35]  M. Tomita,et al.  Reverse engineering of biochemical equations from time-course data by means of genetic programming. , 2005, Bio Systems.

[36]  Byoung-Tak Zhang,et al.  Evolutionary Induction of Sparse Neural Trees , 1997, Evolutionary Computation.

[37]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[38]  Masahiro Okamoto,et al.  Efficient Numerical Optimization Algorithm Based on Genetic Algorithm for Inverse Problem , 2000, GECCO.