Multi-stage Evolutionary Algorithms for Efficient Identification of Gene Regulatory Networks

With the availability of the time series data from the high-throughput technologies, diverse approaches have been proposed to model gene regulatory networks. Compared with others, S-system has the advantage for these tasks in the sense that it can provide both quantitative (structural) and qualitative (dynamical) modeling in one framework. However, it is not easy to identify the structure of the true network since the number of parameters to be estimated is much larger than that of the available data. Moreover, conventional parameter estimation requires the time-consuming numerical integration to reproduce dynamic profiles for the S-system. In this paper, we propose multi-stage evolutionary algorithms to identify gene regulatory networks efficiently. With the symbolic regression by genetic programming (GP), we can evade the numerical integration steps. This is because the estimation of slopes for each time-course data can be obtained from the results of GP. We also develop hybrid evolutionary algorithms and modified fitness evaluation function to identify the structure of gene regulatory networks and to estimate the corresponding parameters at the same time. By applying the proposed method to the identification of an artificial genetic network, we verify its capability of finding the true S-system.

[1]  M. Savageau Biochemical Systems Analysis: A Study of Function and Design in Molecular Biology , 1976 .

[2]  Masaru Tomita,et al.  Dynamic modeling of genetic networks using genetic algorithm and S-system , 2003, Bioinform..

[3]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[4]  Andreas Zell,et al.  Optimizing Topology and Parameters of Gene Regulatory Network Models from Time-Series Experiments , 2004, GECCO.

[5]  H. D. Jong,et al.  Qualitative simulation of genetic regulatory networks using piecewise-linear models , 2004, Bulletin of mathematical biology.

[6]  Riccardo Poli,et al.  Genetic and Evolutionary Computation – GECCO 2004 , 2004, Lecture Notes in Computer Science.

[7]  K. F. Tipton,et al.  Biochemical systems analysis: A study of function and design in molecular biology , 1978 .

[8]  C. Rao,et al.  Control, exploitation and tolerance of intracellular noise , 2002, Nature.

[9]  Thomas Bäck,et al.  Evolutionary Algorithms in Theory and Practice , 1996 .

[10]  Masahiro Okamoto,et al.  Efficient Numerical Optimization Algorithm Based on Genetic Algorithm for Inverse Problem , 2000, GECCO.

[11]  J. Stelling Mathematical models in microbial systems biology. , 2004, Current opinion in microbiology.

[12]  Feng-Sheng Wang,et al.  Evolutionary optimization with data collocation for reverse engineering of biological networks , 2005, Bioinform..

[13]  Hitoshi Iba,et al.  Evolutionary modeling and inference of gene network , 2002, Inf. Sci..

[14]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[15]  David B. Fogel,et al.  System Identification Through Simulated Evolution: A Machine Learning Approach to Modeling , 1991 .

[16]  Eberhard O Voit,et al.  Neural-network-based parameter estimation in S-system models of biological networks. , 2003, Genome informatics. International Conference on Genome Informatics.

[17]  B O Palsson,et al.  Metabolic modeling of microbial strains in silico. , 2001, Trends in biochemical sciences.

[18]  Georges R. Harik,et al.  Finding Multimodal Solutions Using Restricted Tournament Selection , 1995, ICGA.

[19]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[20]  Anthony Trewavas,et al.  Much food, many problems , 1999, Nature.