An Intelligent Two-Stage Evolutionary Algorithm for Dynamic Pathway Identification From Gene Expression Profiles

From gene expression profiles, it is desirable to rebuild cellular dynamic regulation networks to discover more delicate and substantial functions in molecular biology, biochemistry, bioengineering, and pharmaceutics. The S-system model is suitable to characterize biochemical network systems and capable of analyzing the regulatory system dynamics. However, the inference of an S-system model of N-gene genetic networks has 2N(N + 1) parameters in a set of nonlinear differential equations to be optimized. This paper proposes an intelligent two-stage evolutionary algorithm (iTEA) to efficiently infer the S-system models of genetic networks from time-series data of gene expression. To cope with the curse of dimensionality, the proposed algorithm consists of two stages, where each uses a divide-and-conquer strategy. The optimization problem is first decomposed into N subproblems having 2(N + 1) parameters each. At the first stage, each subproblem is solved using a novel intelligent genetic algorithm (IGA) with intelligent crossover based on an orthogonal experimental design (OED). At the second stage, the obtained N solutions to the N subproblems are combined and refined using an OED-based simulated annealing algorithm for handling noisy gene expression profiles. The effectiveness of iTEA is evaluated using simulated expression patterns with and without noise running on a single-processor PC. It is shown that 1) IGA is efficient enough to solve subproblems, 2) IGA is significantly superior to the existing method GA with simplex crossover (SPXGA), and 3) iTEA performs well in inferring S-system models for dynamic pathway identification.

[1]  A Sorribas,et al.  Metabolic pathway characterization from transient response data obtained in situ: parameter estimation in S-system models. , 1993, Journal of theoretical biology.

[2]  Tapan P. Bagchi,et al.  Taguchi methods explained : practical steps to robust design , 1993 .

[3]  Araceli M. Huerta,et al.  From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. , 1998, BioEssays : news and reviews in molecular, cellular and developmental biology.

[4]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[5]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[6]  Satoru Miyano,et al.  Algorithms for identifying Boolean networks and related biological networks based on matrix multiplication and fingerprint function , 2000, RECOMB '00.

[7]  Lih-Yuan Deng,et al.  Orthogonal Arrays: Theory and Applications , 1999, Technometrics.

[8]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[9]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[10]  Masahiro Okamoto,et al.  Efficient Numerical Optimization Algorithm Based on Genetic Algorithm for Inverse Problem , 2000, GECCO.

[11]  Richard D. Wesel,et al.  Quasi-convexity and optimal binary fusion for distributed detection with identical sensors in generalized Gaussian noise , 2001, IEEE Trans. Inf. Theory.

[12]  P. Brazhnik,et al.  Gene networks: how to put the function in genomics. , 2002, Trends in biotechnology.

[13]  Takanori Ueda,et al.  Inference of Genetic Network Using the Expression Profile Time Course Data of Mouse P19 Cells , 2002 .

[14]  Masaru Tomita,et al.  Dynamic modeling of genetic networks using genetic algorithm and S-system , 2003, Bioinform..

[15]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[16]  Isao Ono,et al.  Finding multiple solutions based on an evolutionary algorithm for inference of genetic networks by S-system , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[17]  Carmen G. Moles,et al.  Parameter estimation in biochemical pathways: a comparison of global optimization methods. , 2003, Genome research.

[18]  Jonas S. Almeida,et al.  Decoupling dynamical systems for pathway identification from metabolic profiles , 2004, Bioinform..

[19]  Shinn-Ying Ho,et al.  OSA: orthogonal simulated annealing algorithm and its application to designing mixed H2/H∞ optimal controllers , 2004, IEEE Trans. Syst. Man Cybern. Part A.

[20]  Paul P. Wang,et al.  Advances to Bayesian network inference for generating causal networks from observational biological data , 2004, Bioinform..

[21]  Shinn-Ying Ho,et al.  Intelligent evolutionary algorithms for large parameter optimization problems , 2004, IEEE Transactions on Evolutionary Computation.

[22]  S. Kimura,et al.  Inference of S-system Models of Genetic Networks from Noisy Time-series Data , 2004 .

[23]  Min Zou,et al.  A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data , 2005, Bioinform..

[24]  Feng-Sheng Wang,et al.  Evolutionary optimization with data collocation for reverse engineering of biological networks , 2005, Bioinform..

[25]  Shuhei Kimura,et al.  Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm , 2005, Bioinform..

[26]  Shinn-Ying Ho,et al.  Optimizing fuzzy neural networks for tuning PID controllers using an orthogonal simulated annealing algorithm OSA , 2006, IEEE Transactions on Fuzzy Systems.

[27]  Paul Horton,et al.  Inference of Scale-free Networks from Gene Expression Time Series , 2006, J. Bioinform. Comput. Biol..

[28]  H. Szu Fast simulated annealing , 1987 .