Using gene expression programming to infer gene regulatory networks from time-series data

Gene regulatory networks inference is currently a topic under heavy research in the systems biology field. In this paper, gene regulatory networks are inferred via evolutionary model based on time-series microarray data. A non-linear differential equation model is adopted. Gene expression programming (GEP) is applied to identify the structure of the model and least mean square (LMS) is used to optimize the parameters in ordinary differential equations (ODEs). The proposed work has been first verified by synthetic data with noise-free and noisy time-series data, respectively, and then its effectiveness is confirmed by three real time-series expression datasets. Finally, a gene regulatory network was constructed with 12 Yeast genes. Experimental results demonstrate that our model can improve the prediction accuracy of microarray time-series data effectively.

[1]  Cândida Ferreira,et al.  Gene Expression Programming: Mathematical Modeling by an Artificial Intelligence (Studies in Computational Intelligence) , 2006 .

[2]  Cândida Ferreira,et al.  Designing Neural Networks Using Gene Expression Programming , 2004, WSC.

[3]  C. Garino,et al.  Computational Biology and Chemistry , 2015 .

[4]  Zheng Li,et al.  Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic Bayesian network analysis , 2011, Bioinform..

[5]  Ankush Mittal,et al.  Model gene network by semi-fixed Bayesian network , 2006, Expert Syst. Appl..

[6]  Changjie Tang,et al.  Distance Guided Classification with Gene Expression Programming , 2006, ADMA.

[7]  Liang Tang,et al.  Mining Class Contrast Functions by Gene Expression Programming , 2009, ADMA.

[8]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[9]  Cândida Ferreira,et al.  Gene Expression Programming: A New Adaptive Algorithm for Solving Problems , 2001, Complex Syst..

[10]  Weimin Xiao,et al.  Evolving accurate and compact classification rules with gene expression programming , 2003, IEEE Trans. Evol. Comput..

[11]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations , 2002, Pacific Symposium on Biocomputing.

[12]  L. Guarente,et al.  Regulation of the yeast CYT1 gene encoding cytochrome c1 by HAP1 and HAP2/3/4. , 1991, Molecular and cellular biology.

[13]  Hod Lipson,et al.  Automated reverse engineering of nonlinear dynamical systems , 2007, Proceedings of the National Academy of Sciences.

[14]  Fang-Xiang Wu,et al.  Modeling Gene Expression from Microarray Expression Data with State-Space Equations , 2003, Pacific Symposium on Biocomputing.

[15]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[16]  Yuehui Chen,et al.  Reverse engineering of gene regulatory networks using flexible neural tree models , 2013, Neurocomputing.

[17]  Andreas Stafylopatis,et al.  Data Mining based on Gene Expression Programming and Clonal Selection , 2006, 2006 IEEE International Conference on Evolutionary Computation.

[18]  Adrian E. Raftery,et al.  Model-based clustering and data transformations for gene expression data , 2001, Bioinform..

[19]  Zidong Wang,et al.  An Extended Kalman Filtering Approach to Modeling Nonlinear Dynamic Gene Regulatory Networks via Short Gene Expression Time Series , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[20]  P. Woolf,et al.  A fuzzy logic approach to analyzing gene expression data. , 2000, Physiological genomics.

[21]  Zoubin Ghahramani,et al.  Learning Dynamic Bayesian Networks , 1997, Summer School on Neural Networks.

[22]  Jinde Cao,et al.  On Delayed Genetic Regulatory Networks With Polytopic Uncertainties: Robust Stability Analysis , 2008, IEEE Transactions on NanoBioscience.

[23]  Hitoshi Iba,et al.  Evolutionary modeling and inference of gene network , 2002, Inf. Sci..

[24]  Jae-Woo Chang,et al.  Advances in Web-Age Information Management , 2001, Lecture Notes in Computer Science.

[25]  Jonas S. Almeida,et al.  Identification of neutral biochemical network models from time series data , 2009, BMC Syst. Biol..

[26]  Patrik D'haeseleer,et al.  Linear Modeling of mRNA Expression Levels During CNS Development and Injury , 1998, Pacific Symposium on Biocomputing.

[27]  Changjie Tang,et al.  Time Series Prediction Based on Gene Expression Programming , 2004, WAIM.

[28]  J. Butcher The numerical analysis of ordinary differential equations: Runge-Kutta and general linear methods , 1987 .

[29]  L. Hurley,et al.  The importance of negative superhelicity in inducing the formation of G-quadruplex and i-motif structures in the c-Myc promoter: implications for drug targeting and control of gene expression. , 2009, Journal of medicinal chemistry.

[30]  Andreas Stafylopatis,et al.  Efficient Evolution of Accurate Classification Rules Using a Combination of Gene Expression Programming and Clonal Selection , 2008, IEEE Transactions on Evolutionary Computation.

[31]  Edward R. Dougherty,et al.  Inference of Noisy Nonlinear Differential Equation Models for Gene Regulatory Networks Using Genetic Programming and Kalman Filtering , 2008, IEEE Transactions on Signal Processing.

[32]  Ahmet Sacan,et al.  Data simulation and regulatory network reconstruction from time-series microarray data using stepwise multiple linear regression , 2012, Network Modeling Analysis in Health Informatics and Bioinformatics.

[33]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[34]  R. Somogyi,et al.  Gene Expression Data Analysis and Modeling , 1999 .

[35]  C. Ball,et al.  Identification of genes periodically expressed in the human cell cycle and their expression in tumors. , 2002, Molecular biology of the cell.

[36]  Guy Karlebach,et al.  Modelling and analysis of gene regulatory networks , 2008, Nature Reviews Molecular Cell Biology.

[37]  Kevin Murphy,et al.  Modelling Gene Expression Data using Dynamic Bayesian Networks , 2006 .

[38]  B A McKinney,et al.  Hybrid grammar-based approach to nonlinear dynamical system identification from biological time series. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Zhide Hu,et al.  Study of Human Dopamine Sulfotransferases Based on Gene Expression Programming , 2011, Chemical biology & drug design.

[40]  Wei-Po Lee,et al.  A clustering-based approach for inferring recurrent neural networks as gene regulatory networks , 2008, Neurocomputing.

[41]  Masaru Tomita,et al.  E-CELL: software environment for whole-cell simulation , 1999, Bioinform..

[42]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[43]  Zidong Wang,et al.  On Robust Stability of Stochastic Genetic Regulatory Networks With Time Delays: A Delay Fractioning Approach , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[44]  Stefan Bornholdt,et al.  Boolean network models of cellular regulation: prospects and limitations , 2008, Journal of The Royal Society Interface.

[45]  Chun-Chi Liu,et al.  Bayesian approach to transforming public gene expression repositories into disease diagnosis databases , 2010, Proceedings of the National Academy of Sciences.