A model-based optimization framework for the inference on gene regulatory networks from DNA array data

MOTIVATION Identification of the regulatory structures in genetic networks and the formulation of mechanistic models in the form of wiring diagrams is one of the significant objectives of expression profiling using DNA microarray technologies and it requires the development and application of identification frameworks. RESULTS We have developed a novel optimization framework for identifying regulation in a genetic network using the S-system modeling formalism. We show that balance equations on both mRNA and protein species led to a formulation suitable for analyzing DNA-microarray data whereby protein concentrations have been eliminated and only mRNA relative concentrations are retained. Using this formulation, we examined if it is possible to infer a set of possible genetic regulatory networks consistent with observed mRNA expression patterns. Two origins of changes in mRNA expression patterns were considered. One derives from changes in the biophysical properties of the system that alter the molecular-interaction kinetics and/or message stability. The second is due to gene knock-outs. We reduced the identification problem to an optimization problem (of the so-called mixed-integer non-linear programming class) and we developed an algorithmic procedure for solving this optimization problem. Using simulated data generated by our mathematical model, we show that our method can actually find the regulatory network from which the data were generated. We also show that the number of possible alternate genetic regulatory networks depends on the size of the dataset (i.e. number of experiments), but this dependence is different for each of the two types of problems considered, and that a unique solution requires fewer datasets than previously estimated in the literature. This is the first method that also allows the identification of every possible regulatory network that could explain the data, when the number of experiments does not allow identification of unique regulatory structure.

[1]  Dennis J. Michaud,et al.  eXPatGen: Generating Dynamic Expression Patterns for the Systematic Evaluation of Analytical Methods , 2003, Bioinform..

[2]  Madhukar S. Dasika,et al.  A Mixed Integer Linear Programming (MILP) Framework for Inferring Time Delay in Gene Regulatory Networks , 2004, Pacific Symposium on Biocomputing.

[3]  Ying Wang,et al.  Theoretical and computational studies of the glucose signaling pathways in yeast using global gene expression data , 2003, Biotechnology and bioengineering.

[4]  M. Savageau Biochemical systems analysis. II. The steady-state solutions for an n-pool system using a power-law approximation. , 1969, Journal of theoretical biology.

[5]  M A Savageau,et al.  Biochemical systems theory: operational differences among variant representations and their significance. , 1991, Journal of theoretical biology.

[6]  V. Thorsson,et al.  Discovery of regulatory interactions through perturbation: inference and experimental design. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[7]  D. Botstein,et al.  Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF , 2001, Nature.

[8]  Roger E Bumgarner,et al.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. , 2001, Science.

[9]  R. Somogyi,et al.  The gene expression matrix: towards the extraction of genetic network architectures , 1997 .

[10]  Shinohara,et al.  A System to Find Genetic Networks Using Weighted Network Model. , 1999, Genome informatics. Workshop on Genome Informatics.

[11]  Pedro Mendes,et al.  Artificial gene networks for objective comparison of analysis algorithms , 2003, ECCB.

[12]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[13]  Shane T. Jensen,et al.  The Spo0A regulon of Bacillus subtilis , 2003, Molecular microbiology.

[14]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Masahiro Okamoto,et al.  Development of a System for the Inference of Large Scale Genetic Networks , 2000, Pacific Symposium on Biocomputing.

[16]  D. Botstein,et al.  The transcriptional program in the response of human fibroblasts to serum. , 1999, Science.

[17]  Diego di Bernardo,et al.  Robust Identification of Large Genetic Networks , 2003, Pacific Symposium on Biocomputing.

[18]  J. Hasty,et al.  Reverse engineering gene networks: Integrating genetic perturbations with dynamical modeling , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  V. Anne Smith,et al.  Evaluating functional network inference using simulations of complex biological systems , 2002, ISMB.

[20]  Eberhard O. Voit,et al.  Power-Law Approach to Modeling Biological Systems : I. Theory , 1982 .

[21]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[23]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[24]  James M. Bower,et al.  Computational Modeling of Genetic and Biochemical Networks (Computational Molecular Biology) , 2004 .

[25]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[26]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[27]  A. Barabasi,et al.  Bioinformatics analysis of experimentally determined protein complexes in the yeast Saccharomyces cerevisiae. , 2003, Genome research.

[28]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Linda M. Wills,et al.  Reverse Engineering , 1996, Springer US.

[30]  Jorge J. Moré,et al.  The NEOS Server , 1998 .

[31]  Sven Leyffer,et al.  Numerical Experience with Lower Bounds for MIQP Branch-And-Bound , 1998, SIAM J. Optim..

[32]  Gary D. Stormo,et al.  Modeling Regulatory Networks with Weight Matrices , 1998, Pacific Symposium on Biocomputing.

[33]  William Gropp,et al.  Optimization environments and the NEOS server , 1997 .

[34]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[35]  M. Savageau Biochemical systems analysis. II. The steady-state solutions for an n-pool system using a power-law approximation. , 1969, Journal of theoretical biology.

[36]  Matsumoto,et al.  Finding Genetic Network from Experiments by Weighted Network Model. , 1998, Genome informatics. Workshop on Genome Informatics.

[37]  Jesper Tegnér,et al.  Reverse engineering gene networks using singular value decomposition and robust regression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Michael A. Savageau,et al.  Introduction to S-systems and the underlying power-law formalism , 1988 .

[39]  Tommi S. Jaakkola,et al.  Bayesian Methods for Elucidating Genetic Regulatory Networks , 2002, IEEE Intell. Syst..

[40]  Patrik D'haeseleer,et al.  Linear Modeling of mRNA Expression Levels During CNS Development and Injury , 1998, Pacific Symposium on Biocomputing.

[41]  William S. Hlavacek,et al.  Method for determining natural design principles of biological control circuits , 1998, J. Intell. Fuzzy Syst..

[42]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[43]  Michael A. Savageau,et al.  Models of Gene Function: General Methods of Kinetic Analysis and Specific Ecological Correlates , 1983 .

[44]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[45]  K. F. Tipton,et al.  Biochemical systems analysis: A study of function and design in molecular biology , 1978 .

[46]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[47]  M A Savageau,et al.  Accuracy of alternative representations for integrated biochemical systems. , 1987, Biochemistry.

[48]  Fang-Xiang Wu,et al.  Modeling Gene Expression from Microarray Expression Data with State-Space Equations , 2003, Pacific Symposium on Biocomputing.

[49]  Satoru Miyano,et al.  Inferring qualitative relations in genetic networks and metabolic pathways , 2000, Bioinform..

[50]  Elizabeth D. Dolan,et al.  NEOS Server 4.0 Administrative Guide , 2001, ArXiv.