Gene regulatory network inference: Data integration in dynamic models - A review

Systems biology aims to develop mathematical models of biological systems by integrating experimental and theoretical techniques. During the last decade, many systems biological approaches that base on genome-wide data have been developed to unravel the complexity of gene regulation. This review deals with the reconstruction of gene regulatory networks (GRNs) from experimental data through computational methods. Standard GRN inference methods primarily use gene expression data derived from microarrays. However, the incorporation of additional information from heterogeneous data sources, e.g. genome sequence and protein-DNA interaction data, clearly supports the network inference process. This review focuses on promising modelling approaches that use such diverse types of molecular biological information. In particular, approaches are discussed that enable the modelling of the dynamics of gene regulatory systems. The review provides an overview of common modelling schemes and learning algorithms and outlines current challenges in GRN modelling.

[1]  A. Sandelin,et al.  Applied bioinformatics for the identification of regulatory elements , 2004, Nature Reviews Genetics.

[2]  R. Thomas,et al.  Boolean formalization of genetic control circuits. , 1973, Journal of theoretical biology.

[3]  Eberhard O Voit,et al.  Theoretical Biology and Medical Modelling Identification of Metabolic System Parameters Using Global Optimization Methods , 2022 .

[4]  Carmen G. Moles,et al.  Parameter estimation in biochemical pathways: a comparison of global optimization methods. , 2003, Genome research.

[5]  Reinhard Guthke,et al.  Dynamic network reconstruction from gene expression data applied to immune response during bacterial infection , 2005, Bioinform..

[6]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[7]  Rainer Spang,et al.  Inferring cellular networks – a review , 2007, BMC Bioinformatics.

[8]  E. Davidson,et al.  The hardwiring of development: organization and function of genomic regulatory systems. , 1997, Development.

[9]  J. Hasty,et al.  Reverse engineering gene networks: Integrating genetic perturbations with dynamical modeling , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Korbinian Strimmer,et al.  From correlation to causation networks: a simple approximate learning algorithm and its application to high-dimensional plant gene expression data , 2007, BMC Systems Biology.

[11]  Jean-Loup Faulon,et al.  Boolean dynamics of genetic regulatory networks inferred from microarray time series data , 2007, Bioinform..

[12]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[13]  Timothy S Gardner,et al.  Reverse-engineering transcription control networks. , 2005, Physics of life reviews.

[14]  Marcel J. T. Reinders,et al.  Studying the Conditions for Learning Dynamic Bayesian Networks to Discover Genetic Regulatory Networks , 2003, Simul..

[15]  Felix Streichert,et al.  Comparing mathematical models on the problem of network inference , 2006, GECCO.

[16]  E. O. Voit,et al.  Biochemical systems analysis of genome-wide expression data , 2000, Bioinform..

[17]  Zoubin Ghahramani,et al.  Modeling T-cell activation using gene expression profiling and state-space models , 2004, Bioinform..

[18]  O. Nelles Nonlinear System Identification , 2001 .

[19]  Francis J. Doyle,et al.  Simulation Studies for the Identification of Genetic Networks from cDNA Array and Regulatory Activity Data , 2001 .

[20]  Marcel J. T. Reinders,et al.  A Comparison of Genetic Network Models , 2000, Pacific Symposium on Biocomputing.

[21]  Rainer Spang,et al.  Non-transcriptional pathway features reconstructed from secondary effects of RNA interference , 2005, Bioinform..

[22]  S. Kauffman Metabolic stability and epigenesis in randomly constructed genetic nets. , 1969, Journal of theoretical biology.

[23]  Adriana Climescu-Haulica,et al.  A stochastic differential equation model for transcriptional regulatory networks , 2007, BMC Bioinformatics.

[24]  E. P. van Someren Searching for Limited Connectivity in Genetic Network Models , 2004 .

[25]  J. Collins,et al.  Chemogenomic profiling on a genome-wide scale using reverse-engineered gene networks , 2005, Nature Biotechnology.

[26]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[27]  B. Morgan,et al.  Non-uniqueness and Inversions in Cluster Analysis , 1995 .

[28]  Jaak Vilo,et al.  Building and analysing genome-wide gene disruption networks , 2002, ECCB.

[29]  Bettina Birkmeier Integrating Prior Knowledge into the Fitness Function of an Evolutionary Algorithm for Deriving Gene Regulatory Networks , 2006 .

[30]  Trupti Joshi,et al.  Inferring gene regulatory networks from multiple microarray datasets , 2006, Bioinform..

[31]  Stefan Bornholdt,et al.  Boolean network models of cellular regulation: prospects and limitations , 2008, Journal of The Royal Society Interface.

[32]  D. Kell,et al.  Metabolomics by numbers: acquiring and understanding global metabolite data. , 2004, Trends in biotechnology.

[33]  Jesper Tegnér,et al.  Reverse engineering gene networks using singular value decomposition and robust regression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Vladimir Filkov,et al.  Identifying Gene Regulatory Networks from Gene Expression Data , 2005 .

[35]  Jennifer Prestigiacomo,et al.  A Hybrid Approach , 2018, How High the Sky?.

[36]  Lyle H. Ungar,et al.  Using prior knowledge to improve genetic network reconstruction from microarray data , 2004, Silico Biol..

[37]  Jonas S. Almeida,et al.  Parameter optimization in S-system models , 2008, BMC Systems Biology.

[38]  A. Fire,et al.  Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans , 1998, Nature.

[39]  Eyad Almasri,et al.  Rank-based edge reconstruction for scale-free genetic regulatory networks , 2008, BMC Bioinformatics.

[40]  Satoru Miyano,et al.  Using Protein-Protein Interactions for Refining Gene Networks Estimated from Microarray Data by Bayesian Networks , 2003, Pacific Symposium on Biocomputing.

[41]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1998, Learning in Graphical Models.

[42]  Adam A. Margolin,et al.  Reverse engineering of regulatory networks in human B cells , 2005, Nature Genetics.

[43]  Maria Rodriguez-Fernandez,et al.  A hybrid approach for efficient and robust parameter estimation in biochemical pathways. , 2006, Bio Systems.

[44]  Ulrich Möller,et al.  Quantitative Evaluation of Established Clustering Methods for Gene Expression Data , 2004, ISBMDA.

[45]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[46]  Alfred O. Hero,et al.  Using Directed Information to Build Biologically Relevant Influence Networks , 2007, J. Bioinform. Comput. Biol..

[47]  Shuhei Kimura,et al.  Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm , 2005, Bioinform..

[48]  N. V. van Riel Dynamic modelling and analysis of biochemical networks: mechanism-based models and model-based experiments. , 2006, Briefings in bioinformatics.

[49]  M. Savageau Biochemical systems analysis. II. The steady-state solutions for an n-pool system using a power-law approximation. , 1969, Journal of theoretical biology.

[50]  Patrik D'haeseleer,et al.  Linear Modeling of mRNA Expression Levels During CNS Development and Injury , 1998, Pacific Symposium on Biocomputing.

[51]  C. Mello,et al.  Revealing the world of RNA interference , 2004, Nature.

[52]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[53]  Tommi S. Jaakkola,et al.  Combining Location and Expression Data for Principled Discovery of Genetic Regulatory Network Models , 2001, Pacific Symposium on Biocomputing.

[54]  Jean-Philippe Vert,et al.  SIRENE: supervised inference of regulatory networks , 2008, ECCB.

[55]  A. Califano,et al.  Dialogue on Reverse‐Engineering Assessment and Methods , 2007, Annals of the New York Academy of Sciences.

[56]  Ulrich Möller,et al.  Performance of data resampling methods for robust class discovery based on clustering , 2006, Intell. Data Anal..

[57]  Hyung-Seok Choi,et al.  Reverse engineering of gene regulatory networks. , 2007, IET systems biology.

[58]  Marcel J. T. Reinders,et al.  Linear Modeling of Genetic Networks from Experimental Data , 2000, ISMB.

[59]  Steven Skiena,et al.  Identifying gene regulatory networks from experimental data , 2001, Parallel Comput..

[60]  Francis D. Gibbons,et al.  Judging the quality of gene expression-based clustering methods using gene annotation. , 2002, Genome research.

[61]  Christian J. Stoeckert,et al.  Bayesian variable selection and data integration for biological regulatory networks , 2006, math/0610034.

[62]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[63]  I. Shmulevich,et al.  Computational and Statistical Approaches to Genomics , 2007, Springer US.

[64]  Purvesh Khatri,et al.  Ontological analysis of gene expression data: current tools, limitations, and open problems , 2005, Bioinform..

[65]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[66]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[67]  H. Iba,et al.  Inferring a system of differential equations for a gene regulatory network by using genetic programming , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[68]  Eberhard O Voit Modelling metabolic networks using power-laws and S-systems. , 2008, Essays in biochemistry.

[69]  Marcel J. T. Reinders,et al.  Least absolute regression network analysis of the murine osteoblast differentiation network , 2006, Bioinform..

[70]  V. Thorsson,et al.  Discovery of regulatory interactions through perturbation: inference and experimental design. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[71]  D. Pe’er,et al.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data , 2003, Nature Genetics.

[72]  Nicola J. Rinaldi,et al.  Computational discovery of gene modules and regulatory networks , 2003, Nature Biotechnology.

[73]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[74]  Eric Mjolsness,et al.  From Coexpression to Coregulation: An Approach to Inferring Transcriptional Regulation among Gene Classes from Large-Scale Expression Data , 1999, NIPS.

[75]  Satoru Miyano,et al.  Combining Microarrays and Biological Knowledge for Estimating Gene Networks via Bayesian Networks , 2004, J. Bioinform. Comput. Biol..

[76]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[77]  Satoru Miyano,et al.  Estimating gene networks from gene expression data by combining Bayesian network model with promoter element detection , 2003, ECCB.

[78]  John J. Wyrick,et al.  Genome-wide location and function of DNA binding proteins. , 2000, Science.

[79]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[80]  Claudio Altafini,et al.  Comparing association network algorithms for reverse engineering of large-scale gene regulatory networks: synthetic versus real data , 2007, Bioinform..

[81]  Tommi S. Jaakkola,et al.  Using Graphical Models and Genomic Expression Data to Statistically Validate Models of Genetic Regulatory Networks , 2000, Pacific Symposium on Biocomputing.

[82]  Neal S. Holter,et al.  Dynamic modeling of gene expression data. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[83]  M Wahde,et al.  Coarse-grained reverse engineering of genetic regulatory networks. , 2000, Bio Systems.

[84]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[85]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[86]  I. Simon,et al.  Reconstructing dynamic regulatory maps , 2007, Molecular systems biology.

[87]  Eberhard O. Voit,et al.  Computational Analysis of Biochemical Systems: A Practical Guide for Biochemists and Molecular Biologists , 2000 .

[88]  Gail D. Baura,et al.  Nonlinear System Identification , 2002 .

[89]  Srinivas Aluru,et al.  Handbook Of Computational Molecular Biology , 2010 .

[90]  M. Mann,et al.  Proteomics to study genes and genomes , 2000, Nature.

[91]  R. Heinrich,et al.  The Regulation of Cellular Systems , 1996, Springer US.

[92]  Gary D. Stormo,et al.  Modeling Regulatory Networks with Weight Matrices , 1998, Pacific Symposium on Biocomputing.

[93]  D. Husmeier,et al.  Reconstructing Gene Regulatory Networks with Bayesian Networks by Combining Expression Data with Multiple Sources of Prior Knowledge , 2007, Statistical applications in genetics and molecular biology.

[94]  Pedro Mendes,et al.  Artificial gene networks for objective comparison of analysis algorithms , 2003, ECCB.

[95]  E. Kawasaki The end of the microarray Tower of Babel: will universal standards lead the way? , 2006, Journal of biomolecular techniques : JBT.

[96]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[97]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[98]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[99]  J. Collins,et al.  Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles , 2007, PLoS biology.

[100]  Arun K. Ramani,et al.  Protein interaction networks from yeast to human. , 2004, Current opinion in structural biology.

[101]  Alexander J. Hartemink,et al.  Informative Structure Priors: Joint Learning of Dynamic Regulatory Networks from Multiple Types of Data , 2004, Pacific Symposium on Biocomputing.

[102]  A. Schuster,et al.  Tumor classification by gene expression profiling: comparison and validation of five clustering methods , 2001, SIGB.

[103]  T. Elston,et al.  Stochasticity in gene expression: from theories to phenotypes , 2005, Nature Reviews Genetics.

[104]  Diego di Bernardo,et al.  Inference of gene regulatory networks and compound mode of action from time course gene expression profiles , 2006, Bioinform..

[105]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[106]  Leon Glass,et al.  Reverse Engineering the Gap Gene Network of Drosophila melanogaster , 2006, PLoS Comput. Biol..

[107]  Aurélien Mazurie,et al.  Gene networks inference using dynamic Bayesian networks , 2003, ECCB.

[108]  M. Reinders,et al.  Genetic network modeling. , 2002, Pharmacogenomics.

[109]  Joshua M. Stuart,et al.  Conserved Genetic Modules 5 / 29 / 2003 1 A gene co-expression network for global discovery of conserved genetic modules in H . sapiens , D . melanogaster , C . elegans , and S . cerevisiae , 2003 .

[110]  Reinhard Guthke,et al.  Discovery of Gene Regulatory Networks in Aspergillus fumigatus , 2006, KDECB.

[111]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[112]  John Quackenbush Microarray data normalization and transformation , 2002, Nature Genetics.

[113]  Andrew J. Bulpitt,et al.  A Primer on Learning in Bayesian Networks for Computational Biology , 2007, PLoS Comput. Biol..

[114]  Masaru Tomita,et al.  Indeterminacy of Reverse Engineering of Gene Regulatory Networks: The Curse of Gene Elasticity , 2007, PloS one.

[115]  Marcel J. T. Reinders,et al.  Regularization and Noise Injection for Improving Genetic Network Models , 2006 .

[116]  Carsten O. Daub,et al.  The mutual information: Detecting and evaluating dependencies between variables , 2002, ECCB.

[117]  Andreas Zell,et al.  Inferring Regulatory Systems with Noisy Pathway Information , 2005, German Conference on Bioinformatics.

[118]  Reinhard Guthke,et al.  Molecular discrimination of responders and nonresponders to anti-TNFalpha therapy in rheumatoid arthritis by etanercept , 2008, Arthritis research & therapy.

[119]  Michael Q. Zhang Inferring Gene Regulatory Networks , 2008 .

[120]  N. Lee,et al.  Computational and experimental approaches for modeling gene regulatory networks. , 2007, Current pharmaceutical design.

[121]  E. Koonin,et al.  Conservation and coevolution in the scale-free human gene coexpression network. , 2004, Molecular biology and evolution.

[122]  Eyad Almasri,et al.  A statistical method to incorporate biological knowledge for generating testable novel gene regulatory interactions from microarray experiments , 2007, BMC Bioinformatics.

[123]  Roberto Marcondes Cesar Junior,et al.  Inference from Clustering with Application to Gene-Expression Microarrays , 2002, J. Comput. Biol..

[124]  Richard Bonneau,et al.  The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo , 2006, Genome Biology.

[125]  Wei Pan,et al.  A comparative review of statistical methods for discovering differentially expressed genes in replicated microarray experiments , 2002, Bioinform..

[126]  Jens Timmer,et al.  Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge , 2007, BMC Systems Biology.

[127]  Gustavo Stolovitzky,et al.  Reconstructing biological networks using conditional correlation analysis , 2005, Bioinform..