Overview of computational methods for the inference of gene regulatory networks

Abstract Increasing volumes of data about the cellular phenotype and classes of intracellular molecules have necessitated the introduction of systemic methods for the analysis of biological systems. These methods bring to focus the integrated nature and complex interactions of biological molecules and processes and, as such, define the emerging field of systems biology. Of the multitude of systems thus analyzed, we provide here an overview of foundational and current methods in the inference of gene regulatory networks (GRNs) and sequence-based pattern discovery. In GRN analysis, the reverse engineering paradigm is given particular attention, including the various types of models (discrete, continuous, hybrid) which may be utilized in reverse engineering a network's structure. Future research directions in these areas are discussed, particularly the potential for ventures that integrate GRN inference, pattern discovery, and experimental methods into a cohesive, productive methodology.

[1]  E. Dougherty,et al.  Gene perturbation and intervention in probabilistic Boolean networks. , 2002, Bioinformatics.

[2]  David M. Allen,et al.  The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction , 1974 .

[3]  Lior Pachter,et al.  VISTA : visualizing global DNA sequence alignments of arbitrary length , 2000, Bioinform..

[4]  Robert Rosen,et al.  Recent Developments in the Theory of Control and Regulation of Cellular Processes , 1968 .

[5]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[6]  D. Featherstone,et al.  Wrestling with pleiotropy: genomic and topological analysis of the yeast gene expression network. , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[7]  L. Pachter,et al.  rVista for comparative sequence-based discovery of functional transcription factor binding sites. , 2002, Genome research.

[8]  Farren J. Isaacs,et al.  Computational studies of gene regulatory networks: in numero molecular biology , 2001, Nature Reviews Genetics.

[9]  D. A. Baxter,et al.  Mathematical Modeling of Gene Networks , 2000, Neuron.

[10]  Gary D. Stormo,et al.  DNA binding sites: representation and discovery , 2000, Bioinform..

[11]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[12]  Alvis Brazma,et al.  Learning About Gene Regulatory Networks From Gene Deletion Experiments , 2002, Comparative and functional genomics.

[13]  Tommi S. Jaakkola,et al.  Bayesian Methods for Elucidating Genetic Regulatory Networks , 2002, IEEE Intell. Syst..

[14]  D. A. Baxter,et al.  Modeling transcriptional control in gene networks—methods, recent results, and future directions , 2000, Bulletin of mathematical biology.

[15]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[16]  K Sivakumar,et al.  General nonlinear framework for the analysis of gene interaction via multivariate expression arrays. , 2000, Journal of biomedical optics.

[17]  F R Adler,et al.  How to make a biological switch. , 2000, Journal of theoretical biology.

[18]  E. Davidson,et al.  Modeling transcriptional regulatory networks. , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[19]  Joaquín Dopazo,et al.  An Approach to Inferring Transcriptional Regulation Among Genes From Large-Scale Expression Data , 2003, Comparative and functional genomics.

[20]  Michael I. Jordan,et al.  Variational Probabilistic Inference and the QMR-DT Network , 2011, J. Artif. Intell. Res..

[21]  Douglas L. Brutlag,et al.  BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes , 2000, Pacific Symposium on Biocomputing.

[22]  Jun S. Liu,et al.  An algorithm for finding protein–DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments , 2002, Nature Biotechnology.

[23]  T. D. Schneider,et al.  Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli. , 1982, Nucleic acids research.

[24]  Hitoshi Iba,et al.  Evolutionary modeling and inference of gene network , 2002, Inf. Sci..

[25]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[26]  Aris Floratos,et al.  An Approximation Algorithm for Alignment of Multiple Sequences using Motif Discovery , 1999, J. Comb. Optim..

[27]  I-Min A. Dubchak,et al.  Active conservation of noncoding sequences revealed by three-way species comparisons. , 2000, Genome research.

[28]  R. Gibbs,et al.  PipMaker--a web server for aligning two genomic DNA sequences. , 2000, Genome research.

[29]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[30]  G. Church,et al.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. , 2000, Journal of molecular biology.

[31]  Yudong D. He,et al.  Functional Discovery via a Compendium of Expression Profiles , 2000, Cell.

[32]  Dirk Repsilber,et al.  Reverse engineering of regulatory networks: simulation studies on a genetic algorithm approach for ranking hypotheses. , 2002, Bio Systems.

[33]  Jun S. Liu,et al.  Discovery of Conserved Sequence Patterns Using a Stochastic Dictionary Model , 2003 .

[34]  Tao Jiang,et al.  Identifying transcription factor binding sites through Markov chain optimization , 2002, ECCB.

[35]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[36]  B. De Moor,et al.  Toucan: deciphering the cis-regulatory logic of coregulated genes. , 2003, Nucleic acids research.

[37]  Jaak Vilo,et al.  Building and analysing genome-wide gene disruption networks , 2002, ECCB.

[38]  R Staden Computer methods to locate signals in nucleic acid sequences , 1984, Nucleic Acids Res..

[39]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data of Bacillus Subtilis Using Differential Equations , 2002, Pacific Symposium on Biocomputing.

[40]  C. Lawrence,et al.  Human-mouse genome comparisons to locate regulatory sites , 2000, Nature Genetics.

[41]  Julio Collado-Vides,et al.  RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12 , 2001, Nucleic Acids Res..

[42]  Hongzhe Li,et al.  Clustering of time-course gene expression data using a mixed-effects model with B-splines , 2003, Bioinform..

[43]  Gary D. Stormo,et al.  Identifying target sites for cooperatively binding factors , 2001, Bioinform..

[44]  Marcel J. T. Reinders,et al.  Multi-criterion optimization for genetic network modeling , 2003, Signal Process..

[45]  T. Kepler,et al.  Stochasticity in transcriptional regulation: origins, consequences, and mathematical representations. , 2001, Biophysical journal.

[46]  Andreas Wagner,et al.  Estimating coarse gene network structure from large-scale gene perturbation data. , 2002, Genome research.

[47]  J. Collins,et al.  Inferring Genetic Networks and Identifying Compound Mode of Action via Expression Profiling , 2003, Science.

[48]  Yaolin Wang,et al.  Analyses of p53 Target Genes in the Human Genome by Bioinformatic and Microarray Approaches* 210 , 2001, The Journal of Biological Chemistry.

[49]  Björn Olsson,et al.  Could correlation-based methods be used to derive genetic association networks? , 2002, Inf. Sci..

[50]  David H. Sharp,et al.  Mechanism of eve stripe formation , 1995, Mechanisms of Development.

[51]  G. Odell,et al.  The segment polarity network is a robust developmental module , 2000, Nature.

[52]  Jun S. Liu,et al.  Bayesian Models for Multiple Local Sequence Alignment and Gibbs Sampling Strategies , 1995 .

[53]  Nicholas L. Bray,et al.  AVID: A global alignment program. , 2003, Genome research.

[54]  Jesper Tegnér,et al.  Reverse engineering gene networks using singular value decomposition and robust regression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[55]  Satoru Miyano,et al.  Inferring Gene Regulatory Networks from Time-Ordered Gene Expression Data Using Differential Equations , 2002, Discovery Science.

[56]  A. Arkin,et al.  Stochastic kinetic analysis of developmental pathway bifurcation in phage lambda-infected Escherichia coli cells. , 1998, Genetics.

[57]  Luquan Wang,et al.  Computational analysis of composite regulatory elements , 2002, Mammalian Genome.

[58]  T. Mestl,et al.  A mathematical framework for describing and analysing gene regulatory networks. , 1995, Journal of theoretical biology.

[59]  E. Dougherty,et al.  Multivariate measurement of gene expression relationships. , 2000, Genomics.

[60]  M. R. Mickey,et al.  Estimation of Error Rates in Discriminant Analysis , 1968 .

[61]  Edward R. Dougherty,et al.  Coefficient of determination in nonlinear signal processing , 2000, Signal Process..

[62]  W. H. Day,et al.  Critical comparison of consensus methods for molecular sequences. , 1992, Nucleic acids research.

[63]  Xin Chen,et al.  TRANSFAC: an integrated system for gene expression regulation , 2000, Nucleic Acids Res..

[64]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[65]  Jun S. Liu,et al.  The Collapsed Gibbs Sampler in Bayesian Computations with Applications to a Gene Regulation Problem , 1994 .

[66]  Hitoshi Iba,et al.  Inference of a gene regulatory network by means of interactive evolutionary computing , 2002, Inf. Sci..

[67]  Kathleen Marchal,et al.  Functional bioinformatics of microarray data: from expression to regulation , 2002, Proc. IEEE.

[68]  Araceli M. Huerta,et al.  From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. , 1998, BioEssays : news and reviews in molecular, cellular and developmental biology.

[69]  J. Hasty,et al.  Reverse engineering gene networks: Integrating genetic perturbations with dynamical modeling , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[70]  Xiaobo Zhou,et al.  Construction of genomic networks using mutual-information clustering and reversible-jump Markov-chain-Monte-Carlo predictor design , 2003, Signal Process..

[71]  D. Pribnow Nucleotide sequence of an RNA polymerase binding site at an early T7 promoter. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[72]  Jun S. Liu,et al.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. , 1993, Science.

[73]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[74]  J Craig Venter,et al.  Generating a synthetic genome by whole genome assembly: φX174 bacteriophage from synthetic oligonucleotides , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[75]  M. Blanchette,et al.  Discovery of regulatory elements by a computational method for phylogenetic footprinting. , 2002, Genome research.

[76]  R. Stoughton,et al.  Genetics of gene expression surveyed in maize, mouse and man , 2003, Nature.

[77]  Savageau Ma Rules for the evolution of gene circuitry. , 1998 .