From Boolean to probabilistic Boolean networks as models of genetic regulatory networks

Mathematical and computational modeling of genetic regulatory networks promises to uncover the fundamental principles governing biological systems in an integrative and holistic manner. It also paves the way toward the development of systematic approaches for effective therapeutic intervention in disease. The central theme in this paper is the Boolean formalism as a building block for modeling complex, large-scale, and dynamical networks of genetic interactions. We discuss the goals of modeling genetic networks as well as the data requirements. The Boolean formalism is justified from several points of view. We then introduce Boolean networks and discuss their relationships to nonlinear digital filters. The role of Boolean networks in understanding cell differentiation and cellular functional states is discussed. The inference of Boolean networks from real gene expression data is considered from the viewpoints of computational learning theory and nonlinear signal processing, touching on computational complexity of learning and robustness. Then, a discussion of the need to handle uncertainty in a probabilistic framework is presented, leading to an introduction of probabilistic Boolean networks and their relationships to Markov chains. Methods for quantifying the influence of genes on other genes are presented. The general question of the potential effect of individual genes on the global dynamical network behavior is considered using stochastic perturbation analysis. This discussion then leads into the problem of target identification for therapeutic intervention via the development of several computational tools based on first-passage times in Markov chains. Examples from biology are presented throughout the paper.

[1]  D. Thieffry,et al.  Dynamical behaviour of biological regulatory networks—I. Biological role of feedback loops and practical use of the concept of the loop-characteristic state , 1995 .

[2]  Sui Huang Gene expression profiling, genetic networks, and cellular states: an integrating concept for tumorigenesis and drug discovery , 1999, Journal of Molecular Medicine.

[3]  C. D. Meyer,et al.  Markov chain sensitivity measured by mean first passage times , 2000 .

[4]  Masahiro Okamoto,et al.  Development of a System for the Inference of Large Scale Genetic Networks , 2000, Pacific Symposium on Biocomputing.

[5]  Wei Zhang,et al.  Differential p53 phosphorylation and activation of apoptosis-promoting genes Bax and Fas/APO-1 by irradiation and ara-C treatment , 1998, Cell Death and Differentiation.

[6]  John H. Holland,et al.  Hidden Order: How Adaptation Builds Complexity , 1995 .

[7]  Jaakko Astola,et al.  On the Use of MDL Principle in Gene Expression Prediction , 2001, EURASIP J. Adv. Signal Process..

[8]  D. Wolf,et al.  On the relationship between genomic regulatory element organization and gene regulatory dynamics. , 1998, Journal of theoretical biology.

[9]  S. P. Fodor,et al.  High density synthetic oligonucleotide arrays , 1999, Nature Genetics.

[10]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[11]  Bruno O. Shubert,et al.  Random variables and stochastic processes , 1979 .

[12]  Satoru Miyano,et al.  Inferring qualitative relations in genetic networks and metabolic pathways , 2000, Bioinform..

[13]  E. F. Codd,et al.  Cellular automata , 1968 .

[14]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[15]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, CACM.

[16]  Jaakko Astola,et al.  Optimal weighted median filtering under structural constraints , 1995, IEEE Trans. Signal Process..

[17]  A. Arkin,et al.  It's a noisy business! Genetic regulation at the nanomolar scale. , 1999, Trends in genetics : TIG.

[18]  J. Fitch,et al.  Median filtering by threshold decomposition , 1984 .

[19]  Peter L. Hammer,et al.  Evaluation, Strength, and Relevance of Variables of Boolean Functions , 2000, SIAM J. Discret. Math..

[20]  Edward R. Dougherty,et al.  Precision of morphological-representation estimators for translation-invariant binary filters: Increasing and nonincreasing , 1994, Signal Process..

[21]  S. Kauffman Homeostasis and Differentiation in Random Genetic Control Networks , 1969, Nature.

[22]  Martin Anthony,et al.  Computational Learning Theory , 1992 .

[23]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory, Second Edition , 2000, Statistics for Engineering and Information Science.

[24]  Toshihide Ibaraki,et al.  Error-Free and Best-Fit Extensions of Partially Defined Boolean Functions , 1998, Inf. Comput..

[25]  Edward R. Dougherty,et al.  Coefficient of determination in nonlinear signal processing , 2000, Signal Process..

[26]  Melanie Mitchell,et al.  Evolving cellular automata to perform computations: mechanisms and impediments , 1994 .

[27]  Edward R. Dougherty,et al.  An introduction to morphological image processing , 1992 .

[28]  Y. Chen,et al.  Ratio-based decisions and the quantitative analysis of cDNA microarray images. , 1997, Journal of biomedical optics.

[29]  G. Wise,et al.  A theoretical analysis of the properties of median filters , 1981 .

[30]  J. Astola,et al.  INFERENCE OF GENETIC REGULATORY NETWORKS UNDER THE BEST-FIT EXTENSION PARADIGM , 2001 .

[31]  M. Stern,et al.  Emergence of homeostasis and "noise imprinting" in an evolution model. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Jaakko Astola,et al.  Complexity of the consistency problem for certain Post classes , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[33]  Jean Serra,et al.  Image Analysis and Mathematical Morphology , 1983 .

[34]  Pao-Ta Yu,et al.  Convergence behavior and N-roots of stack filters , 1990, IEEE Trans. Acoust. Speech Signal Process..

[35]  S. Kauffman The large scale structure and dynamics of gene control circuits: an ensemble approach. , 1974, Journal of theoretical biology.

[36]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.

[37]  Arthur W. Burks,et al.  VON NEUMANN'S SELF-REPRODUCING AUTOMATA , 1969 .

[38]  Pao-Ta Yu,et al.  On the existence and design of the best stack filter based associative memory , 1990, IEEE International Symposium on Circuits and Systems.

[39]  E. Dougherty,et al.  CONTROL OF STATIONARY BEHAVIOR IN PROBABILISTIC BOOLEAN NETWORKS BY MEANS OF STRUCTURAL INTERVENTION , 2002 .

[40]  Edward R. Dougherty,et al.  Optimal morphological restoration: The morphological filter mean-absolute-error theorem , 1992, J. Vis. Commun. Image Represent..

[41]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[42]  Ronald W. Davis,et al.  Quantitative Monitoring of Gene Expression Patterns with a Complementary DNA Microarray , 1995, Science.

[43]  V. Thorsson,et al.  Discovery of regulatory interactions through perturbation: inference and experimental design. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[44]  Satoru Miyano,et al.  Identification of Genetic Networks from a Small Number of Gene Expression Patterns Under the Boolean Network Model , 1998, Pacific Symposium on Biocomputing.

[45]  Y. Crama,et al.  Cause-effect relationships and partially defined Boolean functions , 1988 .

[46]  M. Montenarh,et al.  Regulation of CAK kinase activity by p53 , 1998, Oncogene.

[47]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[48]  A. Gartel,et al.  Transcriptional regulation of the p21((WAF1/CIP1)) gene. , 1999, Experimental cell research.

[49]  E. Dougherty,et al.  Multivariate measurement of gene expression relationships. , 2000, Genomics.

[50]  S. Kauffman Metabolic stability and epigenesis in randomly constructed genetic nets. , 1969, Journal of theoretical biology.

[51]  Edward J. Coyle,et al.  Stack filters and the mean absolute error criterion , 1988, IEEE Trans. Acoust. Speech Signal Process..

[52]  A Wuensche,et al.  Genomic regulation modeled as a network with basins of attraction. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[53]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[54]  Yudong D. He,et al.  Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer , 2001, Nature Biotechnology.

[55]  Farren J. Isaacs,et al.  Computational studies of gene regulatory networks: in numero molecular biology , 2001, Nature Reviews Genetics.

[56]  Stig K. Andersen,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[57]  Roland Somogyi,et al.  Modeling the complexity of genetic networks: Understanding multigenic and pleiotropic regulation , 1996, Complex..

[58]  E. Dougherty,et al.  Optimal and adaptive design of logical granulometric filters , 2001 .

[59]  J. W. Bodnar Programming the Drosophila embryo. , 1997, Journal of theoretical biology.

[60]  Pao-Ta Yu,et al.  The classification and associative memory capability of stack filters , 1992, IEEE Trans. Signal Process..

[61]  E. Winzeler,et al.  Genomics, gene expression and DNA arrays , 2000, Nature.

[62]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[63]  J. Davies,et al.  Molecular Biology of the Cell , 1983, Bristol Medico-Chirurgical Journal.

[64]  James M. Bower,et al.  Computational modeling of genetic and biochemical networks , 2001 .

[65]  S Bornholdt,et al.  Robustness as an evolutionary principle , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[66]  E. Dougherty,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[67]  Matsumoto,et al.  Finding Genetic Network from Experiments by Weighted Network Model. , 1998, Genome informatics. Workshop on Genome Informatics.

[68]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[69]  John R. Koza,et al.  Hidden Order: How Adaptation Builds Complexity. , 1995, Artificial Life.

[70]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[71]  S. Huang,et al.  Genomics, complexity and drug discovery: insights from Boolean network models of cellular regulation. , 2001, Pharmacogenomics.

[72]  Jean-Michel Fourneau,et al.  A Methodology for Solving Markov Models of Parallel Systems , 1991, J. Parallel Distributed Comput..

[73]  D. A. Baxter,et al.  Mathematical Modeling of Gene Networks , 2000, Neuron.

[74]  Edward J. Wegman,et al.  Statistical Signal Processing , 1985 .

[75]  Denis Thieffry,et al.  Genetic control of flower morphogenesis in Arabidopsis thaliana: a logical analysis , 1999, Bioinform..

[76]  Pao-Ta Yu,et al.  Convergence behavior and root signal sets of stack filters , 1992 .

[77]  Ilya Shmulevich,et al.  Binary analysis and optimization-based normalization of gene expression data , 2002, Bioinform..

[78]  T. Ørntoft,et al.  Gene expression profiling: monitoring transcription and translation products using DNA microarrays and proteomics , 2000, FEBS letters.

[79]  E. Dougherty,et al.  Gene perturbation and intervention in probabilistic Boolean networks. , 2002, Bioinformatics.

[80]  Gary A. Churchill,et al.  Sources of Variation in Microarray Experiments , 2003 .

[81]  Edward J. Coyle,et al.  Root properties and convergence rates of median filters , 1985, IEEE Trans. Acoust. Speech Signal Process..

[82]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[83]  Nir Friedman,et al.  Tissue classification with gene expression profiles. , 2000 .

[84]  G. Moran ON THE PERIOD-TWO-PROPERTY OF THE MAJORITY OPERATOR IN INFINITE GRAPHS , 1995 .

[85]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[86]  S. Kauffman,et al.  Towards a general theory of adaptive walks on rugged landscapes. , 1987, Journal of theoretical biology.

[87]  James P. Crutchfield,et al.  Evolving two-dimensional cellular automata to perform density classification: A report on work in progress , 2001, Parallel Comput..

[88]  Edward J. Coyle,et al.  Stack filters , 1986, IEEE Trans. Acoust. Speech Signal Process..

[89]  Leslie G. Valiant,et al.  Computational limitations on learning from examples , 1988, JACM.

[90]  S. Wildsmith,et al.  Microarrays under the microscope , 2001, Molecular pathology : MP.

[91]  Andreas Wagner,et al.  How to reconstruct a large genetic network from n gene perturbations in fewer than n2 easy steps , 2001, Bioinform..

[92]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[93]  Pao-Ta Yu,et al.  On the existence and design of the best stack filter based associative memory , 1992 .

[94]  S. Huang,et al.  Shape-dependent control of cell growth, differentiation, and apoptosis: switching between attractors in cell regulatory networks. , 2000, Experimental cell research.

[95]  Ka Yee Yeung,et al.  Algorithms for choosing differential gene expression experiments , 1999, RECOMB.

[96]  Z. Szallasi,et al.  Modeling the normal and neoplastic cell cycle with "realistic Boolean genetic networks": their application for understanding carcinogenesis and assessing therapeutic strategies. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[97]  K Sivakumar,et al.  General nonlinear framework for the analysis of gene interaction via multivariate expression arrays. , 2000, Journal of biomedical optics.

[98]  E. Davidson,et al.  Genomic cis-regulatory logic: experimental and computational analysis of a sea urchin gene. , 1998, Science.

[99]  Moncef Gabbouj,et al.  Root properties of morphological filters , 1993, Signal Process..

[100]  Satoru Miyano,et al.  Identification of gene regulatory networks by strategic gene disruptions and gene overexpressions , 1998, SODA '98.

[101]  L. Glass,et al.  The logical analysis of continuous, non-linear biochemical control networks. , 1973, Journal of theoretical biology.

[102]  J. Rissanen,et al.  Normalized Maximum Likelihood Models for Boolean Regression with Application to Prediction and Classification in Genomics , 2003 .

[103]  C B Harley,et al.  Telomere loss: mitotic clock or genetic time bomb? , 1991, Mutation research.