Linear fuzzy gene network models obtained from microarray data by exhaustive search

Background

Recent technological advances in high-throughput data collection allow for experimental study of increasingly complex systems on the scale of the whole cellular genome and proteome. Gene network models are needed to interpret the resulting large and complex data sets. Rationally designed perturbations (e.g., gene knock-outs) can be used to iteratively refine hypothetical models, suggesting an approach for high-throughput biological system analysis. We introduce an approach to gene network modeling based on a scalable linear variant of fuzzy logic: a framework with greater resolution than Boolean logic models, but which, while still semi-quantitative, does not require the precise parameter measurement needed for chemical kinetics-based modeling.

Results

We demonstrated our approach with exhaustive search for fuzzy gene interaction models that best fit transcription measurements by microarray of twelve selected genes regulating the yeast cell cycle. Applying an efficient, universally applicable data normalization and fuzzification scheme, the search converged to a small number of models that individually predict experimental data within an error tolerance. Because only gene transcription levels are used to develop the models, they include both direct and indirect regulation of genes.

Conclusion

Biological relationships in the best-fitting fuzzy gene network models successfully recover direct and indirect interactions predicted from previous knowledge to result in transcriptional correlation. Fuzzy models fit on one yeast cell cycle data set robustly predict another experimental data set for the same system. Linear fuzzy gene networks and exhaustive rule search are the first steps towards a framework for an integrated modeling and experiment approach to high-throughput "reverse engineering" of complex biological systems.
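To make the pipeline described in the abstract more concrete, the following is a minimal Python sketch of normalization, fuzzification, a linear (additive) fuzzy combination of regulators, and an exhaustive search over regulator pairs. It is an illustration under stated assumptions, not the authors' implementation: the min-max normalization, the three triangular fuzzy sets (low, medium, high), the one-activator-plus-one-repressor model form, the mean squared error criterion, and all function names are hypothetical choices made here for clarity.

```python
import itertools
import numpy as np

# Rule tables (assumed): input fuzzy state index -> output fuzzy state index.
ACTIVATE = (0, 1, 2)  # low -> low, medium -> medium, high -> high
REPRESS = (2, 1, 0)   # low -> high, medium -> medium, high -> low

def normalize(x):
    """Rescale one gene's time series to [0, 1] (min-max; an assumed scheme)."""
    x = np.asarray(x, dtype=float)
    span = x.max() - x.min()
    return np.zeros_like(x) if span == 0 else (x - x.min()) / span

def fuzzify(x):
    """Map values in [0, 1] to (low, medium, high) memberships that sum to 1."""
    x = np.asarray(x, dtype=float)
    low = np.clip(1 - 2 * x, 0, 1)
    high = np.clip(2 * x - 1, 0, 1)
    med = 1 - low - high
    return np.stack([low, med, high], axis=-1)

def defuzzify(m):
    """Weighted average of the set centers 0, 0.5, 1 (centroid-style)."""
    centers = np.array([0.0, 0.5, 1.0])
    return m @ centers / m.sum(axis=-1)

def apply_rule(m, rule):
    """Move the membership of each input state to its mapped output state."""
    out = np.zeros_like(m)
    for i, o in enumerate(rule):
        out[..., o] += m[..., i]
    return out

def predict(act_m, rep_m):
    """Linear combination: each regulator contributes independently, then sum."""
    combined = apply_rule(act_m, ACTIVATE) + apply_rule(rep_m, REPRESS)
    return defuzzify(combined)

def exhaustive_search(data, target):
    """Try every ordered (activator, repressor) pair for one target gene.

    data: array of shape (genes, time points), already scaled to [0, 1].
    Returns (mean squared error, activator index, repressor index).
    """
    memberships = fuzzify(data)  # shape (genes, time points, 3)
    candidates = [g for g in range(data.shape[0]) if g != target]
    best = None
    for act, rep in itertools.permutations(candidates, 2):
        pred = predict(memberships[act], memberships[rep])
        err = float(np.mean((pred - data[target]) ** 2))
        if best is None or err < best[0]:
            best = (err, act, rep)
    return best
```

Because each regulator contributes its own rule output and the outputs are simply added, the number of rules grows linearly with the number of inputs rather than combinatorially, which is what makes an exhaustive search over regulator combinations feasible for a small gene set. In practice one would call `normalize` on each gene's time series before passing the matrix to `exhaustive_search`, and keep all models within an error tolerance rather than only the single best one.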
