Utilizing Evolutionary Information and Gene Expression Data for Estimating Gene Networks with Bayesian Network Models

Because microarray gene expression data alone do not contain enough information to estimate gene networks accurately, other sources of biological information have been used to improve the estimates. Recent studies have revealed that highly conserved proteins that exhibit similar expression patterns in different organisms have almost the same function in each organism. Such conserved proteins are also known to play similar roles in gene regulation. This evolutionary information can therefore be used to refine the regulatory relationships among genes that are estimated from gene expression data. We propose a statistical method for estimating gene networks from gene expression data by utilizing evolutionarily conserved relationships between genes. Our method simultaneously estimates the gene networks of two distinct organisms with a Bayesian network model that incorporates the evolutionary information, so that the gene expression data of one organism help to estimate the gene network of the other. We show the effectiveness of the method through an analysis of Saccharomyces cerevisiae and Homo sapiens cell cycle gene expression data. The method succeeded in estimating gene networks that capture many known relationships, as well as several previously unreported relationships that are likely to be novel. Supplementary information is available at http://bonsai.ims.u-tokyo.ac.jp/~tamada/bayesnet/.
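
The idea of letting one organism's network inform the other can be illustrated with a minimal sketch. It is not the scoring function of the proposed method, which the abstract does not specify. It assumes that each network already has a data-fit score (`fit_a`, `fit_b`), that a one-to-one ortholog map between the two organisms is available, and that each edge conserved across the two networks adds a fixed bonus `weight`. All function names, gene labels, and the additive form of the score are hypothetical.

```python
from typing import Dict, Set, Tuple

Edge = Tuple[str, str]  # (parent gene, child gene)


def conserved_edge_count(
    edges_a: Set[Edge], edges_b: Set[Edge], orthologs: Dict[str, str]
) -> int:
    """Count edges of network A whose ortholog-mapped edge exists in network B.

    `orthologs` maps a gene of organism A to its assumed ortholog in organism B.
    """
    count = 0
    for parent, child in edges_a:
        mapped_parent = orthologs.get(parent)
        mapped_child = orthologs.get(child)
        # Skip edges where either gene has no known ortholog.
        if mapped_parent is None or mapped_child is None:
            continue
        if (mapped_parent, mapped_child) in edges_b:
            count += 1
    return count


def joint_log_score(
    fit_a: float,
    fit_b: float,
    edges_a: Set[Edge],
    edges_b: Set[Edge],
    orthologs: Dict[str, str],
    weight: float = 1.0,
) -> float:
    """Hypothetical joint score: two data-fit terms plus a conservation bonus."""
    bonus = weight * conserved_edge_count(edges_a, edges_b, orthologs)
    return fit_a + fit_b + bonus


# Toy example with placeholder gene labels (not real genes).
edges_a = {("yA", "yB"), ("yB", "yC")}
edges_b = {("hA", "hB")}
orthologs = {"yA": "hA", "yB": "hB"}

n_conserved = conserved_edge_count(edges_a, edges_b, orthologs)
score = joint_log_score(-10.0, -12.0, edges_a, edges_b, orthologs, weight=2.0)
```

Under this assumed form, a structure search that maximizes the joint score would favor pairs of networks that both fit their own expression data and agree on edges between orthologous genes.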
