Bayesian Inference for Gene Expression and Proteomics: Computational Methods for Learning Bayesian Networks from High-Throughput Biological Data

Data from high-throughput technologies, such as gene expression microarrays, promise to yield insight into the nature of the cellular processes that have been disrupted by disease, thus improving our understanding of the disease and hastening the discovery of effective new treatments. Most of the analysis thus far has focused on identifying differential measurements, which form the basis of biomarker discovery. However, merely listing differentially expressed genes or gene products is not sufficient to explain the molecular basis of disease. Consequently, there is increasing interest in extracting more information from available data in the form of biologically meaningful relationships between the quantities being measured. The holy grail of such techniques is the robust identification of causal models of disease from data. The goal of this chapter is to survey computational learning methods that extract models of altered interactions that lead to and occur in the diseased state. Our focus is on methods that represent biological processes as Bayesian networks and that learn these networks from experimental measurements of cellular activity. Specifically, we will survey computational methods for learning Bayesian networks from highthroughput biological data.

[1]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[2]  Steven C. Lawlor,et al.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways , 2002, Nature Genetics.

[3]  P. Khatri,et al.  Global functional profiling of gene expression ? ? This work was funded in part by a Sun Microsystem , 2003 .

[4]  G Bartsch,et al.  Genes differentially expressed in prostate cancer , 2004, BJU international.

[5]  Nicola J. Rinaldi,et al.  Computational discovery of gene modules and regulatory networks , 2003, Nature Biotechnology.

[6]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[7]  G. Miller,et al.  Molecular genetics and epidemiology of prostate carcinoma. , 1999, Endocrine reviews.

[8]  Mikko Koivisto,et al.  Exact Bayesian Structure Discovery in Bayesian Networks , 2004, J. Mach. Learn. Res..

[9]  Nir Friedman,et al.  Learning Bayesian Network Structure from Massive Datasets: The "Sparse Candidate" Algorithm , 1999, UAI.

[10]  K. Do,et al.  Clinical and Biomarker Correlates of Androgen-Independent, Locally Aggressive Prostate Cancer with Limited Metastatic Potential , 2004, Clinical Cancer Research.

[11]  T. Barrette,et al.  Meta-analysis of microarrays: interstudy validation of gene expression profiles reveals pathway dysregulation in prostate cancer. , 2002, Cancer research.

[12]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[13]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[14]  Tommi S. Jaakkola,et al.  Using Graphical Models and Genomic Expression Data to Statistically Validate Models of Genetic Regulatory Networks , 2000, Pacific Symposium on Biocomputing.

[15]  K. Sachs,et al.  Causal Protein-Signaling Networks Derived from Multiparameter Single-Cell Data , 2005, Science.

[16]  D. Kuban,et al.  Adenoviral-mediated PTEN transgene expression sensitizes Bcl-2-expressing prostate cancer cells to radiation , 2004, Cancer Gene Therapy.

[17]  D. Koller,et al.  A module map showing conditional activity of expression modules in cancer , 2004, Nature Genetics.

[18]  M. Shen,et al.  Molecular genetics of prostate cancer. , 2000, Genes & development.

[19]  Hiroshi Mamitsuka,et al.  A hierarchical mixture of Markov models for finding biologically active metabolic paths using gene expression and protein classes , 2004 .

[20]  Min Zou,et al.  A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data , 2005, Bioinform..

[21]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[22]  T. Golub,et al.  mTOR inhibition reverses Akt-dependent prostate intraepithelial neoplasia through regulation of apoptotic and HIF-1-dependent pathways , 2004, Nature Medicine.

[23]  Hidde de Jong,et al.  Modeling and Simulation of Genetic Regulatory Systems: A Literature Review , 2002, J. Comput. Biol..

[24]  K. Do,et al.  Evidence That Transfer of Functional p53 Protein Results in Increased Apoptosis in Prostate Cancer , 2004, Clinical Cancer Research.

[25]  Trey Ideker,et al.  Building with a scaffold: emerging strategies for high- to low-level cellular modeling. , 2003, Trends in biotechnology.

[26]  K. Do,et al.  Combined laser capture microdissection and serial analysis of gene expression from human tissue samples , 2005, Modern Pathology.

[27]  Rui Henrique,et al.  Molecular detection of prostate cancer: a role for GSTP1 hypermethylation. , 2004, European urology.

[28]  K. Coombes,et al.  A comprehensive assessment of p53-responsive genes following adenoviral-p53 gene transfer in Bcl-2-expressing prostate cancer cells , 2004, Oncogene.

[29]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.