The Integration of Heterogeneous Biological Data using Bayesian Networks

Bayesian networks can provide a suitable framework for the integration of highly heterogeneous experimental data and domain knowledge from experts and ontologies. In addition, they can produce interpretable and understandable models for knowledge discovery within complex domains by providing knowledge of casual and other relationships in the data. We have developed a system using Bayesian Networks that enables domain experts to express their knowledge and integrate it with a variety of other sources such as protein-protein relationships and to cross-reference this against new knowledge discovered by the proteomics experiments. The underlying Bayesian mechanism enables a form of hypothesis testing and evaluation.

[1]  A. Owen,et al.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae) , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Richard M Leahy,et al.  Multiplex three-dimensional brain gene expression mapping in a mouse model of Parkinson's disease. , 2002, Genome research.

[3]  Wanjin Hong,et al.  Syntaxin 6 regulates Glut4 trafficking in 3T3-L1 adipocytes. , 2003, Molecular biology of the cell.

[4]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[5]  David Page,et al.  Modelling regulatory pathways in E. coli from time series expression profiles , 2002, ISMB.

[6]  Eugene Charniak,et al.  Bayesian Networks without Tears , 1991, AI Mag..

[7]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[8]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[9]  Tommi S. Jaakkola,et al.  Combining Location and Expression Data for Principled Discovery of Genetic Regulatory Network Models , 2001, Pacific Symposium on Biocomputing.

[10]  Chris Bowerman,et al.  Automated trend analysis of proteomics data using an intelligent data mining architecture , 2006, Expert Syst. Appl..

[11]  Gregory A Petsko,et al.  War and peace , 2003, Genome Biology.

[12]  Alexander J. Hartemink,et al.  Informative Structure Priors: Joint Learning of Dynamic Regulatory Networks from Multiple Types of Data , 2004, Pacific Symposium on Biocomputing.

[13]  Chris Bowerman,et al.  Intelligent hybrid Spatio-temporal mining for knowledge discovery on proteomics data , 2005 .

[14]  Ben Taskar,et al.  Rich probabilistic models for gene expression , 2001, ISMB.

[15]  Misha Kapushesky,et al.  Unraveling nature's networks , 2003, Genome Biology.

[16]  A. Brazma,et al.  Towards reconstruction of gene networks from expression data by supervised learning , 2003, Genome Biology.

[17]  Pat Langley,et al.  Incorporating Biological Knowledge into Evaluation of Causal Regulatory Hypotheses , 2002, Pacific Symposium on Biocomputing.

[18]  E. Kraegen,et al.  Alteration in phosphorylation of P20 is associated with insulin resistance. , 2001, Diabetes.

[19]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[20]  Lyle H. Ungar,et al.  Using prior knowledge to improve genetic network reconstruction from microarray data , 2004, Silico Biol..