Using Simulated Data to Evaluate Bayesian Network Approach for Integrating Diverse Data

Large-scale high-dimensional omics data sets have been generate to survey complex biological systems. However, it is a challenge how to integrate multiple dimensions of biological data to biological causal networks where comprehensive knowledge can be derived in contexts. We developed a RIMBANet (Reconstructing Integrative Molecular Bayesian Networks) method to integrate diverse biological data. In this chapter, we disseminate results of applying our RIMBANet method on a series of simulated datasets. Two sets of networks are inferred with or without integrating genetic markers with gene expression data. We show that integration of genetic data into network reconstruction using RIMBANet approach improves network construction accuracy. Furthermore, false-positive links in reconstructed networks are not randomly distributed. More than 80 % of them connect nodes that are indirect neighbors.

[1]  J. Zhu,et al.  An integrative genomics approach to the reconstruction of gene networks in segregating populations , 2004, Cytogenetic and Genome Research.

[2]  Rachel B. Brem,et al.  Stitching together Multiple Data Dimensions Reveals Interacting Metabolomic and Transcriptomic Networks That Modulate Cell Regulation , 2012, PLoS biology.

[3]  J. Witte Genome-wide association studies and beyond. , 2010, Annual review of public health.

[4]  Wei Zhang,et al.  A Bayesian Partition Method for Detecting Pleiotropic and Epistatic eQTL Modules , 2010, PLoS Comput. Biol..

[5]  Xia Yang,et al.  Liver and Adipose Expression Associated SNPs Are Enriched for Association to Type 2 Diabetes , 2010, PLoS genetics.

[6]  M. Daly,et al.  Genetic Mapping in Human Disease , 2008, Science.

[7]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[8]  Thomas Schiex,et al.  Gene Regulatory Network Reconstruction Using Bayesian Networks, the Dantzig Selector, the Lasso and Their Meta-Analysis , 2011, PloS one.

[9]  Rachel B. Brem,et al.  Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networks , 2008, Nature Genetics.

[10]  Judea Pearl,et al.  Chapter 2 – BAYESIAN INFERENCE , 1988 .

[11]  John D. Storey,et al.  Mapping the Genetic Architecture of Gene Expression in Human Liver , 2008, PLoS biology.

[12]  Nicole Soranzo,et al.  An Integration of Genome-Wide Association Study and Gene Expression Profiling to Prioritize the Discovery of Novel Susceptibility Loci for Osteoporosis-Related Traits , 2010, PLoS genetics.

[13]  Eric E. Schadt,et al.  The effect of food intake on gene expression in human peripheral blood , 2009, Human molecular genetics.

[14]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[15]  Jun Zhu,et al.  Increasing the Power to Detect Causal Associations by Combining Genotypic and Expression Data in Segregating Populations , 2007, PLoS Comput. Biol..

[16]  S. Horvath,et al.  Variations in DNA elucidate molecular networks that cause disease , 2008, Nature.

[17]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[18]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[19]  Kai Wang,et al.  Characterizing Dynamic Changes in the Human Blood Transcriptional Network , 2010, PLoS Comput. Biol..

[20]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[21]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..