Using Graphical Models and Genomic Expression Data to Statistically Validate Models of Genetic Regulatory Networks

We propose a model-driven approach for analyzing genomic expression data that permits genetic regulatory networks to be represented in a biologically interpretable computational form. Our models permit latent variables capturing unobserved factors, describe arbitrarily complex (more than pair-wise) relationships at varying levels of refinement, and can be scored rigorously against observational data. The models that we use are based on Bayesian networks and their extensions. As a demonstration of this approach, we utilize 52 genomes worth of Affymetrix GeneChip expression data to correctly differentiate between alternative hypotheses of the galactose regulatory network in S. cerevisiae. When we extend the graph semantics to permit annotated edges, we are able to score models describing relationships at a finer degree of specification.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  P. Vittoz Man , 1962, Bloom.

[3]  B. Bainbridge,et al.  Genetics , 1981, Experientia.

[4]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[5]  AC Tose Cell , 1993, Cell.

[6]  Finn Verner Jensen,et al.  Introduction to Bayesian Networks , 2008, Innovations in Bayesian Networks.

[7]  M. Holcombe,et al.  Information Processing in Cells and Tissues , 1998, Springer US.