Learning Bayesian networks with mixed variables

This paper considers conditional Gaussian networks. The parameters in the network are learned by using conjugate Bayesian analysis. As conjugate local priors, we apply the Dirichlet distribution for discrete variables and the Gaussian-inverse gamma distribution for continuous variables, given a configuration of the discrete parents. We assume parameter independence and complete data. Further, to learn the structure of the network, the network score is deduced. We then develop a local master prior procedure, for deriving parameter priors in these networks. This procedure satisfies parameter independence, parameter modularity and likelihood equivalence. Bayes factors to be used in model search are introduced. Finally the methods derived are illustrated by a simple example.

[1]  David Maxwell Chickering,et al.  Learning Bayesian Networks is NP-Complete , 2016, AISTATS.

[2]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2002, J. Mach. Learn. Res..

[3]  M. West,et al.  Bayesian forecasting and dynamic models , 1989 .

[4]  S. Lauritzen Propagation of Probabilities, Means, and Variances in Mixed Graphical Association Models , 1992 .

[5]  Kevin P. Murphy,et al.  Learning the Structure of Dynamic Probabilistic Networks , 1998, UAI.

[6]  R. N. Bergman,et al.  Role of glucose and insulin resistance in development of type 2 diabetes mellitus: results of a 25-year follow-up study , 1992, The Lancet.

[7]  M. Frydenberg Marginalization and Collapsibility in Graphical Interaction Models , 1990 .

[8]  Keiji Kanazawa,et al.  A model for reasoning about persistence and causation , 1989 .

[9]  Gio Wiederhold,et al.  Knowledge bases , 1985, Future Gener. Comput. Syst..

[10]  Howell Tong,et al.  Non-Linear Time Series , 1990 .

[11]  R N Bergman,et al.  MINMOD: a computer program to calculate insulin sensitivity and pancreatic responsivity from the frequently sampled intravenous glucose tolerance test. , 1986, Computer methods and programs in biomedicine.

[12]  G. Yule On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer's Sunspot Numbers , 1927 .

[13]  M. Degroot Optimal Statistical Decisions , 1970 .

[14]  D. Haughton On the Choice of a Model to Fit Data from an Exponential Family , 1988 .

[15]  A. Dawid The Well-Calibrated Bayesian , 1982 .

[16]  Michael C. Horsch,et al.  Dynamic Bayesian networks , 1990 .

[17]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[18]  T. W. Anderson,et al.  Statistical analysis of time series , 1972 .

[19]  Little,et al.  [Lecture Notes in Mathematics] Combinatorial Mathematics V Volume 622 || Counting unlabeled acyclic digraphs , 1977 .

[20]  David Heckerman,et al.  Learning Gaussian Networks , 1994, UAI.

[21]  A. Dawid,et al.  Hyper Markov Laws in the Statistical Analysis of Decomposable Graphical Models , 1993 .