We describe scoring metrics for learning Bayesian networks from a combination of user knowledge and statistical data. Previous work has concentrated on metrics for domains containing only discrete variables, under the assumption that data represents a multinomial sample. In this paper, we extend this work, developing scoring metrics for domains containing only continuous variables under the assumption that continuous data is sampled from a multivariate normal distribution. Our work extends traditional statistical approaches for identifying vanishing regression coefficients in that we identify two important assumptions, called event equivalence and parameter modularity, that when combined allow the construction of prior distributions for multivariate normal parameters from a single prior Bayesian network specified by a user.
[1]
C. Robert Kenley,et al.
Gaussian influence diagrams
,
1989
.
[2]
A. Dawid,et al.
Hyper Markov Laws in the Statistical Analysis of Decomposable Graphical Models
,
1993
.
[3]
G. Yule.
On the Theory of Correlation for any Number of Variables, Treated by a New System of Notation
,
1907
.
[4]
David J. Spiegelhalter,et al.
Bayesian analysis in expert systems
,
1993
.
[5]
Thomas G. Dietterich.
What is machine learning?
,
2020,
Archives of Disease in Childhood.
[6]
M. D. Wilkinson,et al.
Management science
,
1989,
British Dental Journal.