Paper Information - A Graphical Approach to Diagnosing the Validity of the Conditional Independence Assumptions of a Bayesian Network Given Data

Bayesian networks (BNs) are widely used in data analysis and decision making. Well-studied topics include efficient inference, evidence propagation, parameter learning from complete and incomplete data, expert elicitation for calibrating BN probabilities, and structure learning. Researchers commonly assume the structure of the BN or derive it from expert elicitation or domain knowledge. In this scenario, the model may be calibrated by learning its parameters from relevant data. Little work exists on model diagnostics for fitted BNs; this article addresses that gap. We build on the definition of (conditional) independence to develop a graphical diagnostic that indicates whether the conditional independence assumptions imposed by an assumed BN structure are supported by the data. We develop the approach theoretically and describe a Monte Carlo method that generates uncertainty measures for the consistency of the data with the conditional independence assumptions under the model structure. We describe how this theoretical information and the data are presented in a graphical diagnostic tool. We demonstrate the approach on data simulated from BNs under different conditional independence assumptions, and we also apply the diagnostic to a real-world dataset. The results presented in this article show that the approach is most feasible for smaller BNs. This limitation is not peculiar to the proposed diagnostic graphic; rather, it reflects the general difficulty of combining large BNs with data in any manner (such as through parameter estimation). It is the authors’ hope that this article helps highlight the need for more research into BN model diagnostics. This article has supplementary materials online.
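The abstract describes the idea only at a high level, so the following is a minimal illustrative sketch rather than the authors' diagnostic. It assumes three discrete variables with a single assumed relation, X independent of Y given Z, and measures how far the empirical P(x, y | z) is from the product P(x | z) P(y | z). It then simulates datasets in which that independence holds by construction, to give a Monte Carlo reference for the observed gap. The function names, the choice of discrepancy (largest absolute gap), and the resampling scheme are all assumptions made for illustration.

```python
import random
from collections import Counter


def ci_discrepancy(rows):
    """Largest absolute gap between empirical P(x, y | z) and P(x | z) * P(y | z).

    `rows` is a list of (x, y, z) tuples of discrete values.
    """
    n_z = Counter(z for _, _, z in rows)
    n_xz = Counter((x, z) for x, _, z in rows)
    n_yz = Counter((y, z) for _, y, z in rows)
    n_xyz = Counter(rows)
    x_values = {x for x, _, _ in rows}
    y_values = {y for _, y, _ in rows}
    gap = 0.0
    for z, nz in n_z.items():
        for x in x_values:
            for y in y_values:
                joint = n_xyz[(x, y, z)] / nz
                product = (n_xz[(x, z)] / nz) * (n_yz[(y, z)] / nz)
                gap = max(gap, abs(joint - product))
    return gap


def simulate_under_ci(rows, rng):
    """Draw a same-sized dataset in which X and Y are independent given Z."""
    by_z = {}
    for x, y, z in rows:
        xs, ys = by_z.setdefault(z, ([], []))
        xs.append(x)
        ys.append(y)
    simulated = []
    for _ in range(len(rows)):
        z = rng.choice(rows)[2]  # z drawn from the empirical P(Z)
        xs, ys = by_z[z]
        # x and y are drawn separately, so they are independent given z
        simulated.append((rng.choice(xs), rng.choice(ys), z))
    return simulated


def mc_tail_fraction(rows, n_sim=200, seed=0):
    """Return the observed gap and the fraction of simulated gaps at least as large."""
    rng = random.Random(seed)
    observed = ci_discrepancy(rows)
    exceed = sum(
        ci_discrepancy(simulate_under_ci(rows, rng)) >= observed
        for _ in range(n_sim)
    )
    return observed, exceed / n_sim
```

A small tail fraction suggests the observed gap is larger than what sampling noise alone would produce under the assumed independence. A full BN would need one such check per conditional independence statement implied by its structure, which is one reason a diagnostic of this kind becomes harder to read as the network grows.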

Stephen J. Walsh | Paul D. Whitney
