论文信息 - Learning Belief Networks in the Presence of Missing Values and Hidden Variables

Learning Belief Networks in the Presence of Missing Values and Hidden Variables

In recent years there has been a flurry of works on learning probabilistic belief networks. Current state of the art methods have been shown to be successful for two learning scenarios: learning both network structure and parameters from complete data, and learning parameters for a fixed network from incomplete data—that is, in the presence of missing values or hidden variables. However, no method has yet been demonstrated to effectively learn network structure from incomplete data. In this paper, we propose a new method for learning network structure from incomplete data. This method is based on an extension of the Expectation-Maximization (EM) algorithm for model selection problems that performs search for the best structure inside the EM procedure. We prove the convergence of this algorithm, and adapt it for learning belief networks. We then describe how to learn networks in two scenarios: when the data contains missing values, and in the presence of hidden variables. We provide experimental results that show the effectiveness of our procedure in both scenarios.

Nir Friedman | N. Friedman

[1] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[2] G. Schwarz. Estimating the Dimension of a Model , 1978 .

[3] David J. Spiegelhalter,et al. Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[4] Judea Pearl,et al. Probabilistic reasoning in intelligent systems , 1988 .

[5] James Kelly,et al. AutoClass: A Bayesian Classification System , 1993, ML.

[6] Gregory F. Cooper,et al. The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[7] Andrew L. Rukhin,et al. Tools for statistical inference , 1991 .

[8] Wai Lam,et al. LEARNING BAYESIAN BELIEF NETWORKS: AN APPROACH BASED ON THE MDL PRINCIPLE , 1994, Comput. Intell..

[9] S. Lauritzen. The EM algorithm for graphical association models with missing data , 1995 .

[10] Michael P. Wellman,et al. Real-world applications of Bayesian networks , 1995, CACM.

[11] David Maxwell Chickering,et al. Efficient Approximations for the Marginal Likelihood of Incomplete Data Given a Bayesian Network , 1996, UAI.

[12] G. McLachlan,et al. The EM algorithm and extensions , 1996 .

[13] Catherine Blake,et al. UCI Repository of machine learning databases , 1998 .