Learning Belief Networks in Pseudo-independent Domains Learning Belief Networks in Pseudo-independent Domains

A pseudo-independent (PI) model is a probabilistic domain model (PDM) where proper subsets of a set of collectively dependent variables display marginal independence. It has been shown that several commonly used algorithms for learning belief networks will fail when the data generating model is a PI model. A better understanding of these models will facilitate the design of better learning algorithms to copy with them. We propose formal deenitions of the whole spectrum of discrete PI models. We present a parameterization of PI models which leads to a better understanding of the mechanism that forms PI models. We then show that the well-known parity problems in machine learning is a degeneration of PI models. Therefore, PI models form a more general class of challenging problems for learning from data. Using an abstraction of a class of algorithms for learning belief networks , we show that PI models form the dividing line between success and failure of an algorithm using some form of conditional independence test and a single link search. We proposed several strategies to manage the computational complexity when a single link search is extended into a multi-link search. The eeectiveness of these strategies are demonstrated with experimental results.

[1]  Yang Xiang,et al.  CONSTRUCTION OF A MARKOV NETWORK FROM DATA FOR PROBABILISTIC INFERENCE , 1994 .

[2]  Steffen L. Lauritzen,et al.  Bayesian updating in causal probabilistic networks by local computations , 1990 .

[3]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[4]  Kathryn B. Laskey,et al.  Network Engineering for Complex Belief Networks , 1996, UAI.

[5]  A. Dawid,et al.  Hyper Markov Laws in the Statistical Analysis of Decomposable Graphical Models , 1993 .

[6]  Yang Xiang,et al.  A Probabilistic Framework for Cooperative Multi-Agent Distributed Interpretation and Optimization of Communication , 1996, Artif. Intell..

[7]  Gregory F. Cooper,et al.  An Entropy-driven System for Construction of Probabilistic Expert Systems from Databases , 1990, UAI.

[8]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[9]  Steffen L. Lauritzen,et al.  Independence properties of directed markov fields , 1990, Networks.

[10]  Stuart J. Russell,et al.  The BATmobile: Towards a Bayesian Automated Taxi , 1995, IJCAI.

[11]  Judea Pearl,et al.  The recovery of causal poly-trees from statistical data , 1987, Int. J. Approx. Reason..

[12]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[13]  P. Spirtes,et al.  An Algorithm for Fast Recovery of Sparse Causal Graphs , 1991 .

[14]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[15]  M. Golumbic Algorithmic graph theory and perfect graphs , 1980 .

[16]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems , 1988 .

[17]  Eugene Charniak,et al.  Bayesian Networks without Tears , 1991, AI Mag..

[18]  David Heckerman,et al.  Decision-theoretic troubleshooting , 1995, CACM.

[19]  Wai Lam,et al.  LEARNING BAYESIAN BELIEF NETWORKS: AN APPROACH BASED ON THE MDL PRINCIPLE , 1994, Comput. Intell..

[20]  Allan Leck Jensen,et al.  MIDAS: An Influence Diagram for Management of Mildew in Winter Wheat , 1996, UAI.

[21]  Yang Xiang,et al.  Critical Remarks on Single Link Search in Learning Belief Networks , 1996, UAI.

[22]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[23]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[24]  Wray L. Buntine Operations for Learning with Graphical Models , 1994, J. Artif. Intell. Res..

[25]  Chris Thornton,et al.  Parity: The Problem that Won't Go Away , 1996, Canadian Conference on AI.

[26]  Ron Kohavi,et al.  Irrelevant Features and the Subset Selection Problem , 1994, ICML.