Learning possibilistic graphical models from data

Graphical models - especially probabilistic networks like Bayes networks and Markov networks - are very popular to make reasoning in high-dimensional domains feasible. Since constructing them manually can be tedious and time consuming, a large part of recent research has been devoted to learning them from data. However, if the dataset to learn from contains imprecise information in the form of sets of alternatives instead of precise values, this learning task can pose unpleasant problems. In this paper, we survey an approach to cope with these problems, which is not based on probability theory as the more common approaches like, e.g., expectation maximization, but uses the possibility theory as the underlying calculus of a graphical model. We provide semantic foundations of possibilistic graphical models, explain the rationale of possibilistic decomposition as well as the graphical representation of decompositions of possibility distributions and finally discuss the main approaches to learn possibilistic graphical models from data.

[1]  R. Hartley Transmission of information , 1928 .

[2]  J. M. Hammersley,et al.  Markov fields on finite graphs and lattices , 1971 .

[3]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[4]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[5]  V. Isham An Introduction to Spatial Point Processes and Markov Random Fields , 1981 .

[6]  R. Kruse,et al.  Learning possibilistic networks from data , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[7]  Didier Dubois,et al.  Readings in Fuzzy Sets for Intelligent Systems , 1993 .

[8]  Enrique F. Castillo,et al.  Expert Systems and Probabilistic Network Models , 1996, Monographs in Computer Science.

[9]  Steffen L. Lauritzen,et al.  Graphical models in R , 1996 .

[10]  C. Borgelt,et al.  Evaluation measures for learning probabilistic and possibilistic networks , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[12]  J. Kruskal On the shortest spanning subtree of a graph and the traveling salesman problem , 1956 .

[13]  Enrique H. Ruspini,et al.  On the semantics of fuzzy logic , 1991, Int. J. Approx. Reason..

[14]  C. N. Liu,et al.  Approximating discrete probability distributions with dependence trees , 1968, IEEE Trans. Inf. Theory.

[15]  Judea Pearl,et al.  Causal networks: semantics and expressiveness , 2013, UAI.

[16]  L. Zadeh Fuzzy sets as a basis for a theory of possibility , 1999 .

[17]  Christian Borgelt,et al.  Efficient maximum projection of database-induced multivariate possibility distributions , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[18]  Tod S. Levitt,et al.  Uncertainty in artificial intelligence , 1988 .

[19]  A. N. Kolmogorov,et al.  Foundations of the theory of probability , 1960 .

[20]  Christian Borgelt,et al.  Data Mining with Graphical Models , 2002, ALT.

[21]  Kristian G. Olesen,et al.  HUGIN - A Shell for Building Bayesian Belief Universes for Expert Systems , 1989, IJCAI.

[22]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[23]  Eric Bauer,et al.  Update Rules for Parameter Estimation in Bayesian Networks , 1997, UAI.

[24]  Alessandro Saffiotti,et al.  Pulcinella: A General Tool for Propagating Uncertainty in Valuation Networks , 1991, UAI.

[25]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[26]  Didier Dubois,et al.  A semantics for possibility theory based on likelihoods , 1995, Proceedings of 1995 IEEE International Conference on Fuzzy Systems..

[27]  L. J. Savage,et al.  The Foundations of Statistics , 1955 .

[28]  Trevor P Martin,et al.  Fril- Fuzzy and Evidential Reasoning in Artificial Intelligence , 1995 .

[29]  Frank Klawonn,et al.  Foundations of fuzzy systems , 1994 .

[30]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[31]  L. Zadeh,et al.  Fuzzy Logic for the Management of Uncertainty , 1992 .

[32]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[33]  Hung T. Nguyen,et al.  On Random Sets and Belief Functions , 1978, Classic Works of the Dempster-Shafer Theory of Belief Functions.

[34]  Rudolf Kruse,et al.  The context model: An integrating view of vagueness and uncertainty , 1993, Int. J. Approx. Reason..

[35]  David Maxwell Chickering,et al.  Learning Bayesian Networks: The Combination of Knowledge and Statistical Data , 1994, Machine Learning.

[36]  Paul P. Wang Advances in Fuzzy Sets, Possibility Theory, and Applications , 1983 .

[37]  Judea Pearl,et al.  A Theory of Inferred Causation , 1991, KR.

[38]  D. Dubois,et al.  When upper probabilities are possibility measures , 1992 .

[39]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[40]  Eric V. Slud Graphical Models: Methods for Data Analysis and Mining , 2003, Technometrics.

[41]  A. Kolmogoroff Grundbegriffe der Wahrscheinlichkeitsrechnung , 1933 .

[42]  Jörg Gebhardt,et al.  Learning from data: possibilistic graphical models , 2000 .

[43]  Rudolf Kruse,et al.  Fuzzy reasoning in a multidimensional space of hypotheses , 1990, Int. J. Approx. Reason..

[44]  David Heckerman,et al.  Probabilistic similarity networks , 1991, Networks.

[45]  Rina Dechter,et al.  Structure Identification in Relational Data , 1992, Artif. Intell..

[46]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[47]  Wolfgang Spohn,et al.  A general non-probabilistic theory of inductive reasoning , 2013, UAI.

[48]  Wang Pei-zhuang From the Fuzzy Statistics to the Falling Random Subsets , 1983 .

[49]  Nir Friedman,et al.  The Bayesian Structural EM Algorithm , 1998, UAI.

[50]  Hung T. Nguyen On modeling of linguistic information using random sets , 1984, Inf. Sci..

[51]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[52]  Christian Borgelt,et al.  Learning graphical models with hypertree structure using a simulated annealing approach , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[53]  Prakash P. Shenoy,et al.  Valuation-based systems: a framework for managing uncertainty in expert systems , 1992 .