Most frugal explanations in Bayesian networks

Inferring the most probable explanation to a set of variables, given a partial observation of the remaining variables, is one of the canonical computational problems in Bayesian networks, with widespread applications in AI and beyond. This problem, known as MAP, is computationally intractable (NP-hard) and remains so even when only an approximate solution is sought. We propose a heuristic formulation of the MAP problem, denoted as Inference to the Most Frugal Explanation (MFE), based on the observation that many intermediate variables (that are neither observed nor to be explained) are irrelevant with respect to the outcome of the explanatory process. An explanation based on few samples (often even a singleton sample) from these irrelevant variables is typically almost as good as an explanation based on (the computationally costly) marginalization over these variables. We show that while MFE is computationally intractable in general (as is MAP), it can be tractably approximated under plausible situational constraints, and its inferences are fairly robust with respect to which intermediate variables are considered to be relevant.

[1]  Solomon Eyal Shimony,et al.  The role of relevance in explanation I: Irrelevance as statistical independence , 1993, Int. J. Approx. Reason..

[2]  Peter J. F. Lucas,et al.  A probabilistic and decision-theoretic approach to the management of infectious disease at the ICU , 2000, Artif. Intell. Medicine.

[3]  Lachlan T. H. Newham,et al.  A Bayesian Network approach to integrating economic and biophysical modelling , 2009 .

[4]  L. T. H. Newhama,et al.  A Bayesian Network approach to integrating economic and biophysical modelling , 2009 .

[5]  Ion Juvina,et al.  Proceedings of the 31st Annual Meeting of the Cognitive Science Society , 2009 .

[6]  M. Resnik,et al.  Aspects of Scientific Explanation. , 1966 .

[7]  Deirdre Wilson,et al.  Relevance theory: A tutorial , 2002 .

[8]  Jan-Ola Östman,et al.  Handbook of Pragmatics , 2018, Handbook of Pragmatics.

[9]  Marek J Druzdzel,et al.  Support of diagnosis of liver disorders based on a causal Bayesian network model. , 2001, Medical science monitor : international medical journal of experimental and clinical research.

[10]  Naomi H. Feldman,et al.  Performing Bayesian Inference with Exemplar Models , 2008 .

[11]  Thomas L. Griffiths,et al.  One and Done? Optimal Decisions From Very Few Samples , 2014, Cogn. Sci..

[12]  A. Darwiche,et al.  Complexity Results and Approximation Strategies for MAP Explanations , 2011, J. Artif. Intell. Res..

[13]  Jörg Flum,et al.  Parameterized Complexity Theory (Texts in Theoretical Computer Science. An EATCS Series) , 2006 .

[14]  Martin Suter,et al.  Small World , 2002 .

[15]  E. Olsson What Is the Problem of Coherence and Truth , 2002 .

[16]  Maria Sonino Legrenzi,et al.  Naive probability: a mental model theory of extensional reasoning. , 1999, Psychological review.

[17]  David H. Glass Inference to the best explanation: does it track truth? , 2010, Synthese.

[18]  Linda C. van der Gaag,et al.  On the Complexity of the MPA Problem in Probabilistic Networks , 2002, ECAI.

[19]  Iris van Rooij,et al.  The Tractable Cognition Thesis , 2008, Cogn. Sci..

[20]  Adnan Darwiche,et al.  Modeling and Reasoning with Bayesian Networks , 2009 .

[21]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[22]  大西 仁,et al.  Pearl, J. (1988, second printing 1991). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan-Kaufmann. , 1994 .

[23]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[24]  Johan Kwisthout,et al.  Most probable explanations in Bayesian networks: Complexity and tractability , 2011, Int. J. Approx. Reason..

[25]  Jörg Flum,et al.  Parameterized Complexity Theory , 2006, Texts in Theoretical Computer Science. An EATCS Series.

[26]  Joseph Y. Halpern,et al.  Defining Explanation in Probabilistic Systems , 1997, UAI.

[27]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[28]  Branden Fitelson A probabilistic theory of coherence , 2003 .

[29]  Larry J. Stockmeyer,et al.  The Polynomial-Time Hierarchy , 1976, Theor. Comput. Sci..

[30]  Hans L. Bodlaender,et al.  Treewidth: Characterizations, Applications, and Computations , 2006, WG.

[31]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[32]  E. Olsson,et al.  Coherentism, reliability and Bayesian networks , 2000 .

[33]  Teemu Roos,et al.  Proceedings of the Fifth European Workshop on Probabilistic Graphical Models , 2010 .

[34]  J. Kwisthout Most Frugal Explanations: Occams Razor Applied to Bayesian Abduction , 2013 .

[35]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.

[36]  Silja Renooij,et al.  Probabilities for a probabilistic network: a case study in oesophageal cancer , 2002, Artif. Intell. Medicine.

[37]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[38]  Marek J Druzdzel,et al.  Relevance in Probabilistic Models: "Backyards" in a "Small World" , 1994 .

[39]  Johan Kwisthout,et al.  Relevancy in Problem Solving: A Computational Framework , 2012, J. Probl. Solving.

[40]  James A. Stori,et al.  A Bayesian network approach to root cause diagnosis of process variations , 2005 .

[41]  Riza Demirer,et al.  Bayesian Networks: A Decision Tool to Improve Portfolio Risk Analysis , 2006 .

[42]  Laurence R. Horn,et al.  The handbook of pragmatics , 2004 .

[43]  Johan Kwisthout,et al.  Bridging the gap between theory and practice of approximate Bayesian inference , 2013, Cognitive Systems Research.

[44]  Gregory F. Cooper,et al.  The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[45]  Johan Kwisthout,et al.  Structure Approximation of Most Probable Explanations in Bayesian Networks , 2013, ECSQARU.

[46]  J. Kwisthout,et al.  The Computational Complexity of Probabilistic Networks , 2009 .

[47]  Jacobo Torán,et al.  Complexity classes defined by counting quantifiers , 1991, JACM.

[48]  Van der Gaag,et al.  Development of a Probabilistic Network for Clinical Detection of Classical Swine Fever , .

[49]  Michael R. Fellows,et al.  Parameterized Complexity , 1998 .

[50]  Changhe Yuan,et al.  Most Relevant Explanation in Bayesian Networks , 2011, J. Artif. Intell. Res..

[51]  Bruce D'Ambrosio,et al.  Proceedings of the Eighth international conference on Uncertainty in artificial intelligence , 1992 .

[52]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[53]  Johan Kwisthout Two New Notions of Abduction in Bayesian Networks , 2010 .

[54]  Solomon Eyal Shimony,et al.  Finding MAPs for Belief Networks is NP-Hard , 1994, Artif. Intell..

[55]  Philip N. Johnson-Laird,et al.  Naive Probability: A Mental Model Theory of Extensional Reasoning , 1999 .

[56]  David H. Glass,et al.  Coherence measures and inference to the best explanation , 2007, Synthese.

[57]  Salil P. Vadhan,et al.  Computational Complexity , 2005, Encyclopedia of Cryptography and Security.

[58]  Johan Kwisthout,et al.  The Complexity of Finding kth Most Probable Explanations in Probabilistic Networks , 2011, SOFSEM.

[59]  F. Brown The frame problem in artificial intelligence , 1987 .

[60]  Johan Kwisthout Most Inforbable Explanations: Finding Explanations in Bayesian Networks That Are Both Probable and Informative , 2013, ECSQARU.

[61]  Marek J. Druzdzel,et al.  Some Properties of joint Probability Distributions , 1994, UAI.

[62]  Ashraf M. Abdelbar,et al.  Approximating MAPs for Belief Networks is NP-Hard and Other Theorems , 1998, Artif. Intell..

[63]  Richard E. Neapolitan,et al.  Probabilistic reasoning in expert systems - theory and algorithms , 2012 .

[64]  Tibor Gyimóthy,et al.  Proceedings of the 37th international conference on Current trends in theory and practice of computer science , 2011 .

[65]  K. McRae,et al.  Proceedings of the 30th Annual Conference of the Cognitive Science Society. , 2008 .

[66]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[67]  Joyojeet Pal,et al.  Bayesian Networks: an Exploratory Tool for Understanding ICT Adoption , 2006, 2006 International Conference on Information and Communication Technologies and Development.

[68]  Jozef Gemela,et al.  Financial analysis using Bayesian networks , 2001 .

[69]  José Manuel Gutiérrez,et al.  Bayesian Networks for Probabilistic Weather Prediction , 2002, ECAI.

[70]  Ernest Lepore,et al.  Holism: A Shopper's Guide , 1992 .

[71]  Kentaro Toyama,et al.  Proceedings of the 4th ACM/IEEE International Conference on Information and Communication Technologies and Development , 2010, ICTD 2010.

[72]  Kevin B. Korb,et al.  Seabreeze Prediction Using Bayesian Networks , 2001, PAKDD.

[73]  Bart Verheij,et al.  Proceedings of the 22nd Benelux Conference on Artificial Intelligence (BNAIC 2010) , 2010 .

[74]  Gordon D. A. Brown,et al.  Decision by sampling , 2006, Cognitive Psychology.

[75]  David H. Glass,et al.  Inference to the Best Explanation: a comparison of approaches , 2009 .

[76]  Karolin Baecker,et al.  Inference to the Best Explanation: , 2021, The Material Theory of Induction.

[77]  S.Wayne,et al.  INTERNATIONAL SOCIETY FOR VETERINARY EPIDEMIOLOGY AND ECONOMICS , 1991 .

[78]  Klaus W. Wagner,et al.  The complexity of combinatorial problems with succinct input representation , 1986, Acta Informatica.

[79]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[80]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[81]  Johan Kwisthout,et al.  Bayesian Intractability Is Not an Ailment That Approximation Can Cure , 2011, Cogn. Sci..