Paper Information - Interventional Sum-Product Networks: Causal Inference with Tractable Probabilistic Models

Probabilistic models are an important tool for studying causality, but inference in them is often intractable. As a step towards tractable causal models, we consider the problem of learning interventional distributions using sum-product networks (SPNs) that are over-parameterized by gate functions, e.g., neural networks. Given an arbitrarily intervened causal graph as input, which effectively subsumes Pearl’s do-operator, the gate function predicts the parameters of the SPN. The resulting interventional SPNs are motivated and illustrated by a structural causal model themed around personal health. Our empirical evaluation on three benchmark data sets and a synthetic health data set shows that interventional SPNs are both expressive in modelling and flexible in adapting to interventions.
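
The idea in the abstract, a gate function that reads an intervened causal graph and outputs SPN parameters, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the graph is encoded as a flattened adjacency matrix, the gate is a single linear layer with random, untrained weights, and the SPN is one sum node over product nodes of Bernoulli leaves. The names `mutilate`, `Gate`, and `spn_prob` are hypothetical.

```python
import math
import random


def mutilate(adj, target):
    """Copy `adj` with every edge into `target` removed.

    adj[i][j] == 1 means an edge i -> j. Cutting the parents of the
    intervened variable is the graph surgery associated with do(target).
    """
    n = len(adj)
    return [[0 if j == target else adj[i][j] for j in range(n)] for i in range(n)]


def softmax(zs):
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class Gate:
    """Hypothetical gate: one linear layer from the graph to SPN parameters.

    The weights are random and untrained; a real model would fit them to data.
    """

    def __init__(self, n_vars, n_components, seed=0):
        rng = random.Random(seed)
        n_in = n_vars * n_vars
        n_out = n_components + n_components * n_vars
        self.n_vars, self.n_components = n_vars, n_components
        self.weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
        self.bias = [rng.uniform(-1, 1) for _ in range(n_out)]

    def __call__(self, adj):
        x = [float(v) for row in adj for v in row]  # flattened adjacency matrix
        out = [sum(w * xi for w, xi in zip(row, x)) + b
               for row, b in zip(self.weights, self.bias)]
        k, n = self.n_components, self.n_vars
        sum_weights = softmax(out[:k])  # mixture weights of the sum node
        leaf_probs = [[sigmoid(out[k + c * n + v]) for v in range(n)]
                      for c in range(k)]  # P(X_v = 1) under component c
        return sum_weights, leaf_probs


def spn_prob(sample, sum_weights, leaf_probs):
    """Evaluate a sum node over product nodes of Bernoulli leaves.

    Entries of `sample` are 0, 1, or None; None marginalizes that variable,
    because a normalized leaf sums to 1 over its values.
    """
    total = 0.0
    for w, probs in zip(sum_weights, leaf_probs):
        prod = 1.0
        for value, p in zip(sample, probs):
            if value is not None:
                prod *= p if value == 1 else 1.0 - p
        total += w * prod
    return total


# Hypothetical 3-variable chain 0 -> 1 -> 2, then the intervention do(X_1).
adj = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
gate = Gate(n_vars=3, n_components=2)
params_obs = gate(adj)
params_do = gate(mutilate(adj, target=1))
p_marginal = spn_prob([None, None, 1], *params_do)  # P(X_2 = 1) under do(X_1)
```

Because the same SPN structure is reused and only its parameters change with the input graph, each query, including the marginal in the last line, costs a single bottom-up pass through the network. That is the sense in which inference stays tractable in this sketch.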
