Finding the Minimal Sufficient Set in Causal Graph Under Interventions: A Dimension Reduction Method for Data Analysis

One of the ultimate goals of data analysis is to uncover the inherent causality in data. Pearl’s causal graph model provides the fundamental framework for measuring changes caused by external interventions that are commonly used to reveal the causality. Although the criteria and algorithms of atomic intervention for evaluating causal effects have been proposed in previous studies, the research of causal effects under conditional intervention remains inadequate. In this paper, we propose a criterion to combine the back-door criterion of atomic intervention with conditional intervention when the treatment variable is unique. Under the criterion, the derived minimal sufficient sets under atomic intervention (a-MSSs) can be converted into the one under conditional intervention (c-MSSs). The calculation of c-MSSs can remarkably decrease the complexity of target data analysis regarding data dimension reduction. Based on those steps, we also develop an algorithm to implement the conversion. Case studies demonstrate that our algorithm can enumerate both a-MSSs and c-MSSs effectively when a causal graph is given, which validates the effectiveness of our proposed scheme.

[1]  The International Journal of Biostatistics Interval Estimation of Some Epidemiological Measures of Association Tasneem Zaihra , University of New Brunswick , 2011 .

[2]  Steffen L. Lauritzen,et al.  Independence properties of directed markov fields , 1990, Networks.

[3]  Elias Bareinboim,et al.  Causal inference and the data-fusion problem , 2016, Proceedings of the National Academy of Sciences.

[4]  Johannes Textor,et al.  A Complete Generalized Adjustment Criterion , 2015, UAI.

[5]  Maciej Liskiewicz,et al.  Adjustment Criteria in Causal Diagrams: An Algorithmic Perspective , 2011, UAI.

[6]  Johannes Textor,et al.  Drawing and Analyzing Causal DAGs with DAGitty , 2015, ArXiv.

[7]  Ian Shrier,et al.  Reducing bias through directed acyclic graphs , 2008 .

[8]  J. Pearl Causal diagrams for empirical research , 1995 .

[9]  Johannes Textor,et al.  DAGitty: a graphical tool for analyzing causal diagrams. , 2011, Epidemiology.

[10]  Francesco Marcelloni,et al.  A MapReduce solution for associative classification of big data , 2016, Inf. Sci..

[11]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[12]  B. Shepherd,et al.  GUIDO IMBENS, DONALD RUBIN, Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. New York: Cambridge University Press. , 2016, Biometrics.

[13]  Jill K Thompson,et al.  Thiostrepton-resistant mutants of Thermus thermophilus. , 2004, Nucleic acids research.

[14]  Kim-Kwang Raymond Choo,et al.  Privacy-Preserving-Outsourced Association Rule Mining on Vertically Partitioned Databases , 2016, IEEE Transactions on Information Forensics and Security.

[15]  Jun Wang,et al.  The Visual Causality Analyst: An Interactive Interface for Causal Reasoning , 2016, IEEE Transactions on Visualization and Computer Graphics.

[16]  Judea Pearl,et al.  Identification of Conditional Interventional Distributions , 2006, UAI.

[17]  Judea Pearl,et al.  The International Journal of Biostatistics C AUSAL I NFERENCE An Introduction to Causal Inference , 2011 .