Partial Correlation-and Regression-Based Approaches to Causal Structure Learning Technical Report

We present the Total Conditioning (TC) algorithm for causal discovery suited in the presence of continuous variables. Given a set of n data points drawn from a distribution whose underlying causal structure is a directed acyclic graph (DAG), the TC algorithm returns a structure, i.e., a DAG, over the variables that tends to the correct structure when n tends to infinity. The approach builds on the structural equation modeling framework, well suited for continuous variables, and relies on causal Bayesian networks semantics to handle consistency. We compare TC and a variant, TCbw, which borrows techniques from feature selection for robustness when the number of samples is small, to the state-of-the-art PC algorithm. We show that TCbw has identical or better performance when n exceeds the number of variables, while benefiting from a better time complexity.

[1]  Tao Lin,et al.  A Structural Characterization of DAG-Isomorphic Dependency Models , 2002, Canadian Conference on AI.

[2]  A. H. Murphy,et al.  Hailfinder: A Bayesian system for forecasting severe weather , 1996 .

[3]  Ewart R. Carson,et al.  A Model-Based Approach to Insulin Adjustment , 1991, AIME.

[4]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[5]  Judea Pearl,et al.  A Theory of Inferred Causation , 1991, KR.

[6]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[7]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[8]  Christopher Meek,et al.  Causal inference and causal explanation with background knowledge , 1995, UAI.

[9]  Dimitris Margaritis,et al.  Distribution-Free Learning of Bayesian Network Structure in Continuous Domains , 2005, AAAI.

[10]  Gregory F. Cooper,et al.  The ALARM Monitoring System: A Case Study with two Probabilistic Inference Techniques for Belief Networks , 1989, AIME.

[11]  Constantin F. Aliferis,et al.  Time and sample efficient discovery of Markov blankets and direct causal relations , 2003, KDD '03.

[12]  G. Judge,et al.  Introduction to the Theory and Practice , 1988 .

[13]  R. Shibata,et al.  PARTIAL CORRELATION AND CONDITIONAL CORRELATION AS MEASURES OF CONDITIONAL INDEPENDENCE , 2004 .

[14]  Constantin F. Aliferis,et al.  HITON: A Novel Markov Blanket Algorithm for Optimal Variable Selection , 2003, AMIA.

[15]  R Scheines,et al.  The TETRAD Project: Constraint Based Aids to Causal Model Specification. , 1998, Multivariate behavioral research.

[16]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[17]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[18]  A. Raveh On the use of the Inverse of the Correlation Matrix in Multivariate Data Analysis , 1985 .

[19]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.