Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

Measurement error in the observed values of the variables can greatly change the output of various causal discovery methods. This problem has received much attention in multiple fields, but it is not clear to what extent the causal model for the measurement-error-free variables can be identified in the presence of measurement error with unknown variance. In this paper, we study precise sufficient identifiability conditions for the measurement-error-free causal model and show what information of the causal model can be recovered from observed data. In particular, we present two different sets of identifiability conditions, based on the second-order statistics and higher-order statistics of the data, respectively. The former was inspired by the relationship between the generating model of the measurement-error-contaminated data and the factor analysis model, and the latter makes use of the identifiability result of the over-complete independent component analysis problem.

[1]  Richard Scheines,et al.  Measurement Error and Causal Discovery , 2016, CFA@UAI.

[2]  Calyampudi R. Rao,et al.  Characterization Problems in Mathematical Statistics , 1976 .

[3]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[4]  Visa Koivunen,et al.  Identifiability, separability, and uniqueness of linear ICA models , 2004, IEEE Signal Processing Letters.

[5]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[6]  Aapo Hyvärinen,et al.  DirectLiNGAM: A Direct Method for Learning a Linear Non-Gaussian Structural Equation Model , 2011, J. Mach. Learn. Res..

[7]  Clark Glymour,et al.  Learning the Structure of Deterministic Systems , 2007 .

[8]  Wei Luo,et al.  Learning Bayesian Networks in Semi-deterministic Systems , 2006, Canadian Conference on AI.

[9]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[10]  T. Berge,et al.  Generic global indentification in factor analysis , 1997 .

[11]  Brian Everitt,et al.  An Introduction to Latent Variable Models , 1984 .

[12]  Richard Scheines,et al.  Causal Clustering for 1-Factor Measurement Models , 2016, KDD.

[13]  Alexander Shapiro,et al.  Identifiability of factor analysis: Some results and open problems , 1985 .

[14]  David Maxwell Chickering,et al.  Optimal Structure Identification With Greedy Search , 2003, J. Mach. Learn. Res..

[15]  Richard Scheines,et al.  Learning the Structure of Linear Latent Variable Models , 2006, J. Mach. Learn. Res..