Discovery of Linear Non-Gaussian Acyclic Models in the Presence of Latent Classes

An effective way to examine causality is to conduct an experiment with random assignment. However, in many cases it is impossible or too expensive to perform controlled experiments, and hence one often has to resort to methods for discovering good initial causal models from data which do not come from such controlled experiments. We have recently proposed such a discovery method based on independent component analysis (ICA) called LiNGAM and shown how to completely identify the data generating process under the assumptions of linearity, non-gaussianity, and no latent variables. In this paper, after briefly recapitulating this approach, we extend the framework to cases where latent classes (hidden groups) are present. The model identification can be accomplished using a method based on ICA mixtures. Simulations confirm the validity of the proposed method.

[1]  Pierre Comon,et al.  Independent component analysis, A new concept? , 1994, Signal Process..

[2]  B. Muthén BEYOND SEM: GENERAL LATENT VARIABLE MODELING , 2002 .

[3]  Shohei Shimizu,et al.  Use of non-normality in structural equation modeling: Application to direction of causation , 2008 .

[4]  Aapo Hyvärinen,et al.  Finding a causal ordering via independent component analysis , 2006, Comput. Stat. Data Anal..

[5]  Patrik O. Hoyer,et al.  Estimation of linear, non-gaussian causal models in the presence of confounding latent variables , 2006, Probabilistic Graphical Models.

[6]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[7]  Wei Zhu,et al.  Unified structural equation modeling approach for the analysis of multisubject, multivariate functional MRI data , 2007, Human brain mapping.

[8]  J. Tanner,et al.  Parent‐child correlations for body measurements of children between the ages one month and seven years , 1963, Annals of human genetics.

[9]  P. Spirtes,et al.  Causation, Prediction, and Search, 2nd Edition , 2001 .

[10]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[11]  Mihoko Minami,et al.  Exploring Latent Structure of Mixture ICA Models by the Minimum -Divergence Method , 2006, Neural Computation.

[12]  P. Holland Statistics and Causal Inference , 1985 .

[13]  Philippe Garat,et al.  Blind separation of mixture of independent sources through a quasi-maximum likelihood approach , 1997, IEEE Trans. Signal Process..

[14]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[15]  Terrence J. Sejnowski,et al.  ICA Mixture Models for Unsupervised Classification of Non-Gaussian Classes and Automatic Context Switching in Blind Signal Separation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Satoru Miyano,et al.  Estimation of Genetic Networks and Functional Structures Between Genes by Using Bayesian Networks and Nonparametric Regression , 2001, Pacific Symposium on Biocomputing.