Extensions of ICA for Causality Discovery in the Hong Kong Stock Market

Recently independent component analysis (ICA) has been proposed for discovery of linear, non-Gaussian, and acyclic causal models (LiNGAM). As in practice the LiNGAM assumption usually does not exactly hold, in this paper we propose some methods to perform causality discovery even when LiNGAM is violated. The first method is ICA with a sparse separation matrix. By incorporating a suitable penalty term, the separation matrix produced by this method tends to satisfy the LiNGAM assumption. The other two methods are proposed to tackle nonlinearity in the data generation procedure, which violates the LiNGAM assumption. In the second method, the post-nonlinear mixing ICA model is exploited to do causality discovery when the nonlinearity is component-wise. The third method is proposed for the case where the nonlinear distortion in data generation is of arbitrary form, but smooth and weak. The separation system for such data is a linear transformation coupled with a nonlinear one, and the nonlinear one is as weak as possible such that it can be neglected when performing causality discovery. The linear causal relations in the data are then revealed. The proposed methods are applied to discover the causal relations in the Hong Kong stock market, and the last method works very well. The resulting causal diagram shows some interesting information in the stock market.

[1]  Y. Dodge,et al.  On Asymmetric Properties of the Correlation Coeffcient in the Regression Setting , 2001 .

[2]  L. B. Almeida,et al.  Linear and nonlinear ICA based on mutual information , 2000, Proceedings of the IEEE 2000 Adaptive Systems for Signal Processing, Communications, and Control Symposium (Cat. No.00EX373).

[3]  Jan W. Rivkin,et al.  Interorganizational Ties and Business Group Boundaries: Evidence from an Emerging Economy , 2006, Organ. Sci..

[4]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[5]  J. Karhunen,et al.  Advances in Nonlinear Blind Source Separation , 2003 .

[6]  Aapo Hyvärinen,et al.  A Linear Non-Gaussian Acyclic Model for Causal Discovery , 2006, J. Mach. Learn. Res..

[7]  Christian Jutten,et al.  Source separation in post-nonlinear mixtures , 1999, IEEE Trans. Signal Process..

[8]  E. Oja,et al.  Independent Component Analysis , 2013 .

[9]  Lai-Wan Chan,et al.  Extended Gaussianization Method for Blind Separation of Post-Nonlinear Mixtures , 2005, Neural Computation.

[10]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[11]  Luís B. Almeida,et al.  MISEP -- Linear and Nonlinear ICA Based on Mutual Information , 2003, J. Mach. Learn. Res..

[12]  Philippe Garat,et al.  Blind separation of mixture of independent sources through a quasi-maximum likelihood approach , 1997, IEEE Trans. Signal Process..

[13]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .