Reconstructing directional causal networks with random forest: Causality meeting machine learning.

Inspired by the decision tree algorithm in machine learning, a novel causal network reconstruction framework is proposed with the name Importance Causal Analysis (ICA). The ICA framework is designed in a network level and fills the gap between traditional mutual causality detection methods and the reconstruction of causal networks. The potential of the method to identify the true causal relations in complex networks is validated by both benchmark systems and real-world data sets.

[1]  A. Seth,et al.  Granger causality and transfer entropy are equivalent for Gaussian variables. , 2009, Physical review letters.

[2]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[3]  Anil K. Seth,et al.  The MVGC multivariate Granger causality toolbox: A new approach to Granger-causal inference , 2014, Journal of Neuroscience Methods.

[4]  K. Kendrick,et al.  Partial Granger causality—Eliminating exogenous inputs and latent variables , 2008, Journal of Neuroscience Methods.

[5]  B. Graham Methods of Identification in Social Networks , 2014 .

[6]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[7]  W. Cohen,et al.  Testing a Landsat-based approach for mapping disturbance causality in U.S. forests , 2017 .

[8]  Hongzhe Li,et al.  Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks. , 2006, Biostatistics.

[9]  Xiang-Sun Zhang,et al.  A network biology study on circadian rhythm by integrating various omics data. , 2009, Omics : a journal of integrative biology.

[10]  George Sugihara,et al.  Dynamical evidence for causality between galactic cosmic rays and interannual variation in global temperature , 2015, Proceedings of the National Academy of Sciences.

[11]  J. Takahashi,et al.  Molecular components of the mammalian circadian clock. , 2006, Human molecular genetics.

[12]  Haiyan Huang,et al.  Review on statistical methods for gene network reconstruction using expression data. , 2014, Journal of theoretical biology.

[13]  Masamitsu Iino,et al.  System-level identification of transcriptional circuits underlying mammalian circadian clocks , 2005, Nature Genetics.

[14]  D. Floreano,et al.  Revealing strengths and weaknesses of methods for gene network inference , 2010, Proceedings of the National Academy of Sciences.

[15]  Luonan Chen,et al.  Data-based prediction and causality inference of nonlinear dynamics , 2017, Science China Mathematics.

[16]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[17]  M. Scheffer,et al.  Causal feedbacks in climate change , 2015 .

[18]  René J. Huster,et al.  Methods for Simultaneous EEG-fMRI: An Introductory Review , 2012, The Journal of Neuroscience.

[19]  George Sugihara,et al.  Detecting Causality in Complex Ecosystems , 2012, Science.

[20]  U. Alon Network motifs: theory and experimental approaches , 2007, Nature Reviews Genetics.

[21]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[22]  Ying-Cheng Lai,et al.  Detection of time delays and directional interactions based on time series from complex dynamical systems. , 2017, Physical review. E.

[23]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[24]  Gordon Pipa,et al.  Transfer entropy—a model-free measure of effective connectivity for the neurosciences , 2010, Journal of Computational Neuroscience.

[25]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[26]  P. F. Verdes Assessing causality from multivariate time series. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Y. Sakaki,et al.  Establishment of cell lines derived from the rat suprachiasmatic nucleus. , 2007, Biochemical and biophysical research communications.

[28]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[29]  Bernhard Schölkopf,et al.  Inferring causation from time series in Earth system sciences , 2019, Nature Communications.

[30]  D. Helbing,et al.  The Hidden Geometry of Complex, Network-Driven Contagion Phenomena , 2013, Science.

[31]  S. Frenzel,et al.  Partial mutual information for coupling analysis of multivariate time series. , 2007, Physical review letters.

[32]  Dario Floreano,et al.  Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods , 2009, J. Comput. Biol..

[33]  J Runge,et al.  Causal network reconstruction from time series: From theoretical assumptions to practical estimation. , 2018, Chaos.

[34]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[35]  Kazuyuki Aihara,et al.  Detecting Causality by Combined Use of Multiple Methods: Climate and Brain Examples , 2016, PloS one.

[36]  Jürgen Kurths,et al.  Escaping the curse of dimensionality in estimating multivariate transfer entropy. , 2012, Physical review letters.

[37]  Willem Waegeman,et al.  A non-linear Granger-causality framework to investigate climate–vegetation dynamics , 2016 .

[38]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[39]  Kazuyuki Aihara,et al.  Detecting Causality from Nonlinear Dynamics with Short-term Time Series , 2014, Scientific Reports.

[40]  Kilian Stoffel,et al.  Theoretical Comparison between the Gini Index and Information Gain Criteria , 2004, Annals of Mathematics and Artificial Intelligence.

[41]  Paul L. Joskow,et al.  The effects of economic regulation , 1989 .

[42]  Luonan Chen,et al.  Part mutual information for quantifying direct associations in networks , 2016, Proceedings of the National Academy of Sciences.

[43]  R. Burke,et al.  Detecting dynamical interdependence and generalized synchrony through mutual prediction in a neural ensemble. , 1996, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[44]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[45]  George Sugihara,et al.  Predicting climate effects on Pacific sardine , 2013, Proceedings of the National Academy of Sciences.

[46]  Consolación Gil,et al.  Optimization methods applied to renewable and sustainable energy: A review , 2011 .

[47]  Schreiber,et al.  Measuring information transfer , 2000, Physical review letters.

[48]  Zoran Levnajic,et al.  Reconstructing dynamical networks via feature ranking , 2019, Chaos.