Causal discovery from sequential data in ALS disease based on entropy criteria

One of the most important issues in predictive modeling is to determine major cause factors of a phenomenon and causal relationships between them. Extracting causal relationships between parameters in a natural phenomenon can be accomplished through checking the parameters' changes in consecutive events. In addition, using information and probabilistic theory help better conception of causal relationships of a phenomenon. Therefore, probabilistic causal discovery from sequential data of a natural phenomenon can be useful for dimension reduction and predicting the future trend of a process. In this paper, we introduce a novel method for causal discovery from a sequential data based on a probabilistic causal graph. In this method, first, Causal Feature Dependency matrix (CFD matrix) is generated based on the features' changes in consecutive events. Then, a probabilistic causal graph is created from CFD matrix. In this graph, some valueless features will be eliminated on the basis of entropy value of each conditional density function. Finally, prediction operation is performed based on the output of causal graph. Experimental results on the Pooled Resource Open-Access ALS Clinical Trials (PRO-ACT) sequential data set from Amyotrophic Lateral Sclerosis (ALS) disease show that our proposed algorithm can predict the progression rate of ALS disease properly with high precision.

[1]  Snigdhansu Chatterjee,et al.  Causality and pathway search in microarray time series experiment , 2007, Bioinform..

[2]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[3]  I. Good A CAUSAL CALCULUS (I)* , 1961, The British Journal for the Philosophy of Science.

[4]  R. Kass,et al.  Multiple neural spike train data analysis: state-of-the-art and future challenges , 2004, Nature Neuroscience.

[5]  J. Pearl Causal inference in statistics: An overview , 2009 .

[6]  George Hripcsak,et al.  Methodological Review: A review of causal inference for biomedical informatics , 2011 .

[7]  Nancy Green,et al.  A Bayesian network coding scheme for annotating biomedical information presented to genetic counseling clients , 2005, J. Biomed. Informatics.

[8]  Evert de Jonge,et al.  Using hierarchical dynamic Bayesian networks to investigate dynamics of organ failure in patients in the Intensive Care Unit , 2010, J. Biomed. Informatics.

[9]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[10]  David Hume A Treatise of Human Nature: Being an Attempt to introduce the experimental Method of Reasoning into Moral Subjects , 1972 .

[11]  S. Bressler,et al.  Granger Causality: Basic Theory and Application to Neuroscience , 2006, q-bio/0608035.

[12]  Junpeng Zhang,et al.  Identifying direct miRNA-mRNA causal regulatory relationships in heterogeneous data , 2014, J. Biomed. Informatics.

[13]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[14]  Jürgen Hesser,et al.  Virtual Intensive Care Unit (ICU): Real-Time Simulation Environment Applying Hybrid Approach Using Dynamic Bayesian Networks and ODEs , 2009, MMVR.

[15]  Bhubaneswar Mishra,et al.  An Algorithmic Enquiry Concerning Causality , 2010 .

[16]  A. Seth,et al.  Granger Causality Analysis in Neuroscience and Neuroimaging , 2015, The Journal of Neuroscience.

[17]  Claudio Cobelli,et al.  A Dynamic Bayesian Network model for long-term simulation of clinical complications in type 1 diabetes , 2015, J. Biomed. Informatics.

[18]  Steen Andreassen,et al.  Generic causal probabilistic networks: A solution to a problem of transferability in medical decision support , 2008, Comput. Methods Programs Biomed..

[19]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[20]  S A Kauffman,et al.  LETTER TO THE EDITOR: A simple method for reverse engineering causal networks , 2006 .

[21]  Min Zou,et al.  A new dynamic Bayesian network (DBN) approach for identifying gene regulatory networks from time course microarray data , 2005, Bioinform..

[22]  André Elisseeff,et al.  Using Markov Blankets for Causal Structure Learning , 2008, J. Mach. Learn. Res..

[23]  Steffen L. Lauritzen,et al.  Causal Inference from Graphical Models , 2001 .

[24]  R. Borgoni,et al.  Simulating interventions in graphical chain models for longitudinal data , 2015 .

[25]  Dimitrios I. Fotiadis,et al.  A Multiscale Approach for Modeling Atherosclerosis Progression , 2015, IEEE Journal of Biomedical and Health Informatics.

[26]  Loris Zanier,et al.  The Incidence of Amyotrophic Lateral Sclerosis in Friuli Venezia Giulia, Italy, from 2002 to 2009: A Retrospective Population-Based Study , 2013, Neuroepidemiology.

[27]  M. Cudkowicz,et al.  The PRO-ACT database , 2014, Neurology.

[28]  Ahmad Samiei,et al.  Identification, analysis, and interpretation of a human serum metabolomics causal network in an observational study , 2016, J. Biomed. Informatics.

[29]  Johann S. Hawe,et al.  Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression , 2014, Nature Biotechnology.

[30]  Rong Jin,et al.  On the Use of Dynamic Bayesian Networks in Reconstructing Functional Neuronal Networks from Spike Train Ensembles , 2010, Neural Computation.

[31]  David A. Bell,et al.  Learning Bayesian networks from data: An information-theory based approach , 2002, Artif. Intell..

[32]  Peter J. F. Lucas,et al.  Dynamic Bayesian networks as prognostic models for clinical patient management , 2008, J. Biomed. Informatics.

[33]  Ahmad Samiei,et al.  Generating a robust statistical causal structure over 13 cardiovascular disease risk factors using genomics data , 2016, J. Biomed. Informatics.

[34]  Ileana Baldi,et al.  Dynamic Bayesian Networks to predict sequences of organ failures in patients admitted to ICU , 2014, J. Biomed. Informatics.