A Logic for Causal Inference in Time Series with Discrete and Continuous Variables

Many applications of causal inference, such as finding the relationship between stock prices and news reports, involve both discrete and continuous variables observed over time. Inference with these complex sets of temporal data, though, has remained difficult and required a number of simplifications. We show that recent approaches for inferring temporal relationships (represented as logical formulas) can be adapted for inference with continuous valued effects. Building on advances in logic, PCTLc (an extension of PCTL with numerical constraints) is introduced here to allow representation and inference of relationships with a mixture of discrete and continuous components. Then, finding significant relationships in the continuous case can be done using the conditional expectation of an effect, rather than its conditional probability. We evaluate this approach on both synthetically generated and actual financial market data, demonstrating that it can allow us to answer different questions than the discrete approach can.

[1]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[2]  Keiji Kanazawa,et al.  A Logic and Time Nets for Probabilistic Inference , 1991, AAAI.

[3]  Petter N. Kolm,et al.  Investigating Causal Relationships in Stock Returns with Temporal Logic Based Methods , 2010 .

[4]  Dirk Husmeier,et al.  Heterogeneous Continuous Dynamic Bayesian Networks with Flexible Structure and Inter-Time Segment Information Sharing , 2010, ICML.

[5]  David R. Gilbert,et al.  A Model Checking Approach to the Parameter Estimation of Biochemical Pathways , 2008, CMSB.

[6]  Kevin P. Murphy,et al.  A Variational Approximation for Bayesian Networks with Discrete and Continuous Latent Variables , 1999, UAI.

[7]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[8]  Bud Mishra,et al.  The Temporal Logic of Causal Structures , 2009, UAI.

[9]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[10]  A. Seth,et al.  Multivariate Granger causality and generalized variance. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Marco Grzegorczyk,et al.  Non-stationary continuous dynamic Bayesian networks , 2009, NIPS.

[12]  B. Efron Large-Scale Simultaneous Hypothesis Testing , 2004 .

[13]  C. Granger Investigating causal relations by econometric models and cross-spectral methods , 1969 .

[14]  Paulo Shakarian,et al.  Annotated probabilistic temporal logic , 2011, TOCL.

[15]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[16]  M. Newton Large-Scale Simultaneous Hypothesis Testing: The Choice of a Null Hypothesis , 2008 .

[17]  E. Fama,et al.  Common risk factors in the returns on stocks and bonds , 1993 .

[18]  Bhubaneswar Mishra,et al.  An Algorithmic Enquiry Concerning Causality , 2010 .

[19]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[20]  Bengt Jonsson,et al.  A logic for reasoning about time and reliability , 1990, Formal Aspects of Computing.

[21]  Uri Lerner,et al.  Inference in Hybrid Networks: Theoretical Limits and Practical Algorithms , 2001, UAI.