Exploring causal influences

Recent data mining techniques exploit patterns of statistical independence in multivariate data to make conjectures about cause/effect relationships. These relationships can be used to construct causal graphs, which are sometimes represented by weighted node-link diagrams, with nodes representing variables and combinations of weighted links and/or nodes showing the strength of causal relationships. We present an interactive visualization for causal graphs (ICGs), inspired in part by the Influence Explorer. The key principles of this visualization are as follows: Variables are represented with vertical bars attached to nodes in a graph. Direct manipulation of variables is achieved by sliding a variable value up and down, which reveals causality by producing instantaneous change in causally and/or probabilistically linked variables. This direct manipulation technique gives users the impression they are causally influencing the variables linked to the one they are manipulating. In this context, we demonstrate the subtle distinction between seeing and setting of variable values, and in an extended example, show how this visualization can help a user understand the relationships in a large variable set, and with some intuitions about the domain and a few basic concepts, quickly detect bugs in causal models constructed from these data mining techniques.

[1]  Judea Pearl,et al.  Fusion, Propagation, and Structuring in Belief Networks , 1986, Artif. Intell..

[2]  L. Dennis,et al.  Sunscreen Use and the Risk for Melanoma: A Quantitative Review , 2003, Annals of Internal Medicine.

[3]  Eric Neufeld,et al.  SIMPSON'S PARADOX IN ARTIFICIAL INTELLIGENCE AND IN REAL LIFE , 1995, Comput. Intell..

[4]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[5]  P. Games Correlation and Causation: A Logical Snafu , 1990 .

[6]  Robert H. Strotz,et al.  Recursive versus non-recursive systems: An attempt at a synthesis , 2017 .

[7]  Steffen L. Lauritzen,et al.  Stable local computation with conditional Gaussian distributions , 2001, Stat. Comput..

[8]  R. Rodgers,et al.  Causal models of publishing productivity in psychology. , 1989 .

[9]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[10]  K. Holzinger,et al.  A study in factor analysis : the stability of a bi-factor solution , 1939 .

[11]  Benjamin B. Bederson,et al.  Space-scale diagrams: understanding multiscale interfaces , 1995, CHI '95.

[12]  D. A. Kenny,et al.  Correlation and Causation , 1937, Wilmott.

[13]  Colin Ware,et al.  Information Visualization: Perception for Design , 2000 .

[14]  D. Freedman From Association to Causation via Regression , 1997 .

[15]  Paul U. Lee,et al.  Lines, Blobs, Crosses and Arrows: Diagrammatic Communication with Schematic Figures , 2000, Diagrams.

[16]  Robert Spence,et al.  Externalising abstract mathematical models , 1996, CHI '96.