Why scatter plots suggest causality, and what we can do about it

Scatter plots carry an implicit if subtle message about causality. Whether we look at functions of one variable in pure mathematics, plots of experimental measurements as a function of the experimental conditions, or scatter plots of predictor and response variables, the value plotted on the vertical axis is by convention assumed to be determined or influenced by the value on the horizontal axis. This is a problem for the public understanding of scientific results and perhaps also for professional scientists' interpretations of scatter plots. To avoid suggesting a causal relationship between the x and y values in a scatter plot, we propose a new type of data visualization, the diamond plot. Diamond plots are essentially 45 degree rotations of ordinary scatter plots; by visually jarring the viewer they clearly indicate that she should not draw the usual distinction between independent/predictor variable and dependent/response variable. Instead, she should see the relationship as purely correlative.

[1]  Michael Gleicher,et al.  Scatterplots: Tasks, Data, and Designs , 2018, IEEE Transactions on Visualization and Computer Graphics.

[2]  Cewin Chao,et al.  Full fat milk consumption protects against severe childhood obesity in Latinos☆☆☆ , 2017, Preventive medicine reports.

[3]  Min Chen,et al.  A Multi‐task Comparative Study on Scatter Plots and Parallel Coordinates Plots , 2015, Comput. Graph. Forum.

[4]  Christopher Rao,et al.  Graphs in Statistical Analysis , 2010 .

[5]  Daniel J. Denis,et al.  The early origins and development of the scatterplot. , 2005, Journal of the history of the behavioral sciences.

[6]  R. Macarthur,et al.  The Theory of Island Biogeography , 1969 .

[7]  H. Borsook,et al.  The Dissociation Constants of Arginine. , 1924, The Biochemical journal.