Reveal, Don't Conceal: Transforming Data Visualization to Improve Transparency.

Reports highlighting the problems with the standard practice of using bar graphs to show continuous data have prompted many journals to adopt new visualization policies. These policies encourage authors to avoid bar graphs and use graphics that show the data distribution; however, they provide little guidance on how to display data effectively. We conducted a systematic review of studies published in top peripheral vascular disease journals to determine what types of figures are used and to assess the prevalence of suboptimal data visualization practices. Among papers with data figures, 47.7% used bar graphs to present continuous data. This primer provides a detailed overview of ways to address this issue by (1) outlining strategies for selecting the correct type of figure depending on the study design, sample size, and type of variable; (2) examining techniques for making effective dot plots, box plots, and violin plots; and (3) illustrating how to avoid sending mixed messages by aligning the figure structure with the study design and statistical analysis. We also present solutions to other common problems identified in the systematic review. Resources include a list of free tools and templates that authors can use to create more informative figures and an online simulator that illustrates why summary statistics are meaningful only when there are enough data to summarize. Last, we consider steps that investigators can take to improve figures in the scientific literature.
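The point that summary statistics are meaningful only when there are enough data to summarize can be illustrated with a small simulation. The sketch below is not the online simulator mentioned above; it is a minimal illustration under assumed settings (a skewed lognormal population, sample sizes of 3, 10, and 100, and a hypothetical helper named `spread_of_sample_means`). It draws many samples of each size and reports how much the sample mean varies from one sample to the next.

```python
import random
import statistics


def spread_of_sample_means(n, repeats=2000, seed=0):
    """Standard deviation of the sample mean across repeated samples of size n.

    Assumption for illustration only: the population is lognormal(0, 1),
    a skewed distribution chosen because a bar showing the mean hides skew.
    """
    rng = random.Random(seed)
    means = []
    for _ in range(repeats):
        sample = [rng.lognormvariate(0.0, 1.0) for _ in range(n)]
        means.append(statistics.fmean(sample))
    return statistics.stdev(means)


# Compare how stable the mean is at small, moderate, and larger sample sizes.
spreads = {n: spread_of_sample_means(n) for n in (3, 10, 100)}
for n, spread in spreads.items():
    print(f"n = {n:>3}: SD of sample means = {spread:.2f}")
```

With very small samples the mean changes substantially between repeats, which is one reason to show the individual data points (for example, in a dot plot) rather than a bar representing a single summary value.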
