Combining Visual Cleansing and Exploration for Clinical Data

Clinical data have their own peculiarities, as they evolve over time, may be incomplete, and are highly heterogeneous. These characteristics turn a thorough analysis into a challenging task, especially since domain experts are aware of the data flaws, which may impact their trust in the data. As we obtained anonymized clinical data from more than 3,500 patients with retinal diseases, we have to address these challenges. We define a workflow that integrates data cleansing and exploration in an iterative process, so that users are able to easily find anomalies and patterns in the data at any point in their analysis. We implement our workflow in a user-centered visual analytics tool with dedicated visualization and interaction techniques. In collaboration with experts, we apply our tool to examine the interdependency between patients’ visual acuity developments and treatment patterns. We find, that real-life data often have unforeseen incidents which can strongly influence the overall visual acuity development. This differs to study results, which are usually conducted under restrictive conditions and have shown visual acuity improvement with on-schedule treatment.

[1]  Bernhard Preim,et al.  Visual Analytics of Missing Data in Epidemiological Cohort Studies , 2017, VCBM.

[2]  I L Bailey,et al.  New Design Principles for Visual Acuity Letter Charts* , 1976, American journal of optometry and physiological optics.

[3]  Christoph Ehlken,et al.  Five-year visual acuity outcomes and injection patterns in patients with pro-re-nata treatments for AMD, DME, RVO and myopic CNV , 2016, British Journal of Ophthalmology.

[4]  Jeffrey Heer,et al.  Profiler: integrated statistical analysis and visualization for data quality assessment , 2012, AVI.

[5]  Heidrun Schumann,et al.  Visual analysis of retinal changes with optical coherence tomography , 2018, The Visual Computer.

[6]  Hans-Peter Seidel,et al.  Efficient and Robust Annotation of Motion Capture Data , 2009 .

[7]  Yuval Shahar,et al.  Distributed, intelligent, interactive visualization and exploration of time-oriented clinical data and their abstractions , 2006, Artif. Intell. Medicine.

[8]  Michael G. Kahn,et al.  The visual display of temporal information , 1991, Artif. Intell. Medicine.

[9]  Miriah D. Meyer,et al.  A Framework for Externalizing Implicit Error Using Visualization , 2019, IEEE Transactions on Visualization and Computer Graphics.

[10]  Jennifer G Stadler,et al.  Improving the Efficiency and Ease of Healthcare Analysis Through Use of Data Visualization Dashboards , 2016, Big Data.

[11]  Heidrun Schumann,et al.  Annotations as a Support for Knowledge Generation - Supporting Visual Analytics in the Field of Ophthalmology , 2018, VISIGRAPP.

[12]  Heather Richter Lipford,et al.  Helping users recall their reasoning process , 2010, 2010 IEEE Symposium on Visual Analytics Science and Technology.

[13]  N. Menachemi,et al.  Benefits and drawbacks of electronic health record systems , 2011 .

[14]  Jeffrey Heer,et al.  CommentSpace: structured support for collaborative visual analysis , 2011, CHI.

[15]  Paolo Papotti,et al.  KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing , 2015, SIGMOD Conference.

[16]  Silvia Miksch,et al.  A Taxonomy of Dirty Time-Oriented Data , 2012, CD-ARES.

[17]  G. Rubin,et al.  Causes of blindness and visual impairment in a population of older Americans: The Salisbury Eye Evaluation Study. , 2000, Archives of ophthalmology.

[18]  Harald Piringer,et al.  Visplause: Visual Data Quality Assessment of Many Time Series Using Plausibility Checks , 2017, IEEE Transactions on Visualization and Computer Graphics.

[19]  Michael Larsen,et al.  Incidence of legal blindness from age-related macular degeneration in denmark: year 2000 to 2010. , 2012, American journal of ophthalmology.

[20]  Masroor Rasheed,et al.  A Visualization Architecture for Collaborative Analytical and Data Provenance Activities , 2013, 2013 17th International Conference on Information Visualisation.

[21]  Andreas Holzinger,et al.  On Computationally-Enhanced Visual Analysis of Heterogeneous Data and Its Application in Biomedical Informatics , 2014, Interactive Knowledge Discovery and Data Mining in Biomedical Informatics.

[22]  Jeffrey Heer,et al.  Self-Service Data Preparation: Research to Practice , 2018, IEEE Data Eng. Bull..

[23]  Jürgen Bernard,et al.  Visual-Interactive Segmentation of Multivariate Time Series , 2016, EuroVA@EuroVis.

[24]  Tobias Schreck,et al.  MotionExplorer: Exploratory Search in Human Motion Capture Data Based on Hierarchical Aggregation , 2013, IEEE Transactions on Visualization and Computer Graphics.

[25]  Cody Dunne,et al.  IDMVis: Temporal Event Sequence Visualization for Type 1 Diabetes Treatment Decision Support , 2019, IEEE Transactions on Visualization and Computer Graphics.

[26]  Heidrun Schumann,et al.  Visualization of Time-Oriented Data , 2011, Human-Computer Interaction Series.

[27]  Daniel A. Keim,et al.  Real-time visual analytics for event data streams , 2012, SAC '12.

[28]  Cynthia A. Brewer,et al.  ColorBrewer.org: An Online Tool for Selecting Colour Schemes for Maps , 2003 .

[29]  Tamara Munzner,et al.  A Nested Model for Visualization Design and Validation , 2009, IEEE Transactions on Visualization and Computer Graphics.

[30]  A. Schachat,et al.  A review of clinical trials of anti-VEGF agents for diabetic retinopathy , 2010, Graefe's Archive for Clinical and Experimental Ophthalmology.

[31]  Ben Shneiderman,et al.  LifeLines: using visualization to enhance navigation and analysis of patient records , 1998, AMIA.

[32]  Melanie Tory,et al.  Note-taking in co-located collaborative visual analytics: Analysis of an observational study , 2012, Inf. Vis..

[33]  Silvia Miksch,et al.  TimeCleanser: a visual analytics approach for data cleansing of time-oriented data , 2014, i-KNOW '14.

[34]  Sean D. Adrean,et al.  Consistent Long-Term Therapy of Neovascular Age-Related Macular Degeneration Managed by 50 or More Anti-VEGF Injections Using a Treat-Extend-Stop Protocol. , 2018, Ophthalmology.

[35]  Otto-von-Guericke Connecting Time-Oriented Data and Information to a Coherent Interactive Visualization , 2004 .

[36]  Viju Raghupathi,et al.  Big data analytics in healthcare: promise and potential , 2014, Health Information Science and Systems.

[37]  Sanjay Krishnan,et al.  Towards reliable interactive data cleaning: a user survey and recommendations , 2016, HILDA '16.

[38]  K. Falavarjani,et al.  Adverse events and complications associated with intravitreal injection of anti-VEGF agents: a review of literature , 2013, Eye.

[39]  Matthias Briel,et al.  Current use and costs of electronic health records for clinical trial research: a descriptive study. , 2019, CMAJ open.

[40]  Ben Shneiderman,et al.  Temporal Summaries: Supporting Temporal Categorical Searching, Aggregation and Comparison , 2009, IEEE Transactions on Visualization and Computer Graphics.

[41]  Sanjay Krishnan,et al.  Wisteria: Nurturing Scalable Data Cleaning Infrastructure , 2015, Proc. VLDB Endow..