Visualizing Confidence in Cluster-Based Ensemble Weather Forecast Analyses

In meteorology, cluster analysis is frequently used to determine representative trends in ensemble weather predictions in a selected spatio-temporal region, e.g., to reduce a set of ensemble members to simplify and improve their analysis. Identified clusters (i.e., groups of similar members), however, can be very sensitive to small changes of the selected region, so that clustering results can be misleading and bias subsequent analyses. In this article, we — a team of visualization scientists and meteorologists-deliver visual analytics solutions to analyze the sensitivity of clustering results with respect to changes of a selected region. We propose an interactive visual interface that enables simultaneous visualization of a) the variation in composition of identified clusters (i.e., their robustness), b) the variability in cluster membership for individual ensemble members, and c) the uncertainty in the spatial locations of identified trends. We demonstrate that our solution shows meteorologists how representative a clustering result is, and with respect to which changes in the selected region it becomes unstable. Furthermore, our solution helps to identify those ensemble members which stably belong to a given cluster and can thus be considered similar. In a real-world application case we show how our approach is used to analyze the clustering behavior of different regions in a forecast of “Tropical Cyclone Karl”, guiding the user towards the cluster robustness information required for subsequent ensemble analysis.

[1]  Manuel Menezes de Oliveira Neto,et al.  Overview and State-of-the-Art of Uncertainty Visualization , 2014, Scientific Visualization.

[2]  Matthew O. Ward,et al.  Hierarchical parallel coordinates for exploration of large datasets , 1999, Proceedings Visualization '99 (Cat. No.99CB37067).

[3]  Jaak Vilo,et al.  ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap , 2015, Nucleic Acids Res..

[4]  Alex T. Pang,et al.  Approaches to uncertainty visualization , 1996, The Visual Computer.

[5]  Rüdiger Westermann,et al.  Three-dimensional visualization of ensemble weather forecasts – Part 1: The visualization tool Met.3D (version 1.0) , 2015 .

[6]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[7]  Stefan Bruckner,et al.  Result-Driven Exploration of Simulation Parameter Spaces for Visual Effects Design , 2010, IEEE Transactions on Visualization and Computer Graphics.

[8]  Tim N. Palmer,et al.  Ensemble forecasting , 2008, J. Comput. Phys..

[9]  Friedrich Leisch,et al.  Neighborhood graphs, stripes and shadow plots for cluster visualization , 2010, Stat. Comput..

[10]  Rüdiger Westermann,et al.  Streamline Variability Plots for Characterizing the Uncertainty in Vector Field Ensembles , 2016, IEEE Transactions on Visualization and Computer Graphics.

[11]  Thomas Villmann,et al.  Stochastic neighbor embedding (SNE) for dimension reduction and visualization using arbitrary divergences , 2012, Neurocomputing.

[12]  Helwig Hauser,et al.  Angular brushing of extended parallel coordinates , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[13]  Edward M. Reingold,et al.  Graph drawing by force‐directed placement , 1991, Softw. Pract. Exp..

[14]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[15]  Cynthia A. Brewer,et al.  Guidance for representing uncertainty on global temperature change maps , 2016 .

[16]  Matthias Schonlau,et al.  The Clustergram: A Graph for Visualizing Hierarchical and Nonhierarchical Cluster Analyses , 2002 .

[17]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[18]  Hong Zhou,et al.  Visual Clustering in Parallel Coordinates , 2008, Comput. Graph. Forum.

[19]  Joydeep Ghosh,et al.  Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions , 2002, J. Mach. Learn. Res..

[20]  Kenneth I. Joy,et al.  Comparative Visual Analysis of Lagrangian Transport in CFD Ensembles , 2013, IEEE Transactions on Visualization and Computer Graphics.

[21]  Satoru Miyano,et al.  Open source clustering software , 2004 .

[22]  Ryan D. Torn,et al.  Diagnosis of the Source of GFS Medium-Range Track Errors in Hurricane Sandy (2012) , 2015 .

[23]  Allen Tannenbaum,et al.  Statistical shape analysis using kernel PCA , 2006, Electronic Imaging.

[24]  David L. Kao,et al.  Visualization techniques for spatial probability density function data , 2004, Data Sci. J..

[25]  Stefan Bruckner,et al.  Eurographics/ Ieee-vgtc Symposium on Visualization 2010 Isosurface Similarity Maps , 2022 .

[26]  Mark Gahegan,et al.  Visualizing Geospatial Information Uncertainty: What We Know and What We Need to Know , 2005 .

[27]  Eduard Gröller,et al.  Cupid: Cluster-Based Exploration of Geometry Generators with Parallel Coordinates and Radial Trees , 2014, IEEE Transactions on Visualization and Computer Graphics.

[28]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[29]  Boudewijn P. F. Lelieveldt,et al.  A new cluster validity index for the fuzzy c-mean , 1998, Pattern Recognit. Lett..

[30]  R. L. Thorndike Who belongs in the family? , 1953 .

[31]  Rüdiger Westermann,et al.  Visual Analysis of Spatial Variability and Global Correlations in Ensembles of Iso‐Contours , 2016, Comput. Graph. Forum.

[32]  Rüdiger Westermann,et al.  3-D visualization of ensemble weather forecasts – Part 2: Forecasting warm conveyor belt situations for aircraft-based field campaigns , 2015 .

[33]  Sarah C. Jones,et al.  Forecast Variability of the Blocking System over Russia in Summer 2010 and Its Impact on Surface Conditions , 2017 .

[34]  Rüdiger Westermann,et al.  3-D visualization of ensemble weather forecasts – Part 1: The visualization tool Met.3D (version 1.0) , 2015 .

[35]  Markus Hadwiger,et al.  Ovis: A Framework for Visual Analysisof Ocean Forecast Ensembles , 2014, IEEE Transactions on Visualization and Computer Graphics.

[36]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[37]  Florian Pappenberger,et al.  The TIGGE Project and Its Achievements , 2016 .

[38]  Sarah C. Jones,et al.  Predictability Associated with the Downstream Impacts of the Extratropical Transition of Tropical Cyclones: Methodology and a Case Study of Typhoon Nabi (2005) , 2008 .

[39]  David L. Kao,et al.  Visualizing spatial multivalue data , 2005, IEEE Computer Graphics and Applications.

[40]  Alfred Inselberg,et al.  The plane with parallel coordinates , 1985, The Visual Computer.

[41]  Thomas Nocke,et al.  Methods for the visualization of clustered climate data , 2004, Comput. Stat..

[42]  Antony Unwin,et al.  Comparing Clusterings Using Bertin's Idea , 2012, IEEE Transactions on Visualization and Computer Graphics.

[43]  Joe Michael Kniss,et al.  Visualizing Summary Statistics and Uncertainty , 2010, Comput. Graph. Forum.

[44]  W. Briggs Statistical Methods in the Atmospheric Sciences , 2007 .

[45]  Eduard Gröller,et al.  MObjects--A Novel Method for the Visualization and Interactive Exploration of Defects in Industrial XCT Data , 2013, IEEE Transactions on Visualization and Computer Graphics.

[46]  Dieter Schmalstieg,et al.  Comparative Analysis of Multidimensional, Quantitative Data , 2010, IEEE Transactions on Visualization and Computer Graphics.

[47]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[48]  John Methven,et al.  Ensemble prediction of transitions of the North Atlantic eddy‐driven jet , 2011 .

[49]  Helwig Hauser,et al.  Outlier-Preserving Focus+Context Visualization in Parallel Coordinates , 2006, IEEE Transactions on Visualization and Computer Graphics.

[50]  Vijay Natarajan,et al.  Multiscale Symmetry Detection in Scalar Fields by Clustering Contours , 2014, IEEE Transactions on Visualization and Computer Graphics.

[51]  Csaba Legány,et al.  Cluster validity measurement techniques , 2006 .

[52]  Timothy S. Newman,et al.  Directional Texture for Visualization - New Technique and Application Study , 2015, 2015 19th International Conference on Information Visualisation.

[53]  Alex T. Pang,et al.  Visualizing scalar volumetric data with uncertainty , 2002, Comput. Graph..

[54]  B. Everitt,et al.  Cluster Analysis: Everitt/Cluster Analysis , 2011 .

[55]  Leland Wilkinson,et al.  The History of the Cluster Heat Map , 2009 .

[56]  Guojun Gan,et al.  Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability) , 2007 .

[57]  Helwig Hauser,et al.  Visualization and Visual Analysis of Multifaceted Scientific Data: A Survey , 2013, IEEE Transactions on Visualization and Computer Graphics.

[58]  Rüdiger Westermann,et al.  Visualizing the Variability of Gradients in Uncertain 2D Scalar Fields , 2013, IEEE Transactions on Visualization and Computer Graphics.

[59]  Hamish A. Carr,et al.  On Histograms and Isosurface Statistics , 2006, IEEE Transactions on Visualization and Computer Graphics.

[60]  Sarah C. Jones,et al.  Characteristics of the TIGGE multimodel ensemble prediction system in representing forecast variability associated with extratropical transition , 2011 .

[61]  Yuan Luo,et al.  Cluster Visualization in Parallel Coordinates Using Curve Bundles , 2008 .

[62]  Paul Rosen,et al.  From Quantification to Visualization: A Taxonomy of Uncertainty Visualization Approaches , 2011, WoCoUQ.

[63]  Luis Angel García-Escudero,et al.  A review of robust clustering methods , 2010, Adv. Data Anal. Classif..

[64]  Hui Xiong,et al.  Adapting the right measures for K-means clustering , 2009, KDD.

[65]  Ross T. Whitaker,et al.  Contour Boxplots: A Method for Characterizing Uncertainty in Feature Sets from Simulation Ensembles , 2013, IEEE Transactions on Visualization and Computer Graphics.

[66]  Xi Wang,et al.  Clustering aggregation by probability accumulation , 2009, Pattern Recognit..

[67]  Sarah C. Jones,et al.  Predictability Associated with the Downstream Impacts of the Extratropical Transition of Tropical Cyclones: Case Studies , 2008 .

[68]  Munehiko Yamaguchi,et al.  Evaluation of Medium-Range Forecasts for Hurricane Sandy , 2014 .

[69]  S Miyano,et al.  Open source clustering software. , 2004, Bioinformatics.

[70]  Rüdiger Westermann,et al.  Time-Hierarchical Clustering and Visualization of Weather Forecast Ensembles , 2017, IEEE Transactions on Visualization and Computer Graphics.

[71]  R. Westermann,et al.  Three-dimensional visualization of ensemble weather forecasts – Part 1 : The visualization tool Met . 3 D ( version 1 . 0 ) , 2015 .

[72]  Bernhard Preim,et al.  Ieee Transactions on Visualization and Computer Graphics 1 Blood Flow Clustering and Applications in Virtual Stenting of Intracranial Aneurysms , 2022 .