Design and Evaluation of WebGL-Based Heat Map Visualization for Big Point Data

Depicting a large number of points on a map may lead to overplotting and to a visual clutter. One of the widely accepted visualization methods that provides a good overview of a spatial distribution of a large number of points is a heat map. Interactions for efficient data exploration, such as zooming, filtering or parameters’ adjustments, are highly demanding on the heat map construction. This is true especially in the case of big data. In this paper, we focus on a novel approach of estimating the kernel density and heat map visualization by utilizing a graphical processing unit. We designed a web-based JavaScript library dedicated to heat map rendering and user interactions through WebGL. The designed library enables to render a heat map as an overlay over a background map provided by a third party API (e.g. Open Layers) in the scope of milliseconds, even for data size exceeding one million points. In order to validate our approach, we designed a demo application visualizing a car accident dataset in the Great Britain. The described solution proves fast rendering times (below 100 ms) even for dataset up to 1.5 million points and outperforms mainstream systems such as the Google Maps API, Leaflet heat map plugin or ESRI’s ArcGIS online. Such performance enables interactive adjustments of the heat map parameters required by various domain experts. The described implementation is a part of the WebGLayer open source information visualization library.

[1]  Balaji Vasan Srinivasan GPUML : Graphical processors for speeding up kernel machines , 2010 .

[2]  Lisa Tompson,et al.  The Utility of Hotspot Mapping for Predicting Spatial Patterns of Crime , 2008 .

[3]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[4]  Terry A. Slocum,et al.  Thematic cartography and geovisualization, 3rd Edition , 2008 .

[5]  Jean-Daniel Fekete,et al.  Interactive information visualization of a million items , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[6]  Tessa K Anderson,et al.  Kernel density estimation and K-means clustering to profile road accident hotspots. , 2009, Accident; analysis and prevention.

[7]  Edzer Pebesma,et al.  Spatial interpolation in massively parallel computing environments , 2011 .

[8]  Carlos Eduardo Scheidegger,et al.  Nanocubes for Real-Time Exploration of Spatiotemporal Datasets , 2013, IEEE Transactions on Visualization and Computer Graphics.

[9]  André Skupin,et al.  The world of geography: Visualizing a knowledge domain with cartographic means , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Charlton,et al.  Quantitative geography : perspectives on spatial data analysis by , 2001 .

[11]  Anthony C. Robinson,et al.  The geoviz toolkit: using component-oriented coordination methods for geographic visualization and analysis , 2011, Int. J. Geogr. Inf. Sci..

[12]  Allen Newell,et al.  The psychology of human-computer interaction , 1983 .

[13]  Darya Filippova,et al.  Visual Analytics for Transportation Incident Data Sets , 2009 .

[14]  Peter A. Rogerson,et al.  Spatial Analysis and GIS , 1994 .

[15]  Jason Dykes,et al.  Cartographic visualization: exploratory spatial data analysis with local indicators of spatial association using Tcl/Tk and cdv , 1998 .

[16]  J. Dykes,et al.  Exploring Road Incident Data with Heat Maps , 2011 .

[17]  Jeffrey Heer,et al.  imMens: Real‐time Visual Querying of Big Data , 2013, Comput. Graph. Forum.

[18]  R. J. Kuo,et al.  Integration of self-organizing feature maps neural network and genetic K-means algorithm for market segmentation , 2006, Expert Syst. Appl..

[19]  David O'Sullivan,et al.  Geographic Information Analysis , 2002 .

[20]  Leland Wilkinson,et al.  The History of the Cluster Heat Map , 2009 .