Exploration of Interesting Dense Regions on Spatial Data

Nowadays, spatial data are ubiquitous in various fields of science, such as transportation and the social Web. A recent research direction in analyzing spatial data is to provide means for "exploratory analysis" of such data where analysts are guided towards interesting options in consecutive analysis iterations. Typically, the guidance component learns analyst's preferences using her explicit feedback, e.g., picking a spatial point or selecting a region of interest. However, it is often the case that analysts forget or don't feel necessary to explicitly express their feedback in what they find interesting. Our approach captures implicit feedback on spatial data. The approach consists of observing mouse moves (as a means of analyst's interaction) and also the explicit analyst's interaction with data points in order to discover interesting spatial regions with dense mouse hovers. In this paper, we define, formalize and explore Interesting Dense Regions (IDRs) which capture preferences of analysts, in order to automatically find interesting spatial highlights. Our approach involves a polygon-based abstraction layer for capturing preferences. Using these IDRs, we highlight points to guide analysts in the analysis process. We discuss the efficiency and effectiveness of our approach through realistic examples and experiments on Airbnb and Yelp datasets.

[1]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[2]  Prasad Deshpande,et al.  Spatio-Temporal Indexing: Current Scenario, Challenges and Approaches , 2012, COMAD.

[3]  M. Bevis,et al.  Locating a point on a spherical surface relative to a spherical polygon of arbitrary shape , 1989 .

[4]  Antony Galton,et al.  What Is the Region Occupied by a Set of Points? , 2006, GIScience.

[5]  Carlos Eduardo Scheidegger,et al.  Nanocubes for Real-Time Exploration of Spatiotemporal Datasets , 2013, IEEE Transactions on Visualization and Computer Graphics.

[6]  Ahmed Eldawy,et al.  LARS: A Location-Aware Recommender System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[7]  Sihem Amer-Yahia,et al.  UserDEV: A Mixed-Initiative System for User Group Analytics , 2019, HILDA@SIGMOD.

[8]  Mao Lin Huang,et al.  Highlighting in Information Visualization: A Survey , 2010, 2010 14th International Conference Information Visualisation.

[9]  Shagun Jhaver,et al.  Algorithmic Anxiety and Coping Strategies of Airbnb Hosts , 2018, CHI.

[10]  Derya Birant,et al.  ST-DBSCAN: An algorithm for clustering spatial-temporal data , 2007, Data Knowl. Eng..

[11]  Michela Bertolotto,et al.  Analysis of implicit interest indicators for spatial data , 2007, GIS.

[12]  Jasbir S. Arora,et al.  Survey of multi-objective optimization methods for engineering , 2004 .

[13]  Jiawei Han,et al.  Discovering interesting patterns through user's interactive feedback , 2006, KDD '06.

[14]  John F. Roddick,et al.  Spatial, temporal and spatio-temporal databases - hot issues and directions for phd research , 2004, SGMD.

[15]  Avi Arampatzis,et al.  Web-based delineation of imprecise regions , 2006, Comput. Environ. Urban Syst..

[16]  Claus Rinner,et al.  Advances in Geographic Information Science , 2015 .

[17]  Mohamed F. Mokbel,et al.  Recommendations in location-based social networks: a survey , 2015, GeoInformatica.

[18]  George Valkanas,et al.  Understanding Within-Content Engagement through Pattern Analysis of Mouse Gestures , 2014, CIKM.

[19]  Jia Yu,et al.  Spatial data management in apache spark: the GeoSpark perspective and beyond , 2018, GeoInformatica.

[20]  Min Zhao,et al.  Unifying explicit and implicit feedback for collaborative filtering , 2010, CIKM.

[21]  Snehasis Mukhopadhyay,et al.  Interactive pattern mining on hidden data: a sampling-based solution , 2012, CIKM.

[22]  Behrooz Omidvar-Tehrani,et al.  SAGE: Interactive State-aware Point-of-Interest Recommendation , 2020 .

[23]  Jeffrey Heer,et al.  Scented Widgets: Improving Navigation Cues with Embedded Visualizations , 2007, IEEE Transactions on Visualization and Computer Graphics.

[24]  Mohamed-Jalal Fadili,et al.  Non-convex onion-peeling using a shape hull algorithm , 2004, Pattern Recognit. Lett..

[25]  Kanit Wongsuphasawat,et al.  Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations , 2016, IEEE Transactions on Visualization and Computer Graphics.

[26]  Anthony C. Robinson,et al.  Highlighting in Geovisualization , 2011 .

[27]  Arnab Nandi,et al.  Distributed and interactive cube exploration , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[28]  Jon Louis Bentley,et al.  Quad trees a data structure for retrieval on composite keys , 1974, Acta Informatica.

[29]  Francesco Bonchi,et al.  Cold start link prediction , 2010, KDD.

[30]  W. Bruce Croft The Importance of Interaction for Information Retrieval , 2019, SIGIR.

[31]  Alexandre Termier,et al.  Interactive User Group Analysis , 2015, CIKM.

[32]  Antony Galton,et al.  Efficient generation of simple polygons for characterizing the shape of a set of points in the plane , 2008, Pattern Recognit..

[33]  John Lee,et al.  Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System , 2016, Proc. VLDB Endow..

[34]  Stefan Wrobel,et al.  One click mining: interactive local pattern discovery through implicit preference and performance learning , 2013, IDEA@KDD.

[35]  Michela Bertolotto,et al.  Semantically Enriching VGI in Support of Implicit Feedback Analysis , 2011, W2GIS.

[36]  Olga Papaemmanouil,et al.  AIDE: An Active Learning-Based Approach for Interactive Data Exploration , 2016, IEEE Transactions on Knowledge and Data Engineering.

[37]  Behrooz Omidvar-Tehrani,et al.  GeoGuide: An Interactive Guidance Approach for Spatial Data , 2017, 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[38]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[39]  Evaggelia Pitoura,et al.  DisC diversity: result diversification based on dissimilarity and coverage , 2012, Proc. VLDB Endow..

[40]  John R. Anderson,et al.  What can a mouse cursor tell us more?: correlation of eye/mouse movements on web browsing , 2001, CHI Extended Abstracts.