G-RoI

Geotagged data gathered from social media can be used to discover interesting locations visited by users called Places-of-Interest (PoIs). Since a PoI is generally identified by the geographical coordinates of a single point, it is hard to match it with user trajectories. Therefore, it is useful to define an area, called Region-of-Interest (RoI), to represent the boundaries of the PoI’s area. RoI mining techniques are aimed at discovering ROIs from PoIs and other data. Existing RoI mining techniques are based on three main approaches: predefined shapes, density-based clustering, and grid-based aggregation. This article proposes G-RoI, a novel RoI mining technique that exploits the indications contained in geotagged social media items to discover RoIs with a high accuracy. Experiments performed over a set of PoIs in Rome and Paris using social media geotagged data, demonstrate that G-RoI in most cases achieves better results than existing techniques. In particular, the mean F1 score is 0.34 higher than that obtained with the well-known DBSCAN algorithm in Rome RoIs and 0.23 higher in Paris RoIs.

[1]  Tat-Seng Chua,et al.  Mining Travel Patterns from Geotagged Photos , 2012, TIST.

[2]  David P. Dobkin,et al.  The quickhull algorithm for convex hulls , 1996, TOMS.

[3]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[5]  Tomoharu Iwata,et al.  Travel route recommendation using geotags in photo sharing sites , 2010, CIKM.

[6]  Jon M. Kleinberg,et al.  Mapping the world's photos , 2009, WWW '09.

[7]  Jiebo Luo,et al.  Diversified Trajectory Pattern Ranking in Geo-tagged Social Media , 2011, SDM.

[8]  Linlin You,et al.  Social data analysis framework in cloud and Mobility Analyzer for Smarter Cities , 2014, Proceedings of 2014 IEEE International Conference on Service Operations and Logistics, and Informatics.

[9]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[10]  Bart Braden The Surveyor's Area Formula , 1986 .

[11]  Xing Xie,et al.  Where to find my next passenger , 2011, UbiComp '11.

[12]  Nikos Mamoulis,et al.  Density-based place clustering in geo-social networks , 2014, SIGMOD Conference.

[13]  Ickjai Lee,et al.  Sequential pattern mining of geo-tagged photos with an arbitrary regions-of-interest detection method , 2014, Expert Syst. Appl..

[14]  Eugenio Cesario,et al.  Following soccer fans from geotagged tweets at FIFA World Cup 2014 , 2015, 2015 2nd IEEE International Conference on Spatial Data Mining and Geographical Knowledge Services (ICSDM).

[15]  Per Christian Hansen,et al.  Analysis of Discrete Ill-Posed Problems by Means of the L-Curve , 1992, SIAM Rev..

[16]  Constantinos Antoniou,et al.  Use of Geotagged Social Media in Urban Settings: Empirical Evidence on Its Potential from Twitter , 2015, 2015 IEEE 18th International Conference on Intelligent Transportation Systems.

[17]  Slava Kisilevich,et al.  A Novel Approach to Mining Travel Sequences Using Collections of Geotagged Photos , 2010, AGILE Conf..

[18]  Eugenio Cesario,et al.  Analyzing social media data to discover mobility patterns at EXPO 2015: Methodology and results , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[19]  Franco Zambonelli,et al.  Extracting urban patterns from location-based social networks , 2011, LBSN '11.

[20]  Ronald L. Graham,et al.  An Efficient Algorithm for Determining the Convex Hull of a Finite Planar Set , 1972, Inf. Process. Lett..

[21]  Slava Kisilevich,et al.  P-DBSCAN: a density based clustering algorithm for exploration and analysis of attractive areas using collections of geo-tagged photos , 2010, COM.Geo '10.

[22]  Georges Voronoi Nouvelles applications des paramètres continus à la théorie des formes quadratiques. Deuxième mémoire. Recherches sur les parallélloèdres primitifs. , 1908 .

[23]  Eugenio Cesario,et al.  Trajectory Pattern Mining for Urban Computing in the Cloud , 2017, IEEE Transactions on Parallel and Distributed Systems.

[24]  Ickjai Lee,et al.  Spatio-temporal Sequential Pattern Mining for Tourism Sciences , 2014, ICCS.

[25]  Evaggelos Spyrou,et al.  Analyzing Flickr metadata to extract location-based information and semantically organize its photo content , 2016, Neurocomputing.

[26]  Maurice van Keulen,et al.  Point of interest to region of interest conversion , 2013, SIGSPATIAL/GIS.