Regular Activity Patterns in Spatio-Temporal Events Databases: Multi-Scale Extraction of Geolocated Tweets

This paper proposes a new technique for the extraction of regular activity patterns at different scales (resolution levels), mined from the microblogging platform Twitter. The approach is based on the recursive application of the DBSCAN clustering algorithm to the geolocated Twitter feed. The proposed technique includes a novel way to obtain ’averaged’ regular activity zones based on the rasterization and aggregation of the Concave Hull of the clusters identified at each resolution level. This technique uses only the spatio-temporal characteristics of the geolocated Twitter feed and does not depend on the data content; therefore it can be extended to work with different spatio-temporal event sources such as mobile telephone records. An experiment was carried out to demonstrate the effectiveness of our technique in the extraction of known activity patterns in the Mexico City Metropolitan Area. Palabras clave: social media, urban activity, geographic data mining.

[1]  Kazutoshi Sumiya,et al.  Measuring geographical regularities of crowd behaviors for Twitter-based geo-social event detection , 2010, LBSN '10.

[2]  Lingjuan Li,et al.  Research on Clustering Algorithm and Its Parallelization Strategy , 2011, 2011 International Conference on Computational and Information Sciences.

[3]  Kazutoshi Sumiya,et al.  Detection of Unusually Crowded Places through Micro-Blogging Sites , 2010, 2010 IEEE 24th International Conference on Advanced Information Networking and Applications Workshops.

[4]  Daniel Villatoro,et al.  From Tweets to Semantic Trajectories: Mining Anomalous Urban Mobility Patterns , 2013, CitiSens.

[5]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[6]  Kazutoshi Sumiya,et al.  Discovery of unusual regional social activities using geo-tagged microblogs , 2011, World Wide Web.

[7]  Leysia Palen,et al.  Chatter on the red: what hazards threat reveals about the social life of microblogged information , 2010, CSCW '10.

[8]  Ickjai Lee,et al.  AMOEBA: HIERARCHICAL CLUSTERING BASED ON SPATIAL PROXIMITY USING DELAUNATY DIAGRAM , 2000 .

[10]  Víctor Soto,et al.  Characterizing Urban Landscapes Using Geolocated Tweets , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[11]  Max L. Wilson,et al.  Searching Twitter: Separating the Tweet from the Chaff , 2011, ICWSM.

[12]  Manuel Suárez,et al.  Is Mexico City Polycentric? A Trip Attraction Capacity Approach , 2009 .

[13]  Dongman Lee,et al.  What's Happening: Finding Spontaneous User Clusters Nearby Using Twitter , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[14]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[15]  Herbert Edelsbrunner,et al.  Smooth Surfaces for Multi-Scale Shape Representation , 1995, FSTTCS.

[16]  H. Raghav Rao,et al.  Information control and terrorism: Tracking the Mumbai terrorist attack through twitter , 2011, Inf. Syst. Frontiers.

[17]  Aftab A. Khan THE ROLE SOCIAL OF MEDIA AND MODERN TECHNOLOGY IN ARABS SPRING , 2012 .

[18]  Vipin Kumar,et al.  Chameleon: Hierarchical Clustering Using Dynamic Modeling , 1999, Computer.

[19]  Slava Kisilevich,et al.  Spatio-temporal clustering , 2010, Data Mining and Knowledge Discovery Handbook.

[20]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[21]  Anthony Stefanidis,et al.  #Earthquake: Twitter as a Distributed Sensor System , 2013, Trans. GIS.

[22]  Dongman Lee,et al.  EventRadar: A Real-Time Local Event Detection Scheme Using Twitter Stream , 2012, 2012 IEEE International Conference on Green Computing and Communications.

[23]  Wael Khreich,et al.  A Survey of Techniques for Event Detection in Twitter , 2015, Comput. Intell..

[24]  Vanessa Frías-Martínez,et al.  Spectral clustering for sensing urban land use using Twitter activity , 2014, Engineering applications of artificial intelligence.