A Survey on Spatiotemporal and Semantic Data Mining

The wide proliferation of GPS-enabled mobile devices and the rapid development of sensing technology have nurtured explosive growth of semantics- enriched spatiotemporal (SeST) data. Compared to traditional spatiotemporal data like GPS traces and RFID data, SeST data is multidimensional in nature as each SeST object involves location, time, and text. On one hand, mining spatiotemporal knowledge from SeST data brings new opportunities to improving applications like location recommendation, event detection, and urban planning. On the other hand, SeST data also introduces new challenges that have led to the developments of various techniques tailored for mining SeST information. In this survey, we summarize state-of-the-art studies on knowledge discovery from SeST data. Specifically, we first identify the key challenges and data representations for mining SeST data. Then we introduce major mining tasks and how SeST information is leveraged in existing studies. Finally, we provide an overall picture of this research area and an outlook on several future directions of it. We anticipate this survey to provide readers with an overall picture of the state-of-the-art research in this area and to help them generate high-quality work.

[1]  Lidan Shou,et al.  Splitter: Mining Fine-Grained Sequential Patterns in Semantic Trajectories , 2014, Proc. VLDB Endow..

[2]  Raffaele Perego,et al.  On planning sightseeing tours with TripBuilder , 2015, Inf. Process. Manag..

[3]  Shashi Shekhar,et al.  Spatiotemporal Data Mining: A Computational Perspective , 2015, ISPRS Int. J. Geo Inf..

[4]  David J. Crandall,et al.  Beyond co-occurrence: discovering and visualizing tag relationships from geo-spatial and temporal similarities , 2012, WSDM '12.

[5]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[6]  Ling Chen,et al.  Event detection from flickr data through wavelet-based spatial analysis , 2009, CIKM.

[7]  Yu Zheng,et al.  U-Air: when urban air quality inference meets big data , 2013, KDD.

[8]  Liang Zhao,et al.  Spatiotemporal Event Forecasting in Social Media , 2015, SDM.

[9]  Ari Rappoport,et al.  What's in a hashtag?: content based prediction of the spread of ideas in microblogging communities , 2012, WSDM '12.

[10]  Nadia Magnenat-Thalmann,et al.  Time-aware point-of-interest recommendation , 2013, SIGIR.

[11]  Cyrus Shahabi,et al.  Crowd sensing of traffic anomalies based on human mobility and social media , 2013, SIGSPATIAL/GIS.

[12]  Krzysztof Janowicz,et al.  On the semantic annotation of places in location-based social networks , 2011, KDD.

[13]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[14]  Gao Cong,et al.  A general graph-based model for recommendation in event-based social networks , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[15]  Shashi Shekhar,et al.  Cascading Spatio-Temporal Pattern Discovery , 2012, IEEE Transactions on Knowledge and Data Engineering.

[16]  Nicholas Jing Yuan,et al.  You Are Where You Go: Inferring Demographic Attributes from Location Check-ins , 2015, WSDM.

[17]  Eric Horvitz,et al.  Eyewitness: identifying local events via space-time signals in twitter feeds , 2015, SIGSPATIAL/GIS.

[18]  Chung-Hong Lee,et al.  Mining spatio-temporal information on microblogging streams using a density-based online clustering method , 2012, Expert Syst. Appl..

[19]  Wang-Chien Lee,et al.  Semantic Annotation of Mobility Data using Social Media , 2015, WWW.

[20]  Geert-Jan Houben,et al.  Placing images on the world map: a microblog-based enrichment approach , 2012, SIGIR '12.

[21]  Clare R. Voss,et al.  ClusType: Effective Entity Recognition and Typing by Relation Phrase-Based Clustering , 2015, KDD.

[22]  Archan Misra,et al.  TODMIS: mining communities from trajectories , 2013, CIKM.

[23]  Daniel Gatica-Perez,et al.  Discovering routines from large-scale human locations using probabilistic topic models , 2011, TIST.

[24]  Luming Zhang,et al.  GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media , 2016, KDD.

[25]  Tao Mei,et al.  When recommendation meets mobile: contextual and personalized recommendation on the go , 2011, UbiComp '11.

[26]  Hiroyuki Kitagawa,et al.  Online User Location Inference Exploiting Spatiotemporal Correlations in Social Streams , 2014, CIKM.

[27]  Gao Cong,et al.  Who, Where, When, and What , 2015, ACM Trans. Inf. Syst..

[28]  Jiawei Han,et al.  Mining Quality Phrases from Massive Text Corpora , 2015, SIGMOD Conference.