A Spatial-Temporal Topic Model for the Semantic Annotation of POIs in LBSNs

Semantic tags of points of interest (POIs) are a crucial prerequisite for location search, recommendation services, and data cleaning. However, most POIs in location-based social networks (LBSNs) are either tag-missing or tag-incomplete. This article aims to develop semantic annotation techniques to automatically infer tags for POIs. We first analyze two LBSN datasets and observe that there are two types of tags, category-related ones and sentimental ones, which have unique characteristics. Category-related tags are hierarchical, whereas sentimental ones are category-aware. All existing related work has adopted classification methods to predict high-level category-related tags in the hierarchy, but they cannot apply to infer either low-level category tags or sentimental ones. In light of this, we propose a latent-class probabilistic generative model, namely the spatial-temporal topic model (STM), to infer personal interests, the temporal and spatial patterns of topics/semantics embedded in users’ check-in activities, the interdependence between category-topic and sentiment-topic, and the correlation between sentimental tags and rating scores from users’ check-in and rating behaviors. Then, this learned knowledge is utilized to automatically annotate all POIs with both category-related and sentimental tags in a unified way. We conduct extensive experiments to evaluate the performance of the proposed STM on a real large-scale dataset. The experimental results show the superiority of our proposed STM, and we also observe that the real challenge of inferring category-related tags for POIs lies in the low-level ones of the hierarchy and that the challenge of predicting sentimental tags are those with neutral ratings.

[1]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[2]  Gao Cong,et al.  Who, Where, When, and What , 2015, ACM Trans. Inf. Syst..

[3]  Yue Lu,et al.  Latent aspect rating analysis on review text data: a rating regression approach , 2010, KDD.

[4]  Kyumin Lee,et al.  Exploring Millions of Footprints in Location Sharing Services , 2011, ICWSM.

[5]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[6]  Andrew McCallum,et al.  Topics over time: a non-Markov continuous-time model of topical trends , 2006, KDD '06.

[7]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[8]  Alexander J. Smola,et al.  Discovering geographical topics in the twitter stream , 2012, WWW.

[9]  Yiqun Liu,et al.  Do users rate or review?: boost phrase-level sentiment labeling with review-level sentiment classification , 2014, SIGIR.

[10]  Hua Lu,et al.  A unified model for stable and temporal topic detection from social media data , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[11]  John Krumm,et al.  Placer: semantic place labels from diary data , 2013, UbiComp.

[12]  Hongfei Yan,et al.  Comparing Twitter and Traditional Media Using Topic Models , 2011, ECIR.

[13]  Zhiting Hu,et al.  Dynamic User Modeling in Social Media Systems , 2015, TOIS.

[14]  Suresh Manandhar,et al.  Exploring user expertise and descriptive ability in community question answering , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[15]  Nadia Magnenat-Thalmann,et al.  Time-aware point-of-interest recommendation , 2013, SIGIR.

[16]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[17]  Grigory Begelman,et al.  Automated Tag Clustering: Improving search and exploration in the tag space , 2006 .

[18]  Hojung Cha,et al.  Autonomous place naming system using opportunistic crowdsensing and knowledge from crowdsourcing , 2013, 2013 ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN).

[19]  Vagelis Hristidis,et al.  Leveraging collaborative tagging for web item design , 2011, KDD.

[20]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[21]  Zi Huang,et al.  A temporal context-aware model for user behavior modeling in social media systems , 2014, SIGMOD Conference.

[22]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[23]  Hojung Cha,et al.  Automatically characterizing places with opportunistic crowdsensing using smartphones , 2012, UbiComp.

[24]  Huan Liu,et al.  Exploring temporal effects for location recommendation on location-based social networks , 2013, RecSys.

[25]  Jiawei Han,et al.  Geographical topic discovery and comparison , 2011, WWW.

[26]  Shazia Wasim Sadiq,et al.  Joint Modeling of User Check-in Behaviors for Point-of-Interest Recommendation , 2015, CIKM.

[27]  Hui Xiong,et al.  Learning geographical preferences for point-of-interest recommendation , 2013, KDD.

[28]  Nadia Magnenat-Thalmann,et al.  Who, where, when and what: discover spatio-temporal topics for twitter users , 2013, KDD.

[29]  Yizhou Sun,et al.  LCARS: a location-content-aware recommender system , 2013, KDD.

[30]  Krzysztof Janowicz,et al.  On the semantic annotation of places in location-based social networks , 2011, KDD.

[31]  James J. Little,et al.  Place Classification Using Visual Object Categorization and Global Information , 2011, 2011 Canadian Conference on Computer and Robot Vision.

[32]  Wolfram Burgard,et al.  Semantic Place Classification of Indoor Environments with Mobile Robots Using Boosting , 2005, AAAI.

[33]  Jimeng Sun,et al.  Cross-domain collaboration recommendation , 2012, KDD.

[34]  Martin Ester,et al.  ILDA: interdependent LDA model for learning latent aspects and their ratings from online product reviews , 2011, SIGIR.

[35]  Krzysztof Janowicz,et al.  An agenda for the next generation gazetteer: geographic information contribution and retrieval , 2009, GIS.

[36]  Ling Chen,et al.  Geo-SAGE: A Geographical Sparse Additive Generative Model for Spatial Item Recommendation , 2015, KDD.

[37]  Brian D. Davison,et al.  Tracking trends: incorporating term volume into temporal topic models , 2011, KDD.

[38]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[39]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[40]  GeunSik Jo,et al.  Collaborative filtering based on collaborative tagging for enhancing the quality of recommendation , 2010, Electron. Commer. Res. Appl..

[41]  Martin Ester,et al.  Spatial topic modeling in online social media for location recommendation , 2013, RecSys.

[42]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[43]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[44]  Zi Huang,et al.  Joint Modeling of Users' Interests and Mobility Patterns for Point-of-Interest Recommendation , 2015, ACM Multimedia.

[45]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[46]  Jing Yuan,et al.  A framework of traveling companion discovery on trajectory data streams , 2013, ACM Trans. Intell. Syst. Technol..

[47]  Huiping Sun,et al.  CQArank: jointly model topics and expertise in community question answering , 2013, CIKM.

[48]  Francisco C. Pereira,et al.  Semantics of Place: Ontology Enrichment , 2008, IBERAMIA.

[49]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[50]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.