Modeling location-based social network data with area attraction and neighborhood competition

Modeling user check-in behavior helps us gain useful insights about venues as well as the users visiting them. These insights are important in urban planning and recommender system applications. Since check-in behavior is the result of multiple factors, this paper focuses on studying two venue related factors, namely, area attraction and neighborhood competition. The former refers to the ability of a spatial area covering multiple venues to collectively attract check-ins from users, while the latter represents the extent to which a venue can compete with other venues in the same area for check-ins. We first embark on empirical studies to ascertain the two factors using three datasets gathered from users and venues of three major cities, Singapore, Jakarta and New York City. We then propose the visitation by area attractiveness and neighborhood competition (VAN) model incorporating area attraction and neighborhood competition factors. Our VAN model is also extended to incorporate social homophily so as to further enhance its modeling power. We evaluate VAN model using real world datasets against various state-of-the-art baselines. The results show that VAN model outperforms the baselines in check-in prediction task and its performance is robust under different parameter settings.

[1]  Michael I. Jordan,et al.  Variational inference for Dirichlet process mixtures , 2006 .

[2]  Chunyan Miao,et al.  Exploiting Geographical Neighborhood Characteristics for Location Recommendation , 2014, CIKM.

[3]  Cecilia Mascolo,et al.  Geo-spotting: mining online location-based services for optimal retail store placement , 2013, KDD.

[4]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[5]  Nicu Sebe,et al.  The Death and Life of Great Italian Cities: A Mobile Phone Data Perspective , 2016, WWW.

[6]  Ee-Peng Lim,et al.  Mining Business Competitiveness from User Visitation Data , 2015, SBP.

[7]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[8]  Ee-Peng Lim,et al.  Modeling Check-In Behavior with Geographical Neighborhood Influence of Venues , 2017, ADMA.

[9]  Huan Liu,et al.  Mining Human Mobility in Location-Based Social Networks , 2015, Mining Human Mobility in Location-Based Social Networks.

[10]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[11]  Richang Hong,et al.  Point-of-Interest Recommendations: Learning Potential Check-ins from Friends , 2016, KDD.

[12]  Ee-Peng Lim,et al.  On Neighborhood Effects in Location-Based Social Networks , 2015, 2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT).

[13]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[14]  Michael R. Lyu,et al.  Mining Business Opportunities from Location-based Social Networks , 2017, SIGIR.

[15]  Eric Sun,et al.  Location3: How Users Share and Respond to Location-Based Data on Social , 2011, ICWSM.

[16]  Ying-Cheng Lai,et al.  Universal model of individual and population mobility on diverse spatial scales , 2017, Nature Communications.

[17]  Clodoveu A. Davis,et al.  Quality of Urban Life Index From Location-Based Social Networks Data: A Case Study in Belo Horizonte, Brazil , 2017 .

[18]  Lars Schmidt-Thieme,et al.  BPR: Bayesian Personalized Ranking from Implicit Feedback , 2009, UAI.

[19]  Hui Xiong,et al.  Link Graph Analysis for Business Site Selection , 2012, Computer.

[20]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[21]  Cecilia Mascolo,et al.  Where Businesses Thrive: Predicting the Impact of the Olympic Games on Local Retailers through Location-based Services Data , 2014, ICWSM.

[22]  Hui Xiong,et al.  Learning geographical preferences for point-of-interest recommendation , 2013, KDD.

[23]  Tao Mei,et al.  Shop-Type Recommendation Leveraging the Data from Social Media and Location-Based Services , 2016, ACM Trans. Knowl. Discov. Data.

[24]  Michael R. Lyu,et al.  SoRec: social recommendation using probabilistic matrix factorization , 2008, CIKM '08.

[25]  Ee-Peng Lim,et al.  A Business Zone Recommender System Based on Facebook and Urban Planning Data , 2016, ECIR.

[26]  Yong Liu,et al.  Your neighbors affect your ratings: on geographical neighborhood influence to rating prediction , 2014, SIGIR.

[27]  Peter A. Lachenbruch,et al.  Paired t Test , 2005 .

[28]  Tong Zhao,et al.  Leveraging Social Connections to Improve Personalized Ranking for Collaborative Filtering , 2014, CIKM.

[29]  Huan Liu,et al.  Data Analysis on Location-Based Social Networks , 2014 .

[30]  A. Vespignani,et al.  Competition among memes in a world with limited attention , 2012, Scientific Reports.

[31]  Ee-Peng Lim,et al.  Where is the Goldmine?: Finding Promising Business Locations through Facebook Data Analytics , 2016, HT.

[32]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[33]  Ole Winther,et al.  Bayesian Non-negative Matrix Factorization , 2009, ICA.

[34]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[35]  Albert-László Barabási,et al.  Limits of Predictability in Human Mobility , 2010, Science.

[36]  Daniele Quercia,et al.  Mining Urban Deprivation from Foursquare: Implicit Crowdsourcing of City Land Use , 2014, IEEE Pervasive Computing.

[37]  Margaret Martonosi,et al.  Human mobility modeling at metropolitan scales , 2012, MobiSys '12.

[38]  Clodoveu A. Davis,et al.  Could Data from Location-Based Social Networks Be Used to Support Urban Planning? , 2017, WWW.

[39]  Patrick Seemann,et al.  Matrix Factorization Techniques for Recommender Systems , 2014 .

[40]  Jun Zhang,et al.  Trade area analysis using user generated mobile location data , 2013, WWW '13.

[41]  Huan Liu,et al.  gSCorr: modeling geo-social correlations for new check-ins on location-based social networks , 2012, CIKM.

[42]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[43]  Huan Liu,et al.  Exploring Social-Historical Ties on Location-Based Social Networks , 2012, ICWSM.

[44]  Eric Sun,et al.  Location 3 : How Users Share and Respond to Location-Based Data on Social Networking Sites , 2011 .

[45]  Ee-Peng Lim,et al.  Attractiveness versus Competition: Towards an Unified Model for User Visitation , 2016, CIKM.

[46]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[47]  Daqing Zhang,et al.  Where is the Largest Market: Ranking Areas by Popularity from Location Based Social Networks , 2013, 2013 IEEE 10th International Conference on Ubiquitous Intelligence and Computing and 2013 IEEE 10th International Conference on Autonomic and Trusted Computing.

[48]  Rui Wang,et al.  Towards social user profiling: unified and discriminative influence model for inferring home locations , 2012, KDD.

[49]  Xiang Li,et al.  Advanced Data Mining and Applications (ADMA) , 2008, ADMA 2008.

[50]  D. Huff A Probabilistic Analysis of Shopping Center Trade Areas , 1963 .

[51]  David M. Blei,et al.  Modeling User Exposure in Recommendation , 2015, WWW.

[52]  Lars Backstrom,et al.  Find me if you can: improving geographical prediction with social and spatial proximity , 2010, WWW '10.

[53]  Chao Liu,et al.  Recommender systems with social regularization , 2011, WSDM '11.

[54]  Richard L. Church,et al.  Business Site Selection, Location Analysis and GIS , 2008 .