A Probabilistic Lifestyle-Based Trajectory Model for Social Strength Inference from Human Trajectory Data

With the pervasiveness of location-based social networks, it becomes increasingly important to consider the social characteristics of locations shared among persons. Several studies have been proposed to infer social strength by using trajectory similarity. However, these studies have two major shortcomings. First, they rely on the explicit co-occurrence of check-in locations. In this situation, a user pair of two friends who seldom share common locations or a user pair of two strangers who heavily share common visited locations will receive an unreliable estimation of the real social strength between them. Second, these studies do not consider how the overall trajectory patterns of users change with the varying of living styles. In this article, we propose a probabilistic generative model to mine latent lifestyle-related patterns from human trajectory data for inferring social strength. It can automatically learn functionality topics consisting of locations with similar service functions and transition probabilities over the set of functionality topics. Furthermore, a lifestyle is modeled as a unique transition probability matrix over the set of functionality topics. A user has a preference distribution over the set of lifestyles, and he or she is able to select over multiple lifestyles to adapt to different living contexts. The learned lifestyle-related patterns are subsequently used as features in a supervised learner for both strength estimation and link prediction. We conduct extensive experiments to evaluate the performance of the proposed method on two real-world datasets. The experimental results demonstrate the effectiveness of our proposed method.

[1]  Yang Liu,et al.  Who Influenced You? Predicting Retweet via Social Influence Locality , 2015, ACM Trans. Knowl. Discov. Data.

[2]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[3]  Brendan T. O'Connor,et al.  A Latent Variable Model for Geographic Lexical Variation , 2010, EMNLP.

[4]  Aram Galstyan,et al.  Socially Relevant Venue Clustering from Check-in Data , 2013 .

[5]  Alexander J. Smola,et al.  Like like alike: joint friendship and interest propagation in social networks , 2011, WWW.

[6]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[7]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[8]  Dieter Bögenhold Social Inequality and the Sociology of Life Style: Material and Cultural Aspects of Social Stratification , 2001 .

[9]  Jon M. Kleinberg,et al.  The Directed Closure Process in Hybrid Social-Information Networks, with an Analysis of Link Formation on Twitter , 2010, ICWSM.

[10]  Christian S. Jensen,et al.  Discovery of convoys in trajectory databases , 2008, Proc. VLDB Endow..

[11]  Beng Chin Ooi,et al.  Continuous Clustering of Moving Objects , 2007, IEEE Transactions on Knowledge and Data Engineering.

[12]  Nikos Mamoulis,et al.  Mining frequent spatio-temporal sequential patterns , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[13]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[14]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[15]  Chong Wang,et al.  Mining geographic knowledge using location aware topic model , 2007, GIR '07.

[16]  Daniel Gatica-Perez,et al.  A probabilistic approach to mining mobile phone data sequences , 2013, Personal and Ubiquitous Computing.

[17]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[18]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[19]  Andreas Krause,et al.  Data association for topic intensity tracking , 2006, ICML '06.

[20]  Ling Chen,et al.  Geo-SAGE: A Geographical Sparse Additive Generative Model for Spatial Item Recommendation , 2015, KDD.

[21]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[22]  Wei Chen,et al.  Diffusion of “Following” Links in Microblogging Networks , 2015, IEEE Transactions on Knowledge and Data Engineering.

[23]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[24]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[25]  Wei-Ying Ma,et al.  Recommending friends and locations based on individual location history , 2011, ACM Trans. Web.

[26]  Nicholas Jing Yuan,et al.  We know how you live: exploring the spectrum of urban lifestyles , 2013, COSN '13.

[27]  Mohamed F. Mokbel,et al.  Location-based and preference-aware recommendation using sparse geo-social networking data , 2012, SIGSPATIAL/GIS.

[28]  Abhijit Banerjee,et al.  Gossip: Identifying Central Individuals in a Social Network , 2014, ArXiv.

[29]  Tomoharu Iwata,et al.  Travel route recommendation using geotags in photo sharing sites , 2010, CIKM.

[30]  Jiawei Han,et al.  Geographical topic discovery and comparison , 2011, WWW.

[31]  Alexei Pozdnoukhov,et al.  Best Paper Award , 2011 .

[32]  Ashwin Machanavajjhala,et al.  Personalized Social Recommendations - Accurate or Private? , 2011, Proc. VLDB Endow..

[33]  Jon M. Kleinberg,et al.  Bursty and Hierarchical Structure in Streams , 2002, Data Mining and Knowledge Discovery.

[34]  Lars Schmidt-Thieme,et al.  Factorizing personalized Markov chains for next-basket recommendation , 2010, WWW '10.

[35]  Yan Liu,et al.  EBM: an entropy-based model to infer social strength from spatiotemporal data , 2013, SIGMOD '13.

[36]  Joachim Gudmundsson,et al.  Computing longest duration flocks in trajectory data , 2006, GIS '06.

[37]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[38]  P. Lazarsfeld,et al.  Friendship as Social process: a substantive and methodological analysis , 1964 .

[39]  Huiping Sun,et al.  CQArank: jointly model topics and expertise in community question answering , 2013, CIKM.

[40]  Aniket Kittur,et al.  Bridging the gap between physical location and online social networks , 2010, UbiComp.

[41]  Maguelonne Teisseire,et al.  The Pattern Next Door: Towards Spatio-sequential Pattern Discovery , 2012, PAKDD.

[42]  Xiaoming Fu,et al.  Mining triadic closure patterns in social networks , 2014, WWW.

[43]  Dino Pedreschi,et al.  Trajectory pattern mining , 2007, KDD '07.

[44]  Li Chen,et al.  Factorization vs. regularization: fusing heterogeneous social relationships in top-n recommendation , 2011, RecSys '11.

[45]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[46]  Michael R. Lyu,et al.  Fused Matrix Factorization with Geographical and Social Influence in Location-Based Social Networks , 2012, AAAI.

[47]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[48]  Hao Ma On measuring social friend interest similarities in recommender systems , 2014, SIGIR.

[49]  Xing Xie,et al.  Finding similar users using category-based location history , 2010, GIS '10.

[50]  Yizhou Sun,et al.  LCARS: a location-content-aware recommender system , 2013, KDD.

[51]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Cyrus Shahabi,et al.  Towards integrating real-world spatiotemporal data with social networks , 2011, GIS.

[53]  Huan Liu,et al.  Modeling temporal effects of human mobile behavior on location-based social networks , 2013, CIKM.

[54]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[55]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[56]  Ido Guy,et al.  Personalized social search based on the user's social network , 2009, CIKM.

[57]  Bruno Martins,et al.  Predicting future locations with hidden Markov models , 2012, UbiComp.

[58]  Chin-Wan Chung,et al.  A User Similarity Calculation Based on the Location for Social Network Services , 2011, DASFAA.

[59]  Tomoharu Iwata,et al.  Geo topic model: joint modeling of user's activity area and interests for location recommendation , 2013, WSDM.

[60]  Lei Chen,et al.  Finding time period-based most frequent path in big trajectory data , 2013, SIGMOD '13.

[61]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[62]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[63]  Gao Cong,et al.  COM: a generative model for group recommendation , 2014, KDD.

[64]  Panos Kalnis,et al.  On Discovering Moving Clusters in Spatio-temporal Data , 2005, SSTD.

[65]  Sergej Sizov,et al.  GeoFolk: latent spatial semantics in web 2.0 social media , 2010, WSDM '10.