HisRect: Features from Historical Visits and Recent Tweet for Co-Location Judgement

Social media users are increasingly going mobile, fostering location based services on social media platforms (e.g., Twitter). Services like friends notification and community detection benefit from co-location judgement that decides if two users are co-located in some point-of-interest (POI). This problem is challenging due to the limited information and the lack of explicit geo-tags in tweets that can be used as labeled data. Our approach is based on a novel concept HisRect features extracted from users' historical visits and recent tweets: The former has impacts on where a user visits in general, whereas the latter gives more hints about where a user is currently. As labeled data is scarce in practice, we design a semi-supervised learning (SSL) framework that leverages unlabeled data to extract HisRect features. Moreover, we employ an embedding neural network layer to process HisRect features of two users, which decides co-location based on the embedding difference between the two features. Our model is evaluated on large sets of real Twitter data from over one million users. The experimental results demonstrate that our HisRect features and SSL framework are highly effective at deciding co-locations. In terms of multiple metrics, our approach clearly outperforms alternative approaches using state-of-the-art techniques.

[1]  Chi-Yin Chow,et al.  LORE: exploiting sequential influence for location recommendations , 2014, SIGSPATIAL/GIS.

[2]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[3]  Themis Palpanas,et al.  Where has this tweet come from? Fast and fine-grained geolocalization of non-geotagged tweets , 2016, Social Network Analysis and Mining.

[4]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5]  Michael R. Lyu,et al.  Where You Like to Go Next: Successive Point-of-Interest Recommendation , 2013, IJCAI.

[6]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[7]  Weiqing Wang,et al.  TPM: A Temporal Personalized Model for Spatial Item Recommendation , 2018, ACM Trans. Intell. Syst. Technol..

[8]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[9]  Zhaohui Wu,et al.  Online Community Detection for Large Complex Networks , 2013, IJCAI.

[10]  Cecilia Mascolo,et al.  Mining User Mobility Features for Next Place Prediction in Location-Based Services , 2012, 2012 IEEE 12th International Conference on Data Mining.

[11]  Hao Wang,et al.  Adapting to User Interest Drift for POI Recommendation , 2016, IEEE Transactions on Knowledge and Data Engineering.

[12]  Huan Liu,et al.  Discovering Location Information in Social Media , 2015, IEEE Data Eng. Bull..

[13]  Andrew R. Barron,et al.  Approximation and estimation bounds for artificial neural networks , 2004, Machine Learning.

[14]  Navdeep Jaitly,et al.  Towards End-To-End Speech Recognition with Recurrent Neural Networks , 2014, ICML.

[15]  Weitong Chen,et al.  Learning Graph-based POI Embedding for Location-based Recommendation , 2016, CIKM.

[16]  Aron Culotta,et al.  Inferring the origin locations of tweets with quantitative confidence , 2013, CSCW.

[17]  Hua Lu,et al.  Finding Influential Local Users with Similar Interest from Geo-Tagged Social Media Data , 2017, 2017 18th IEEE International Conference on Mobile Data Management (MDM).

[18]  James Caverlee,et al.  A geographic study of tie strength in social media , 2011, CIKM '11.

[19]  Mor Naaman,et al.  On the Accuracy of Hyper-local Geotagging of Social Media Content , 2014, WSDM.

[20]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[21]  Mao Ye,et al.  Location recommendation for out-of-town users in location-based social networks , 2013, CIKM.

[22]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[23]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[24]  Ling Chen,et al.  SPORE: A sequential personalized spatial item recommender system , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[25]  Zhi Liu,et al.  SPOT: Locating Social Media Users Based on Social Network Context , 2014, Proc. VLDB Endow..

[26]  Sue Moon,et al.  Inferring Twitter user locations with 10 km accuracy , 2014, WWW.

[27]  Mark Dredze,et al.  Geolocation for Twitter: Timing Matters , 2016, NAACL.

[28]  Swapna S. Gokhale,et al.  Accurate Local Estimation of Geo-Coordinates for Social Media Posts , 2014, SEKE.

[29]  Jiawei Han,et al.  Bridging Collaborative Filtering and Semi-Supervised Learning: A Neural Approach for POI Recommendation , 2017, KDD.

[30]  Zhaohui Wu,et al.  Discovering different kinds of smartphone users through their application usage behaviors , 2016, UbiComp.

[31]  Hiroyuki Kitagawa,et al.  Online User Location Inference Exploiting Spatiotemporal Correlations in Social Streams , 2014, CIKM.

[32]  Zhaohui Wu,et al.  Mining User Attributes Using Large-Scale APP Lists of Smartphones , 2017, IEEE Systems Journal.

[33]  Ruslan Salakhutdinov,et al.  Revisiting Semi-Supervised Learning with Graph Embeddings , 2016, ICML.

[34]  Jianxin Li,et al.  Most Influential Community Search over Large Social Networks , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[35]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[36]  Jing Li,et al.  Efficient notification of meeting points for moving groups via independent safe regions , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[37]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[38]  Hua Lu,et al.  Location Inference for Non-Geotagged Tweets in User Timelines , 2019, IEEE Transactions on Knowledge and Data Engineering.

[39]  Arkaitz Zubiaga,et al.  Towards Real-Time, Country-Level Location Classification of Worldwide Tweets , 2016, IEEE Transactions on Knowledge and Data Engineering.

[40]  Yizhou Sun,et al.  Graph Regularized Transductive Classification on Heterogeneous Information Networks , 2010, ECML/PKDD.

[41]  David F. Gleich,et al.  A Correlation Clustering Framework for Community Detection , 2018, WWW.

[42]  Cyrus Shahabi,et al.  Spatial influence - measuring followship in the real world , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[43]  Wen-Chih Peng,et al.  Modeling User Mobility for Location Promotion in Location-based Social Networks , 2015, KDD.

[44]  Hua Lu,et al.  Finding top-k local users in geo-tagged social media data , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[45]  Aixin Sun,et al.  A Survey of Location Prediction on Twitter , 2017, IEEE Transactions on Knowledge and Data Engineering.

[46]  Gang Chen,et al.  Evaluating geo-social influence in location-based social networks , 2012, CIKM.

[47]  James Caverlee,et al.  Location prediction in social media based on tie strength , 2013, CIKM.

[48]  A. Barron Approximation and Estimation Bounds for Artificial Neural Networks , 1991, COLT '91.

[49]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[50]  Ling Chen,et al.  Geo-SAGE: A Geographical Sparse Additive Generative Model for Spatial Item Recommendation , 2015, KDD.

[51]  Tom M. Mitchell,et al.  PIDGIN: ontology alignment using web text as interlingua , 2013, CIKM.

[52]  Yizhou Sun,et al.  LCARS: a location-content-aware recommender system , 2013, KDD.

[53]  Wei Zhang,et al.  Location and Time Aware Social Collaborative Retrieval for New Successive Point-of-Interest Recommendation , 2015, CIKM.

[54]  Yizhou Sun,et al.  Task-Guided and Path-Augmented Heterogeneous Network Embedding for Author Identification , 2016, WSDM.

[55]  Themis Palpanas,et al.  Fine-grained geolocalisation of non-geotagged tweets , 2015, 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[56]  Ling Chen,et al.  Spatial-Aware Hierarchical Collaborative Deep Learning for POI Recommendation , 2017, IEEE Transactions on Knowledge and Data Engineering.

[57]  Jason Weston,et al.  Deep learning via semi-supervised embedding , 2008, ICML '08.

[58]  Nadia Magnenat-Thalmann,et al.  Who, where, when and what: discover spatio-temporal topics for twitter users , 2013, KDD.

[59]  Sheila Kinsella,et al.  "I'm eating a sandwich in Glasgow": modeling locations with tweets , 2011, SMUC '11.

[60]  Shazia Wasim Sadiq,et al.  Discovering interpretable geo-social communities for user behavior prediction , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[61]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.