Seed-Driven Geo-Social Data Extraction

Geo-social data has been an attractive source for a variety of problems such as mining mobility patterns, link prediction, location recommendation, and influence maximization. However, new geo-social data is increasingly unavailable and suffers several limitations. In this paper, we aim to remedy the problem of effective data extraction from geo-social data sources. We first identify the limitations of extracting geo-social data. To overcome the limitations, we propose a novel seed-driven approach that uses the points of one source as the seed to feed as queries for the others. We additionally handle differences between, and dynamics within the sources by proposing three variants for optimizing search radius. Furthermore, we provide an optimization based on recursive clustering to minimize the number of requests and an adaptive procedure to learn the specific data distribution of each source. Our comprehensive experiments with six popular sources show that our seed-driven approach yields 14.3 times more data overall, while our request-optimized algorithm retrieves up to 95% of the data with less than 16% of the requests. Thus, our proposed seed-driven approach set new standards for effective and efficient extraction of geo-social data.

[1]  Kazutoshi Sumiya,et al.  Measuring geographical regularities of crowd behaviors for Twitter-based geo-social event detection , 2010, LBSN '10.

[2]  Michael R. Lyu,et al.  STELLAR: Spatial-Temporal Latent Ranking for Successive Point-of-Interest Recommendation , 2016, AAAI.

[3]  Huan Liu,et al.  Content-Aware Point of Interest Recommendation on Location-Based Social Networks , 2015, AAAI.

[4]  Derek Ruths,et al.  Geolocation Prediction in Twitter Using Social Networks: A Critical Analysis and Review of Current Practice , 2015, ICWSM.

[5]  Kian-Lee Tan,et al.  Efficient location-aware influence maximization , 2014, SIGMOD Conference.

[6]  Jianxin Li,et al.  Geo-Social Influence Spanning Maximization , 2017, IEEE Transactions on Knowledge and Data Engineering.

[7]  Srikanta J. Bedathur,et al.  LoCaTe: Influence Quantification for Location Promotion in Location-based Social Networks , 2017, IJCAI.

[8]  Kyumin Lee,et al.  You are where you tweet: a content-based approach to geo-locating twitter users , 2010, CIKM.

[9]  Mao Ye,et al.  Location recommendation for out-of-town users in location-based social networks , 2013, CIKM.

[10]  Cecilia Mascolo,et al.  Distance Matters: Geo-social Metrics for Online Social Networks , 2010, WOSN.

[11]  Huan Liu,et al.  Exploring temporal effects for location recommendation on location-based social networks , 2013, RecSys.

[12]  Moditha Hewasinghage,et al.  Interpreting Reputation Through Frequent Named Entities in Twitter , 2017, WISE.

[13]  Franco Zambonelli,et al.  Extracting urban patterns from location-based social networks , 2011, LBSN '11.

[14]  Trevor Cohn,et al.  Mining user behaviours: a study of check-in patterns in location based social networks , 2013, WebSci.

[15]  Nikos Mamoulis,et al.  Geo-Social Co-location Mining , 2015, GeoRich@SIGMOD.

[16]  Andreas Züfle,et al.  Geo-Social Skyline Queries , 2014, DASFAA.

[17]  Yifeng Zeng,et al.  Personalized Ranking Metric Embedding for Next New POI Recommendation , 2015, IJCAI.

[18]  M. Goodchild,et al.  Spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr , 2013 .

[19]  Chi-Yin Chow,et al.  iGSLR: personalized geo-social location recommendation: a kernel density estimation approach , 2013, SIGSPATIAL/GIS.

[20]  Guanling Chen,et al.  Analysis of a Location-Based Social Network , 2009, 2009 International Conference on Computational Science and Engineering.

[21]  Gao Cong,et al.  An Experimental Evaluation of Point-of-interest Recommendation in Location-based Social Networks , 2017, Proc. VLDB Endow..

[22]  Hawoong Jeong,et al.  Statistical properties of sampled networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Hui Xiong,et al.  Point-of-Interest Recommendation in Location Based Social Networks with Topic and Location Awareness , 2013, SDM.

[24]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[25]  Gang Chen,et al.  Evaluating geo-social influence in location-based social networks , 2012, CIKM.

[26]  Toon Calders,et al.  Location Influence in Location-based Social Networks , 2017, WSDM.

[27]  Cecilia Mascolo,et al.  A Random Walk around the City: New Venue Recommendation in Location-Based Social Networks , 2012, 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing.

[28]  Jianliang Xu,et al.  Geo-social group queries with minimum acquaintance constraints , 2017, The VLDB Journal.

[29]  David Jurgens,et al.  That's What Friends Are For: Inferring Location in Online Social Media Platforms Based on Social Relationships , 2013, ICWSM.

[30]  Torben Bach Pedersen,et al.  Seed-Driven Geo-Social Data Extraction - Full Version , 2019, ArXiv.

[31]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[32]  Giuseppe Psaila,et al.  The Urban Nexus Approach for Analyzing Mobility in the Smart City: Towards the Identification of City Users Networking , 2018, Mob. Inf. Syst..

[33]  Toon Calders,et al.  Effective and efficient location influence mining in location-based social networks , 2018, Knowledge and Information Systems.

[34]  Hao Wang,et al.  Location recommendation in location-based social networks using user check-in data , 2013, SIGSPATIAL/GIS.

[35]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[36]  Shazia Wasim Sadiq,et al.  Discovering interpretable geo-social communities for user behavior prediction , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[37]  Stavros Papadopoulos,et al.  A General Framework for Geo-Social Query Processing , 2013, Proc. VLDB Endow..

[38]  Cecilia Mascolo,et al.  Measuring Urban Social Diversity Using Interconnected Geo-Social Networks , 2016, WWW.

[39]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[40]  Dan Cosley,et al.  Inferring social ties from geographic coincidences , 2010, Proceedings of the National Academy of Sciences.

[41]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[42]  Hui Xiong,et al.  POI Recommendation: A Temporal Matching between POI Popularity and User Regularity , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[43]  Yingshu Li,et al.  Using crowdsourced data in location-based social networks to explore influence maximization , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[44]  Fei Yu,et al.  Friend Recommendation Considering Preference Coverage in Location-Based Social Networks , 2017, PAKDD.

[45]  Huan Liu,et al.  Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose , 2013, ICWSM.