PAIRS: Privacy-Aware Identification and Recommendation of Spatio-Friends

Due to the prevalence of location-based services, it has now become possible to infer social connections between people by observing their spatial behaviors over time. Such spatial behaviors, if shared, can be utilized to identify and recommend friends for web-based social service users. However, such approaches cannot be implemented without solving two key challenges: (a) guaranteeing an individual's privacy in the shared spatiotemporal data, and (b) addressing the inherent sparseness of shared spatiotemporal data. In this paper, we propose a Privacy-Aware Identification and Recommendation of Spatio-Friends (PAIRS) approach, that can infer and recommend potential social connections by analyzing spatiotemporal information of social media users using robust privacy guarantee mechanisms. To achieve this, PAIRS constructs co-occurrence profiles using a cluster-based anchor representation to alleviate the sparseness of shared spatiotemporal information. It utilizes the diversity, time and weighted frequency-based inference to efficiently infer the strength of potential social connections from co-occurrence profile by reducing the negative impact of coincidences and thereby enhances accuracy. To tackle the privacy concerns, PAIRS sanitizes the cluster-based anchors, the location entropy values as well as the co-occurrence profile under differential privacy, including optimization mechanisms to handle trade-offs in utility and privacy. Extensive experiments are conducted with real-world datasets including both individuals' spatiotemporal data and their actual social connections. We confirm that our approach can achieve two often contradictory goals: a provable robust privacy protection for sharing data and an efficient social strength inference and spatio-friend identification mechanism. Specifically, PAIRS remains approximately 70% accuracy (precision) and 80% efficiency (recommendation potential) after perturbation.

[1]  Jie Wu,et al.  Privacy-Preserving Social Tie Discovery Based on Cloaked Human Trajectories , 2017, IEEE Trans. Veh. Technol..

[2]  Xing Xie,et al.  Learning travel recommendations from user-generated GPS traces , 2011, TIST.

[3]  Claude Castelluccia,et al.  Differentially private sequential data publication via variable-length n-grams , 2012, CCS.

[4]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[5]  Catuscia Palamidessi,et al.  Geo-indistinguishability: differential privacy for location-based systems , 2012, CCS.

[6]  Ben Y. Zhao,et al.  Sharing graphs using differentially private graph models , 2011, IMC '11.

[7]  Farnoush Banaei Kashani,et al.  Efficient Reachability Query Evaluation in Large Spatiotemporal Contact Datasets , 2012, Proc. VLDB Endow..

[8]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[9]  Dan Cosley,et al.  Inferring social ties from geographic coincidences , 2010, Proceedings of the National Academy of Sciences.

[10]  Cynthia Dwork,et al.  Differential Privacy , 2006, ICALP.

[11]  Aniket Kittur,et al.  Bridging the gap between physical location and online social networks , 2010, UbiComp.

[12]  Prateek Mittal,et al.  LinkMirage: Enabling Privacy-preserving Analytics on Social Relationships , 2016, NDSS.

[13]  Cyrus Shahabi,et al.  Towards integrating real-world spatiotemporal data with social networks , 2011, GIS.

[14]  Richard O. Sinnott,et al.  Protecting personal trajectories of social media users through differential privacy , 2017, Comput. Secur..

[15]  Dan Suciu,et al.  Boosting the accuracy of differentially private histograms through consistency , 2009, Proc. VLDB Endow..

[16]  Kunal Talwar,et al.  Mechanism Design via Differential Privacy , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[17]  Yan Liu,et al.  EBM: an entropy-based model to infer social strength from spatiotemporal data , 2013, SIGMOD '13.

[18]  Cynthia Dwork,et al.  Calibrating Noise to Sensitivity in Private Data Analysis , 2006, TCC.

[19]  Sofya Raskhodnikova,et al.  Analyzing Graphs with Node Differential Privacy , 2013, TCC.

[20]  Cyrus Shahabi,et al.  Differentially private publication of location entropy , 2016, SIGSPATIAL/GIS.

[21]  Panos Kalnis,et al.  Location Diversity: Enhanced Privacy Protection in Location Based Services , 2009, LoCA.

[22]  Junzhong Gu,et al.  Link Prediction Using Protected Location History , 2013, 2013 International Conference on Computational and Information Sciences.

[23]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[24]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[25]  Xing Xie,et al.  Mining user similarity based on location history , 2008, GIS '08.

[26]  Divesh Srivastava,et al.  DPT: Differentially Private Trajectory Synthesis Using Hierarchical Reference Systems , 2015, Proc. VLDB Endow..

[27]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.