Investigating Link Inference in Partially Observable Networks: Friendship Ties and Interaction

While privacy-preserving mechanisms, such as hiding one's friends list, may be available to withhold personal information on online social networking sites, it is not obvious whether, and to what degree, a user's social behavior renders such an attempt futile. In this paper, we study how additional interaction information affects the inference of links between nodes in partially covert networks. The investigation rests on the assumption that interaction may be a proxy for connectivity patterns in online social networks. For this purpose, we use data collected from 586 Facebook profiles, consisting of friendship ties (conceptualized as the network) and comments on wall posts (serving as interaction information) by a total of 64,000 users. The link-inference problem is formulated as a binary classification problem using a comprehensive set of features and multiple supervised learning algorithms. Our results suggest that interactions reiterate the information contained in friendship ties sufficiently well to serve as a proxy when the majority of a network is unobserved.
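To illustrate the formulation of link inference as binary classification, the following is a minimal sketch and not the feature set or the learning algorithms used in the paper. It assumes a hypothetical toy network in which each candidate pair is described by three illustrative features (common observed friends, Jaccard overlap of observed friend sets, and number of exchanged comments) and is classified by a small logistic regression trained with stochastic gradient descent. All names, data, and parameter values are assumptions made for illustration.

```python
import math

def pair_features(u, v, observed_friends, comment_counts):
    """Illustrative features for a candidate pair (u, v); not the paper's feature set."""
    nu = observed_friends.get(u, set())
    nv = observed_friends.get(v, set())
    common = len(nu & nv)                      # common observed friends
    union = len(nu | nv)
    jaccard = common / union if union else 0.0  # overlap of observed friend sets
    interactions = comment_counts.get(frozenset((u, v)), 0)  # comments exchanged
    return [float(common), jaccard, float(interactions)]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict_proba(x, w, b):
    """Estimated probability that the pair is linked."""
    return sigmoid(b + sum(wj * xj for wj, xj in zip(w, x)))

def train_logistic(X, y, lr=0.1, epochs=500):
    """Plain stochastic gradient descent on the logistic loss."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            err = predict_proba(xi, w, b) - yi
            w = [wj - lr * err * xj for wj, xj in zip(w, xi)]
            b -= lr * err
    return w, b

# Hypothetical toy data: the observed part of the friendship network.
# Ties between the candidate pairs below are treated as hidden.
observed_friends = {
    "a": {"c", "d"}, "b": {"c", "d"},
    "c": {"a", "b"}, "d": {"a", "b"},
    "e": {"f"}, "f": {"e"},
}
# Hypothetical comment counts between pairs of users.
comment_counts = {
    frozenset(("a", "b")): 4,
    frozenset(("c", "d")): 2,
    frozenset(("c", "e")): 1,
}
# Candidate pairs with ground-truth labels (1 = hidden tie exists, 0 = no tie).
labeled_pairs = [
    (("a", "b"), 1), (("c", "d"), 1),
    (("a", "e"), 0), (("b", "f"), 0), (("c", "e"), 0),
]

X = [pair_features(u, v, observed_friends, comment_counts) for (u, v), _ in labeled_pairs]
y = [label for _, label in labeled_pairs]
w, b = train_logistic(X, y)

p_linked = predict_proba(pair_features("a", "b", observed_friends, comment_counts), w, b)
p_unlinked = predict_proba(pair_features("a", "e", observed_friends, comment_counts), w, b)
print(p_linked > p_unlinked)
```

In this sketch, the interaction count enters simply as one more feature next to the structural ones, which reflects the idea that interaction can stand in for connectivity when friendship ties are largely unobserved. A realistic study would evaluate on held-out pairs and account for the strong class imbalance between linked and unlinked pairs.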

[1]  Francesco Bonchi,et al.  Cold start link prediction , 2010, KDD.

[2]  Krishna P. Gummadi,et al.  Understanding and Specifying Social Access Control Lists , 2014, SOUPS.

[3]  Matthieu Latapy,et al.  Efficient Measurement of Complex Networks Using Link Queries , 2009, IEEE INFOCOM Workshops 2009.

[4]  S. Feld The Focused Organization of Social Ties , 1981, American Journal of Sociology.

[5]  Lise Getoor,et al.  Using Friendship Ties and Family Circles for Link Prediction , 2008, SNAKDD.

[6]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[7]  Rossano Schifanella,et al.  Friendship prediction and homophily in social media , 2012, TWEB.

[8]  Malik Magdon-Ismail,et al.  Finding Overlapping Communities in Social Networks , 2010, 2010 IEEE Second International Conference on Social Computing.

[9]  Alexander J. Smola,et al.  Like like alike: joint friendship and interest propagation in social networks , 2011, WWW.

[10]  Chenhao Tan,et al.  On the Interplay between Social and Topical Structure , 2011, ICWSM.

[11]  Ling Huang,et al.  Joint Link Prediction and Attribute Inference Using a Social-Attribute Network , 2014, TIST.

[12]  Ben Taskar,et al.  Link Prediction in Relational Data , 2003, NIPS.

[13]  Renaud Lambiotte,et al.  Predicting links in ego-networks using temporal information , 2015, EPJ Data Science.

[14]  Jiawei Han,et al.  A Unified Framework for Link Recommendation Using Random Walks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[15]  LeskovecJure,et al.  Discovering social circles in ego networks , 2014 .

[16]  Chapitre 8. Travail et travailleurs de la donnée , 2015, Big Data.

[17]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[18]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[19]  Bo Yang,et al.  Graph-based features for supervised link prediction , 2011, The 2011 International Joint Conference on Neural Networks.

[20]  Padhraic Smyth,et al.  Prediction and ranking algorithms for event-based network data , 2005, SKDD.

[21]  Krishna P. Gummadi,et al.  Analyzing facebook privacy settings: user expectations vs. reality , 2011, IMC '11.

[22]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[23]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[24]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[25]  F. Hamprecht,et al.  One Plus One Makes Three (for Social Networks) , 2012, PloS one.

[26]  Jure Leskovec,et al.  Discovering social circles in ego networks , 2012, ACM Trans. Knowl. Discov. Data.

[27]  Jimeng Sun,et al.  Confluence: conformity influence in large social networks , 2013, KDD.

[28]  Ulrik Brandes,et al.  Link prediction with social vector clocks , 2013, KDD.

[29]  Purnamrita Sarkar,et al.  Theoretical Justification of Popular Link Prediction Heuristics , 2011, IJCAI.

[30]  Georg Simmel Soziologie: Untersuchungen Über Die Formen Der Vergesellschaftung , 2009 .

[31]  Nitesh V. Chawla,et al.  New perspectives and methods in link prediction , 2010, KDD.

[32]  Jens Grossklags,et al.  Third-party apps on Facebook: privacy and the illusion of control , 2011, CHIMIT '11.

[33]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[34]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[35]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[36]  Mehwish Nasim,et al.  On commenting behavior of Facebook users , 2013, HT.