Link prediction in directed social networks

In today’s online social networks, it has become essential to help newcomers as well as existing community members find new social contacts. In the scientific literature, this recommendation task is known as link prediction. Link prediction has important practical applications on social network platforms: it allows platform providers to recommend friends to their users, and it can be used to infer missing links in partially observed networks. A shortcoming of many existing link prediction methods is that they focus mostly on undirected graphs. This work addresses this gap by introducing link prediction methods and metrics for directed graphs. We compare well-known similarity metrics and assess their suitability for link prediction in directed social networks. We advance existing techniques and propose mining subgraph patterns, which are then used to predict links in networks such as GitHub, GooglePlus, and Twitter. Our results show that the proposed metrics and techniques yield more accurate predictions than metrics that do not account for the directed nature of the underlying networks.
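
To illustrate why edge direction can matter for similarity-based link prediction, the following minimal sketch contrasts a direction-aware score with the classic undirected common-neighbors count on a toy graph. It is not the method proposed in this work: the function names, the toy edge list, and the choice of a "two-path" score (the number of intermediaries w with u -> w and w -> v) are assumptions made purely for illustration.

```python
from collections import defaultdict


def build_graph(edges):
    """Return successor and predecessor sets for a directed edge list."""
    succ = defaultdict(set)
    pred = defaultdict(set)
    for u, v in edges:
        succ[u].add(v)
        pred[v].add(u)
    return succ, pred


def directed_two_path_score(succ, pred, u, v):
    """Count intermediaries w with u -> w and w -> v (illustrative score)."""
    return len(succ[u] & pred[v])


def undirected_common_neighbors(succ, pred, u, v):
    """Count common neighbors of u and v when edge direction is ignored."""
    neighbors_u = succ[u] | pred[u]
    neighbors_v = succ[v] | pred[v]
    return len((neighbors_u & neighbors_v) - {u, v})


# Hypothetical toy network: a follows b and d, b follows c, c follows d.
edges = [("a", "b"), ("b", "c"), ("a", "d"), ("c", "d")]
succ, pred = build_graph(edges)

# The directed score distinguishes the candidate a -> c from c -> a.
score_a_to_c = directed_two_path_score(succ, pred, "a", "c")
score_c_to_a = directed_two_path_score(succ, pred, "c", "a")

# The undirected score is symmetric and cannot tell the two apart.
undirected_ac = undirected_common_neighbors(succ, pred, "a", "c")
undirected_ca = undirected_common_neighbors(succ, pred, "c", "a")
```

In this toy case the direction-aware score is asymmetric across the two candidate links, while the undirected count assigns both the same value, which is the kind of information loss that direction-aware metrics aim to avoid.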
