Link Weight Prediction Using Supervised Learning Methods and Its Application to Yelp Layered Network

Real-world networks feature weights of interactions, where link weights often represent some physical attributes. In many situations, to recover the missing data or predict the network evolution, we need to predict link weights in a network. In this paper, we first proposed a series of new centrality indices for links in line graph. Then, utilizing these line graph indices, as well as a number of original graph indices, we designed three supervised learning methods to realize link weight prediction both in the networks of single layer and multiple layers, which perform much better than several recently proposed baseline methods. We found that the resource allocation index (RA) plays a more important role in the weight prediction than other topological properties, and the line graph indices are at least as important as the original graph indices in link weight prediction. In particular, the success application of our methods on Yelp layered network suggests that we can indeed predict the offline co-foraging behaviors of users just based on their online social interactions, which may open a new direction for link weight prediction algorithms, and meanwhile provide insights to design better restaurant recommendation systems.

[1]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[2]  Retantyo Wardoyo,et al.  Time Complexity Analysis of Support Vector Machines (SVM) in LibSVM , 2015 .

[3]  Qi Xuan,et al.  Growth model for complex networks with hierarchical and modular structures. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[5]  Jian Yang,et al.  The integration of weighted human gene association networks based on link prediction , 2017, BMC Systems Biology.

[6]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[7]  Tao Zhou,et al.  The H-index of a network node and its relation to degree and coreness , 2016, Nature Communications.

[8]  Qi Xuan,et al.  Temporal motifs reveal collaboration patterns in online task-oriented networks. , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Mason A. Porter,et al.  Multilayer networks , 2013, J. Complex Networks.

[10]  Gueorgi Kossinets Effects of missing data in social networks , 2006, Soc. Networks.

[11]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[12]  Bo Yang,et al.  Graph-based features for supervised link prediction , 2011, The 2011 International Joint Conference on Neural Networks.

[13]  Qi Xuan,et al.  Building it together: synchronous development in OSS , 2014, ICSE.

[14]  Jon M. Kleinberg,et al.  The Web as a Graph: Measurements, Models, and Methods , 1999, COCOON.

[15]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[16]  Shilpa Chakravartula,et al.  Complex Networks: Structure and Dynamics , 2014 .

[17]  Yuval Shavitt,et al.  A model of Internet topology using k-shell decomposition , 2007, Proceedings of the National Academy of Sciences.

[18]  Nitesh V. Chawla,et al.  New perspectives and methods in link prediction , 2010, KDD.

[19]  Rami Puzis,et al.  Link Prediction in Social Networks Using Computationally Efficient Topological Features , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[20]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[21]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[22]  Qi Xuan,et al.  Node matching between complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[24]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Jing Zhao,et al.  Prediction of Links and Weights in Networks by Reliable Routes , 2015, Scientific Reports.

[27]  Qi Xuan,et al.  Pinning control of clustered complex networks with different size , 2017 .

[28]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[29]  Jon M. Kleinberg,et al.  Romantic partnerships and the dispersion of social ties: a network analysis of relationship status on facebook , 2013, CSCW.

[30]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[31]  Dror Y. Kenett,et al.  Networks of networks – An introduction , 2015 .

[32]  P. Bonacich Power and Centrality: A Family of Measures , 1987, American Journal of Sociology.

[33]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[34]  Yin Zhang,et al.  Scalable proximity estimation and link prediction in online social networks , 2009, IMC '09.

[35]  C. Blakemore,et al.  Analysis of connectivity in the cat cerebral cortex , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[36]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[37]  W. Li,et al.  Statistical analysis of airport network of China. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Wang Bing-Hong,et al.  Node importance ranking of complex networks , 2013 .

[39]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[40]  Hsinchun Chen,et al.  Link prediction approach to collaborative filtering , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[41]  S. Strogatz Exploring complex networks , 2001, Nature.

[42]  S. Horvath,et al.  A General Framework for Weighted Gene Co-Expression Network Analysis , 2005, Statistical applications in genetics and molecular biology.

[43]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[44]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[45]  Alex Bavelas,et al.  Communication Patterns in Task‐Oriented Groups , 1950 .

[46]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[47]  Cecilia Mascolo,et al.  Exploiting place features in link prediction on location-based social networks , 2011, KDD.

[48]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[49]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[50]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[51]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[52]  Gérard Biau,et al.  Analysis of a Random Forests Model , 2010, J. Mach. Learn. Res..

[53]  Qi Xuan,et al.  Iterative node matching between complex networks , 2010 .

[54]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[55]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[56]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[57]  Vladimir Batagelj,et al.  An O(m) Algorithm for Cores Decomposition of Networks , 2003, ArXiv.

[58]  R. Z. Norman,et al.  Some properties of line digraphs , 1960 .

[59]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[60]  Petter Holme,et al.  Introduction to Temporal Network Epidemiology , 2017 .

[61]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[62]  Mei Liu,et al.  Prediction of protein-protein interactions using random decision forest framework , 2005, Bioinform..

[63]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[64]  Aaron Clauset,et al.  Learning Latent Block Structure in Weighted Networks , 2014, J. Complex Networks.

[65]  Yongxiang Xia,et al.  Weight prediction in complex networks based on neighbor set , 2016, Scientific Reports.

[66]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[67]  Jian-Guo Liu,et al.  Detecting community structure in complex networks via node similarity , 2010 .

[68]  Gueorgi Kossinets,et al.  Empirical Analysis of an Evolving Social Network , 2006, Science.

[69]  Conrado J. Pérez Vicente,et al.  Diffusion dynamics on multiplex networks , 2012, Physical review letters.

[70]  Bao-qun Yin,et al.  Power-law strength-degree correlation from resource-allocation dynamics on weighted networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[71]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[72]  Ricardo B. C. Prudêncio,et al.  Supervised link prediction in weighted networks , 2011, The 2011 International Joint Conference on Neural Networks.

[73]  Tao Zhou,et al.  Link prediction in weighted networks: The role of weak ties , 2010 .

[74]  Pierre-André Noël,et al.  Quantifying dynamical spillover in co-evolving multiplex networks , 2015, Scientific Reports.

[75]  Qi Xuan,et al.  Ranking Developer Candidates by Social Links , 2014, Adv. Complex Syst..

[76]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[77]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[78]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[79]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[80]  K. Kaski,et al.  Intensity and coherence of motifs in weighted complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.