Link prediction for tree-like networks.

Link prediction is the problem of predicting the location of either unknown or fake links from uncertain structural information of a network. Link prediction algorithms are useful in gaining insight into different network structures from partial observations of exemplars. However, existing link prediction algorithms only focus on regular complex networks and are overly dependent on either the closed triangular structure of networks or the so-called preferential attachment phenomenon. The performance of these algorithms on highly sparse or treelike networks is poor. In this letter, we proposed a method that is based on the network heterogeneity. We test our algorithms for three real large sparse networks: a metropolitan water distribution network, a Twitter network, and a sexual contact network. We find that our method is effective and performs better than traditional algorithms, especially for the Twitter network. We further argue that heterogeneity is the most obvious defining pattern for complex networks, while other statistical properties failed to be predicted. Moreover, preferential attachment based link prediction performed poorly and hence we infer that preferential attachment is not a plausible model for the genesis of many networks. We also suggest that heterogeneity is an important mechanism for online information propagation.

[1]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Mong-Li Lee,et al.  Community-based user recommendation in uni-directional social networks , 2013, CIKM.

[3]  P. Bearman,et al.  Chains of Affection: The Structure of Adolescent Romantic and Sexual Networks1 , 2004, American Journal of Sociology.

[4]  Michael Small,et al.  The role of direct links for link prediction in evolving networks , 2017 .

[5]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[6]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[7]  M. Randic,et al.  Resistance distance , 1993 .

[8]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[9]  S. Shen-Orr,et al.  Superfamilies of Evolved and Designed Networks , 2004, Science.

[10]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[11]  Igor M. Sokolov,et al.  Changing Correlations in Networks: Assortativity and Dissortativity , 2005 .

[12]  Tao Zhou,et al.  Link prediction in weighted networks: The role of weak ties , 2010 .

[13]  Tao Zhou,et al.  Solving the cold-start problem in recommender systems with social tags , 2010 .

[14]  Stanley Milgram,et al.  An Experimental Study of the Small World Problem , 1969 .

[15]  Jon M. Kleinberg,et al.  The link-prediction problem for social networks , 2007, J. Assoc. Inf. Sci. Technol..

[16]  Linyuan Lü,et al.  Toward link predictability of complex networks , 2015, Proceedings of the National Academy of Sciences.

[17]  Michael Small,et al.  Evolving networks—Using past structure to predict the future , 2016 .

[18]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[19]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[20]  R. Tsien,et al.  Specificity and Stability in Topology of Protein Networks , 2022 .

[21]  Louis K. Scheffer,et al.  A visual motion detection circuit suggested by Drosophila connectomics , 2013, Nature.

[22]  Giuseppe Sansonetti,et al.  Community Detection and Recommender Systems , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[23]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[24]  Arthur W. Wetzel,et al.  Network anatomy and in vivo physiology of visual cortical neurons , 2011, Nature.

[25]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[26]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[27]  Bao-qun Yin,et al.  Power-law strength-degree correlation from resource-allocation dynamics on weighted networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  M. Small,et al.  Growing optimal scale-free networks via likelihood. , 2015, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Michael Small,et al.  Fault prediction and modelling in transport networks , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[30]  Michael Small,et al.  Fitness networks for real world systems via modified preferential attachment , 2017 .

[31]  Sid Redner,et al.  Networks: Teasing out the missing links , 2008, Nature.

[32]  T. Sørensen,et al.  A method of establishing group of equal amplitude in plant sociobiology based on similarity of species content and its application to analyses of the vegetation on Danish commons , 1948 .

[33]  F. Chung,et al.  Eigenvalues of Random Power law Graphs , 2003 .

[34]  Giulio Cimini,et al.  Removing spurious interactions in complex networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Michael Small,et al.  Rich-club connectivity dominates assortativity and transitivity of complex networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Jon M. Kleinberg,et al.  Simplicial closure and higher-order link prediction , 2018, Proceedings of the National Academy of Sciences.

[37]  Linyuan Lü,et al.  Similarity index based on local paths for link prediction of complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[39]  P. Jaccard,et al.  Etude comparative de la distribution florale dans une portion des Alpes et des Jura , 1901 .

[40]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[41]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.