Predicting missing links via effective paths

Recently, in complex network, link prediction has brought a surge of researches, among which similarity based link prediction outstandingly gains considerable success, especially similarity in terms of paths. In investigation of paths based similarity, we find that the effective influence of endpoints and strong connectivity make paths contribute more similarity between two unconnected endpoints, leading to a more accurate link prediction. Accordingly, we propose a so-called effective path index (EP) in this paper to leverage effective influence of endpoints and strong connectivity in similarity calculation. For demonstrating excellence of our index, the comparisons with six mainstream indices are performed on experiments in 15 real datasets and results show a great improvement of performance via our index.

[1]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[2]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[3]  Alexis Papadimitriou,et al.  Fast and accurate link prediction in social networking systems , 2012, J. Syst. Softw..

[4]  Carlos Melián,et al.  FOOD WEB COHESION , 2004 .

[5]  Guang Chen,et al.  Alleviating bias leads to accurate and personalized recommendation , 2013 .

[6]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[7]  Timothy Ravasi,et al.  From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks , 2013, Scientific Reports.

[8]  Bo Hu,et al.  Efficient routing on complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  M. Newman,et al.  Vertex similarity in networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[11]  Hiroshi Mamitsuka,et al.  Mining from protein–protein interactions , 2012, WIREs Data Mining Knowl. Discov..

[12]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  S. N. Dorogovtsev,et al.  Pseudofractal scale-free web. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[15]  Tore Opsahl,et al.  Clustering in weighted networks , 2009, Soc. Networks.

[16]  Zimo Yang,et al.  Anchoring bias in online voting , 2012, ArXiv.

[17]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[18]  Chuang Liu,et al.  Gravity Effects on Information Filtering and Network Evolving , 2013, PloS one.

[19]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[20]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[21]  Sophie Ahrens,et al.  Recommender Systems , 2012 .

[22]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[23]  Norman P. Hummon,et al.  Connectivity in a citation network: The development of DNA theory☆ , 1989 .

[24]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[25]  Ciro Cattuto,et al.  What's in a crowd? Analysis of face-to-face behavioral networks , 2010, Journal of theoretical biology.

[26]  D. Bu,et al.  Topological structure analysis of the protein-protein interaction network in budding yeast. , 2003, Nucleic acids research.

[27]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[28]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[29]  Linyuan Lü,et al.  Similarity index based on local paths for link prediction of complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Linyuan Lu,et al.  Link prediction based on local random walk , 2010, 1001.2467.

[31]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[33]  Marko Bajec,et al.  Self-similar scaling of density in complex real-world networks , 2011, ArXiv.

[34]  Ratul Mahajan,et al.  Measuring ISP topologies with rocketfuel , 2002, TNET.

[35]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[36]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[37]  Yi-Cheng Zhang,et al.  Tag-Aware Recommender Systems: A State-of-the-Art Survey , 2011, Journal of Computer Science and Technology.

[38]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.