Link prediction with node clustering coefficient

Predicting missing links in incomplete complex networks efficiently and accurately is still a challenging problem. The recently proposed Cannistrai–Alanis–Ravai (CAR) index shows the power of local link/triangle information in improving link-prediction accuracy. Inspired by the idea of employing local link/triangle information, we propose a new similarity index with more local structure information. In our method, local link/triangle structure information can be conveyed by clustering coefficient of common-neighbors directly. The reason why clustering coefficient has good effectiveness in estimating the contribution of a common-neighbor is that it employs links existing between neighbors of a common-neighbor and these links have the same structural position with the candidate link to this common-neighbor. In our experiments, three estimators: precision, AUP and AUC are used to evaluate the accuracy of link prediction algorithms. Experimental results on ten tested networks drawn from various fields show that our new index is more effective in predicting missing links than CAR index, especially for networks with low correlation between number of common-neighbors and number of links between common-neighbors.

[1]  M. Newman Clustering and preferential attachment in growing networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  P. Pin,et al.  Assessing the relevance of node features for network structure , 2008, Proceedings of the National Academy of Sciences.

[3]  Kathy P. Wheeler,et al.  Reviews of Modern Physics , 2013 .

[4]  Arthur W. Wetzel,et al.  Network anatomy and in vivo physiology of visual cortical neurons , 2011, Nature.

[5]  Pablo M. Gleiser,et al.  Community Structure in Jazz , 2003, Adv. Complex Syst..

[6]  A. Châtelain,et al.  The European Physical Journal D , 1999 .

[7]  Sid Redner,et al.  Networks: Teasing out the missing links , 2008, Nature.

[8]  Mischa Schwartz,et al.  ACM SIGCOMM computer communication review , 2001, CCRV.

[9]  Piotr J. Durka,et al.  Neuroinformatics , 2011, Bio Algorithms Med Syst..

[10]  H. White,et al.  “Structural Equivalence of Individuals in Social Networks” , 2022, The SAGE Encyclopedia of Research Design.

[11]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[12]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[13]  B. Zengin,et al.  Investigation of energy relaxation in 1-D nonlinear lattices by wavelets , 2012 .

[14]  Zhen Liu,et al.  Link prediction in complex networks: A local naïve Bayes model , 2011, ArXiv.

[15]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[16]  Linyuan Lü,et al.  Similarity index based on local paths for link prediction of complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  J. Brown Behavioral Ecology and Sociobiology , 2019, Encyclopedia of Animal Behavior.

[18]  A Díaz-Guilera,et al.  Self-similar community structure in a network of human interactions. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  C. Elton,et al.  The Journal of Animal Ecology. , 1936 .

[20]  O. Bagasra,et al.  Proceedings of the National Academy of Sciences , 1914, Science.

[21]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[22]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[23]  D. Hennig,et al.  Collective transport of coupled particles , 2012 .

[24]  S. Maybank,et al.  Knowledge and Information Systems REGULAR PAPER , 2006 .

[25]  J. Flynn John,et al.  ESM Appendix B: Tseng ZJ and Flynn JJ. An integrative method for testing form–function linkages and reconstructed evolutionary pathways of masticatory specialization. Journal of the Royal Society Interface , 2015 .

[26]  J. Herskowitz,et al.  Proceedings of the National Academy of Sciences, USA , 1996, Current Biology.

[27]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[28]  Jaideep Srivastava,et al.  Correlations between Community Structure and Link Formation in Complex Networks , 2013, PloS one.

[29]  B. Wang,et al.  Information filtering based on transferring similarity. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Wiley Interscience Journal of the American Society for Information Science and Technology , 2013 .

[31]  F. Göbel,et al.  Random walks on graphs , 1974 .

[32]  Ginestra Bianconi,et al.  Emergent Complex Network Geometry , 2014, Scientific Reports.

[33]  Jonathan L. Herlocker,et al.  Evaluating collaborative filtering recommender systems , 2004, TOIS.

[34]  Eric Bonabeau Advances in Complex Systems: Already a New Name! , 1998, Adv. Complex Syst..

[35]  Linyuan Lü,et al.  Toward link predictability of complex networks , 2015, Proceedings of the National Academy of Sciences.

[36]  Daniel A. Keim,et al.  Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining , 2002, KDD.

[37]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[38]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[39]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[40]  H. Dawah,et al.  Structure of the parasitoid communities of grass-feeding chalcid wasps , 1995 .

[41]  Dunja Mladenic,et al.  Proceedings of the 3rd international workshop on Link discovery , 2005, KDD 2005.

[42]  P. Holme,et al.  Role-similarity based functional prediction in networked systems: application to the yeast proteome , 2005, Journal of The Royal Society Interface.

[43]  Sunita Sarawagi,et al.  Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, August 24-27, 2008 , 2008, KDD.

[44]  Albert-László Barabási,et al.  Collective credit allocation in science , 2014, Proceedings of the National Academy of Sciences.

[45]  Tao Zhou,et al.  Evaluating network models: A likelihood analysis , 2011, ArXiv.

[46]  Ulrik Brandes,et al.  Social Networks , 2013, Handbook of Graph Drawing and Visualization.

[47]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[48]  Timothy Ravasi,et al.  From link-prediction in brain connectomes and protein interactomes to the local-community-paradigm in complex networks , 2013, Scientific Reports.

[49]  O. N. Garcia,et al.  Knowledge and Data Engineering: An Outlook , 1989 .

[50]  François Fouss,et al.  Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation , 2007, IEEE Transactions on Knowledge and Data Engineering.

[51]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[52]  Rolf Kötter,et al.  Online retrieval, processing, and visualization of primate connectivity data from the CoCoMac Database , 2007, Neuroinformatics.

[53]  Tao Zhou,et al.  Measuring multiple evolution mechanisms of complex networks , 2014, Scientific Reports.

[54]  Steve Gregory,et al.  Finding missing edges in networks based on their community structure , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[55]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.