Robustness of Link-prediction Algorithm Based on Similarity and Application to Biological Networks

Many algorithms have been proposed to predict missing links in a variety of real networks. These studies focus on mainly both accuracy and efficiency of these algorithms. However, little attention is paid to their robustness against either noise or irrationality of a link existing in almost all of real networks. In this paper, we investigate the robustness of several typical node-similarity-based algorithms and find that these algorithms are sensitive to the strength of noise. Moreover, we find that it also depends on networks' structure properties, especially on network efficiency, clustering coefficient and average degree. In addition, we make an attempt to enhance the robustness by using link weighting method to transform un-weighted network to weighted one and then make use of weights of links to characterize their reliability. The result shows that proper link weighting scheme can enhance both robustness and accuracy of these algorithms significantly in biological networks while it brings little computational effort.

[1]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[2]  A Grabowski,et al.  Dynamic phenomena and human activity in an artificial society. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  唐翌,et al.  Link prediction based on a semi-local similarity index , 2011 .

[4]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[5]  Lada A. Adamic,et al.  Friends and neighbors on the Web , 2003, Soc. Networks.

[6]  Linyuan Lü,et al.  Similarity index based on local paths for link prediction of complex networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[7]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[8]  M E J Newman Assortative mixing in networks. , 2002, Physical review letters.

[9]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[10]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[11]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[12]  Carter T. Butts,et al.  Network inference, error, and informant (in)accuracy: a Bayesian approach , 2003, Soc. Networks.

[13]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[14]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[15]  Gueorgi Kossinets Effects of missing data in social networks , 2006, Soc. Networks.

[16]  Lise Getoor,et al.  Link mining: a survey , 2005, SKDD.

[17]  Lada A. Adamic,et al.  The political blogosphere and the 2004 U.S. election: divided they blog , 2005, LinkKDD '05.

[18]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[19]  Roger Guimerà,et al.  Missing and spurious interactions and the reconstruction of complex networks , 2009, Proceedings of the National Academy of Sciences.

[20]  H. White,et al.  STRUCTURAL EQUIVALENCE OF INDIVIDUALS IN SOCIAL NETWORKS , 1977 .

[21]  S. Brenner,et al.  The structure of the nervous system of the nematode Caenorhabditis elegans. , 1986, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[22]  J. Hanley,et al.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases. , 1983, Radiology.

[23]  Sid Redner,et al.  Networks: Teasing out the missing links , 2008, Nature.

[24]  Carlos Melián,et al.  FOOD WEB COHESION , 2004 .

[25]  Haibo Hu,et al.  Disassortative mixing in online social networks , 2009, 0909.0450.

[26]  Tao Zhou,et al.  Link prediction in weighted networks: The role of weak ties , 2010 .

[27]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[28]  V Latora,et al.  Efficient behavior of small-world networks. , 2001, Physical review letters.

[29]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[30]  Linyuan Lu,et al.  Link Prediction in Complex Networks: A Survey , 2010, ArXiv.

[31]  S. N. Dorogovtsev,et al.  Evolution of networks , 2001, cond-mat/0106144.

[32]  L. da F. Costa,et al.  Characterization of complex networks: A survey of measurements , 2005, cond-mat/0505185.

[33]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[34]  Sebastian Wernicke,et al.  Efficient Detection of Network Motifs , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.