Reliability of rank order in sampled networks

Abstract.In complex scale-free networks, ranking the individual nodes based upon their importance has useful applications, such as the identification of hubs for epidemic control, or bottlenecks for controlling traffic congestion. However, in most real situations, only limited sub-structures of entire networks are available, and therefore the reliability of the order relationships in sampled networks requires investigation. With a set of randomly sampled nodes from the underlying original networks, we rank individual nodes by three centrality measures: degree, betweenness, and closeness. The higher-ranking nodes from the sampled networks provide a relatively better characterisation of their ranks in the original networks than the lower-ranking nodes. A closeness-based order relationship is more reliable than any other quantity, due to the global nature of the closeness measure. In addition, we show that if access to hubs is limited during the sampling process, an increase in the sampling fraction can in fact decrease the sampling accuracy. Finally, an estimation method for assessing sampling accuracy is suggested.

[1]  B. Kahng,et al.  Evolution of the Protein Interaction Network of Budding Yeast: Role of the Protein Family Compatibility Constraint , 2003, q-bio/0312009.

[2]  Cristopher Moore,et al.  Accuracy and scaling phenomena in Internet mapping. , 2004, Physical review letters.

[3]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[4]  Thomas W. Valente,et al.  The stability of centrality measures when networks are sampled , 2003, Soc. Networks.

[5]  Ove Frank,et al.  Models and Methods in Social Network Analysis: Network Sampling and Model Fitting , 2005 .

[6]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[7]  K. Goh,et al.  Universal behavior of load distribution in scale-free networks. , 2001, Physical review letters.

[8]  Alessandro Vespignani,et al.  Epidemic spreading in scale-free networks. , 2000, Physical review letters.

[9]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[10]  A. Barabasi,et al.  Halting viruses in scale-free networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[12]  M. Newman Erratum: Scientific collaboration networks. II. Shortest paths, weighted networks, and centrality (Physical Review e (2001) 64 (016132)) , 2006 .

[13]  Stefano Mossa,et al.  Truncation of power law behavior in "scale-free" network models due to information filtering. , 2002, Physical review letters.

[14]  S. Strogatz Exploring complex networks , 2001, Nature.

[15]  Luciano Rossoni,et al.  Models and methods in social network analysis , 2006 .

[16]  G. K. Bhattacharyya,et al.  Statistical Concepts And Methods , 1978 .

[17]  A. Klovdahl,et al.  Social networks and the spread of infectious diseases: the AIDS example. , 1985, Social science & medicine.

[18]  M. Newman,et al.  Random graphs with arbitrary degree distributions and their applications. , 2000, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[20]  Beom Jun Kim,et al.  Path finding strategies in scale-free networks. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Martin Stetter,et al.  Noisy scale-free networks , 2005 .

[22]  L. Freeman Centrality in social networks conceptual clarification , 1978 .

[23]  S. Wasserman,et al.  Models and Methods in Social Network Analysis , 2005 .

[24]  Hawoong Jeong,et al.  Statistical properties of sampled networks. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.