PageRank (PR) is an algorithm originally developed by Google to evaluate the importance of web pages. Considering how deeply rooted Google's PR algorithm is to gathering relevant information or to the success of modern businesses, the question of rank stability and choice of the damping factor (a parameter in the algorithm) is clearly important. We investigate PR as a function of the damping factor d on a network obtained from a domain of the World Wide Web, finding that rank reversal happens frequently over a broad range of PR (and of d). We use three different correlation measures, Pearson, Spearman, and Kendall, to study rank reversal as d changes, and we show that the correlation of PR vectors drops rapidly as d changes from its frequently cited value, d_{0}=0.85. Rank reversal is also observed by measuring the Spearman and Kendall rank correlation, which evaluate relative ranks rather than absolute PR. Rank reversal happens not only in directed networks containing rank sinks but also in a single strongly connected component, which by definition does not contain any sinks. We relate rank reversals to rank pockets and bottlenecks in the directed network structure. For the network studied, the relative rank is more stable by our measures around d=0.65 than at d=d_{0}.
[1]
Julia Kastner,et al.
Introduction to Robust Estimation and Hypothesis Testing
,
2005
.
[2]
Kumar Chellapilla,et al.
Proceedings of the 4th international workshop on Adversarial information retrieval on the web
,
2008,
AIRWeb 2008.
[3]
Manjit,et al.
Neurology
,
1912,
NeuroImage.
[4]
Giuseppe Di Battista,et al.
26 Computer Networks
,
2004
.
[5]
E. Ziegel.
Introduction to Robust Estimation and Hypothesis Testing (2nd ed.)
,
2005
.
[6]
I. Ial,et al.
Nature Communications
,
2010,
Nature Cell Biology.
[7]
Alexander Dekhtyar,et al.
Information Retrieval
,
2018,
Lecture Notes in Computer Science.
[8]
Peter Grassberger,et al.
Sampling properties of directed networks
,
2012,
Physical review. E, Statistical, nonlinear, and soft matter physics.