On computing PageRank via lumping the Google matrix

Computing Google's PageRank via lumping the Google matrix was recently analyzed in [I.C.F. Ipsen, T.M. Selee, PageRank computation, with special attention to dangling nodes, SIAM J. Matrix Anal. Appl. 29 (2007) 1281-1296]. It was shown that all of the dangling nodes can be lumped into a single node and the PageRank could be obtained by applying the power method to the reduced matrix. Furthermore, the stochastic reduced matrix had the same nonzero eigenvalues as the full Google matrix and the power method applied to the reduced matrix had the same convergence rate as that of the power method applied to the full matrix. Therefore, a large amount of operations could be saved for computing the full PageRank vector. In this note, we show that the reduced matrix obtained by lumping the dangling nodes can be further reduced by lumping a class of nondangling nodes, called weakly nondangling nodes, to another single node, and the further reduced matrix is also stochastic with the same nonzero eigenvalues as the Google matrix.

[1]  Gang Wu,et al.  A Power–Arnoldi algorithm for computing PageRank , 2007, Numer. Linear Algebra Appl..

[2]  Ilse C. F. Ipsen,et al.  PageRank Computation, with Special Attention to Dangling Nodes , 2007, SIAM J. Matrix Anal. Appl..

[3]  Gene H. Golub,et al.  Exploiting the Block Structure of the Web for Computing , 2003 .

[4]  Claude Brezinski,et al.  Extrapolation methods for PageRank computations , 2005 .

[5]  Amy Nicole Langville,et al.  Updating pagerank with iterative aggregation , 2004, WWW Alt. '04.

[6]  G. Golub,et al.  An Arnoldi-type algorithm for computing page rank , 2006 .

[7]  Ilse C. F. Ipsen,et al.  Convergence Analysis of a PageRank Updating Algorithm by Langville and Meyer , 2005, SIAM J. Matrix Anal. Appl..

[8]  Amy Nicole Langville,et al.  A Reordering for the PageRank Problem , 2005, SIAM J. Sci. Comput..

[9]  Gang Wu Eigenvalues and Jordan canonical form of a successively rank-one updated complex matrix with applications to Google's PageRank problem , 2008 .

[10]  Taher H. Haveliwala,et al.  The Second Eigenvalue of the Google Matrix , 2003 .

[11]  Amy Nicole Langville,et al.  Google's PageRank and beyond - the science of search engine rankings , 2006 .

[12]  Yimin Wei,et al.  Comments on "Jordan Canonical Form of the Google Matrix" , 2008, SIAM J. Matrix Anal. Appl..

[13]  Gene H. Golub,et al.  Computing PageRank using Power Extrapolation , 2003 .

[14]  Carl D. Meyer,et al.  Deeper Inside PageRank , 2004, Internet Math..

[15]  S. Serra-Capizzano Jordan Canonical Form of the Google Matrix: A Potential Contribution to the PageRank Computation , 2005 .

[16]  Pavel Berkhin,et al.  A Survey on PageRank Computing , 2005, Internet Math..

[17]  Gene H. Golub,et al.  Extrapolation methods for accelerating PageRank computations , 2003, WWW '03.

[18]  Taher H. Haveliwala,et al.  Adaptive methods for the computation of PageRank , 2004 .