The effects of dangling nodes on citation networks

This study discusses the effects of dangling nodes on citation networks through the PageRank algorithm. The origins of dangling nodes for citation networks are introduced and three methods are applied to handle dangling nodes: retaining all dangling nodes, deleting dangling nodes, and clustering dangling nodes into one node. Correlation analyses are used to compare these three methods.

[1]  Ilse C. F. Ipsen,et al.  PageRank Computation, with Special Attention to Dangling Nodes , 2007, SIAM J. Matrix Anal. Appl..

[2]  Carl D. Meyer,et al.  Deeper Inside PageRank , 2004, Internet Math..

[3]  Xu Jia,et al.  A Fast Two-Stage Algorithm for Computing SimRank and Its Extensions , 2010, WAIM Workshops.

[4]  C. Lee Giles,et al.  Accessibility of information on the web , 1999, Nature.

[5]  R. Rousseau,et al.  LOTKA: A program to fit a power law distribution to observed frequency data. , 2000 .

[6]  Gene H. Golub,et al.  Exploiting the Block Structure of the Web for Computing , 2003 .

[7]  Gabriel Pinski,et al.  Citation influence for journal aggregates of scientific publications: Theory, with application to the literature of physics , 1976, Inf. Process. Manag..

[8]  Albert-László Barabási,et al.  Linked - how everything is connected to everything else and what it means for business, science, and everyday life , 2003 .

[9]  Mike Thelwall,et al.  Search engine coverage bias: evidence and possible causes , 2004, Inf. Process. Manag..

[10]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[11]  Yi Zhao,et al.  Bringing PageRank to the citation analysis , 2008, Inf. Process. Manag..

[12]  Massimo Franceschet,et al.  PageRank: Stand on the shoulders of giants , 2010, ArXiv.

[13]  Judit Bar-Ilan,et al.  Informetrics at the beginning of the 21st century - A review , 2008, J. Informetrics.

[14]  Johan Bollen,et al.  Journal status , 2006, Scientometrics.

[15]  G. Golub,et al.  A Fast Two-Stage Algorithm for Computing PageRank , 2003 .

[16]  Kevin S. McCurley,et al.  Ranking the web frontier , 2004, WWW '04.

[17]  Rajeev Motwani,et al.  What can you do with a Web in your Pocket? , 1998, IEEE Data Eng. Bull..

[18]  Sergei Maslov,et al.  Finding scientific gems with Google's PageRank algorithm , 2006, J. Informetrics.