The hw-rank: an h-index variant for ranking web pages

We introduce a novel ranking of search results based on a variant of the h-index for directed information networks such as the Web. The h-index was originally introduced to measure an individual researcher’s scientific output and influence; here a variant of it is applied to assess the “importance” of web pages. As with PageRank, the “importance” of a page is defined by the “importance” of the pages linking to it. However, unlike PageRank, whose computation involves the whole web graph, the h-index for web pages (the hw-rank) is computed locally: only the neighbors of the given node and the neighbors of those neighbors are considered. Preliminary results show a strong correlation between rankings by hw-rank and by PageRank, and the hw-rank is simpler to compute. Further, larger-scale experiments are needed to assess the applicability of the method.
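The abstract does not state the exact formula, so the following is only a minimal sketch of one plausible reading of the local computation. It assumes that the score of a page is the largest h such that at least h pages link to it and each of those pages itself has at least h in-links. The function names `in_links` and `hw_rank`, the edge-list input format, and the choice to ignore self-links are all illustrative assumptions, not the authors' definitions.

```python
from collections import defaultdict


def in_links(edges):
    """Build a map from each page to the set of pages that link to it.

    `edges` is an iterable of (source, target) pairs (assumed input format).
    """
    incoming = defaultdict(set)
    for src, dst in edges:
        if src != dst:  # assumption: self-links carry no endorsement
            incoming[dst].add(src)
    return incoming


def hw_rank(page, incoming):
    """Hypothetical h-type score for `page`.

    Returns the largest h such that at least h in-linking pages
    each have at least h in-links of their own. Only the in-neighbors
    of `page` and their in-neighbors are inspected.
    """
    # In-degree of every page that links to `page`, largest first.
    degrees = sorted(
        (len(incoming.get(q, ())) for q in incoming.get(page, ())),
        reverse=True,
    )
    h = 0
    for position, degree in enumerate(degrees, start=1):
        if degree >= position:
            h = position
        else:
            break
    return h


# Toy graph: pages a, b, c link to t; a and b each have two in-links, c has one.
edges = [
    ("a", "t"), ("b", "t"), ("c", "t"),
    ("x", "a"), ("y", "a"),
    ("x", "b"), ("y", "b"),
    ("x", "c"),
]
incoming = in_links(edges)
print(hw_rank("t", incoming))  # 2
```

In this toy graph, page t has three in-linking pages with in-degrees 2, 2, and 1, so two of them have at least two in-links and the score is 2. The computation for one page touches only its two-step in-neighborhood, which reflects the locality the abstract contrasts with PageRank's dependence on the whole graph.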
