Do PageRank-based author rankings outperform simple citation counts?

The basic indicators of a researcher's productivity and impact are still the number of publications and their citation counts. These metrics are clear, straightforward, and easy to obtain. When a ranking of scholars is needed, for instance in grant, award, or promotion procedures, using them is the fastest and cheapest way of prioritizing some scientists over others. However, because they are so simple, they risk oversimplifying scientific achievements. Many other indicators have therefore been proposed, including the PageRank algorithm, known from the ranking of webpages, and its modifications suited to citation networks. This recursive method is computationally expensive, however, and even though it has the advantage of favouring prestige over popularity, its application should be well justified, particularly when compared to the standard citation counts. In this study, we analyze three large datasets of computer science papers in the categories of artificial intelligence, software engineering, and theory and methods, and apply 12 different ranking methods to the citation networks of authors. We compare the resulting rankings with self-compiled lists of outstanding researchers, selected as frequent editorial board members of prestigious journals in the field, and conclude that there is no evidence of PageRank-based methods outperforming simple citation counts.
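The contrast between popularity (citation counts) and prestige (PageRank) can be illustrated with a minimal sketch. It is not one of the 12 ranking methods used in the study: the five-author network, the function names, the damping factor of 0.85, and the uniform handling of authors who cite no one are all assumptions made for illustration.

```python
# Toy author citation network as (citing_author, cited_author) pairs.
# The authors and edges are hypothetical, chosen only for illustration.
edges = [("C", "B"), ("D", "B"), ("E", "B"), ("B", "A")]


def citation_counts(edges):
    """Count incoming citations per author (popularity)."""
    counts = {}
    for citing, cited in edges:
        counts.setdefault(citing, 0)
        counts[cited] = counts.get(cited, 0) + 1
    return counts


def pagerank(edges, damping=0.85, iterations=100):
    """Plain PageRank by power iteration (prestige).

    Rank held by authors with no outgoing citations (dangling nodes)
    is redistributed uniformly, which is one common convention.
    """
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for citing, cited in edges:
        out_links[citing].append(cited)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        dangling = sum(rank[v] for v in nodes if not out_links[v])
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {v: base for v in nodes}
        for v in nodes:
            if out_links[v]:
                share = damping * rank[v] / len(out_links[v])
                for target in out_links[v]:
                    new_rank[target] += share
        rank = new_rank
    return rank


counts = citation_counts(edges)
ranks = pagerank(edges)

by_citations = sorted(counts, key=counts.get, reverse=True)
by_pagerank = sorted(ranks, key=ranks.get, reverse=True)
print("Top author by citation count:", by_citations[0])
print("Top author by PageRank:", by_pagerank[0])
```

In this toy network, B leads on citation count with three citations, while A leads on PageRank because its single citation comes from the well-cited B. Whether such reorderings agree better with an external reference list, such as the editorial board members used in this study, is the empirical question the paper examines.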
