Graph-based algorithms for ranking researchers: not all swans are white!

Scientific importance ranking has long been an important research topic in scientometrics. Many indices based on citation counts have been proposed. In recent years, several graph-based ranking algorithms have been studied and claimed to be reasonable and effective. However, most current researches fall short of a concrete view of what these graph-based ranking algorithms bring to bibliometric analysis. In this paper, we make a comparative study of state-of-the-art graph-based algorithms using the APS (American Physical Society) dataset. We focus on ranking researchers. Some interesting findings are made. Firstly, simple citation-based indices like citation count can return surprisingly better results than many cutting-edge graph-based ranking algorithms. Secondly, how we define researcher importance may have tremendous impacts on ranking performance. Thirdly, some ranking methods which at the first glance are totally different have high rank correlations. Finally, the data of which time period are chosen for ranking greatly influence ranking performance but still remains open for further study. We also try to give explanations to a large part of the above findings. The results of this study open a third eye on the current research status of bibliometric analysis.

[1]  K. Hajra,et al.  Modelling aging characteristics in citation networks , 2005, physics/0508035.

[2]  Z. K. Silagadze,et al.  Citation entropy and research impact estimation , 2009, ArXiv.

[3]  Kamalika Basu Hajra,et al.  Aging in citation networks , 2004, cond-mat/0409017.

[4]  Riyaz Sikora,et al.  Assessing the relative influence of journals in a citation network , 2005, CACM.

[5]  Santo Fortunato,et al.  Characterizing and Modeling Citation Dynamics , 2011, PloS one.

[6]  L. Egghe,et al.  Theory and practise of the g-index , 2006, Scientometrics.

[7]  Daren Yu,et al.  Effect of the age of papers on the preferential attachment in citation networks , 2009 .

[8]  Ying Ding,et al.  Applying centrality measures to impact analysis: A coauthorship network analysis , 2009, J. Assoc. Inf. Sci. Technol..

[9]  C. Lee Giles,et al.  Ranking authors in digital libraries , 2011, JCDL '11.

[10]  Mario Lefebvre,et al.  Applied Stochastic Processes , 2006 .

[11]  Sergei Maslov,et al.  Ranking scientific publications using a model of network traffic , 2006, ArXiv.

[12]  Cassidy R. Sugimoto,et al.  P-Rank: An indicator measuring prestige in heterogeneous scholarly networks , 2011, J. Assoc. Inf. Sci. Technol..

[13]  Hai Zhuge,et al.  Towards an effective and unbiased ranking of scientific literature through mutual reinforcement , 2012, CIKM.

[14]  Lise Getoor,et al.  FutureRank: Ranking Scientific Articles by Predicting their Future PageRank , 2009, SDM.

[15]  Hai Zhuge,et al.  Topological centrality and its e-Science applications , 2010, J. Assoc. Inf. Sci. Technol..

[16]  Michael I. Jordan,et al.  Stable algorithms for link analysis , 2001, SIGIR '01.

[17]  James Caverlee,et al.  PageRank for ranking authors in co-citation networks , 2009, J. Assoc. Inf. Sci. Technol..

[18]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[19]  Josep Domingo-Ferrer,et al.  A bibliometric index based on the collaboration distance between cited and citing authors , 2011, J. Informetrics.

[20]  James Caverlee,et al.  PageRank for ranking authors in co-citation networks , 2009 .

[21]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.

[22]  Santo Fortunato,et al.  Diffusion of scientific credits and the ranking of scientists , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  NerurSridhar,et al.  Assessing the relative influence of journals in a citation network , 2005 .

[24]  Hongyuan Zha,et al.  Co-ranking Authors and Documents in a Heterogeneous Network , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[25]  Shlomo Moran,et al.  SALSA: the stochastic approach for link-structure analysis , 2001, TOIS.

[26]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[27]  E. Garfield Citation analysis as a tool in journal evaluation. , 1972, Science.

[28]  Philip S. Yu,et al.  Time Sensitive Ranking with Application to Publication Search , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[29]  Sergei Maslov,et al.  Finding scientific gems with Google's PageRank algorithm , 2006, J. Informetrics.