Using Bayesian networks to discover relationships between bibliometric indices. A case study of computer science and artificial intelligence journals

Funding agencies and promotion committees use bibliometric indices to evaluate the importance of research at different levels, so these indices have received a lot of attention from the scientific community over the last few years. Many new indices have been developed to take into account aspects not previously covered. As a result, the scientific community now faces the challenge of selecting, from this pool, the indices that meet the required quality standards. Given the vast number of bibliometric indices, it is necessary to analyze how they relate to each other (for example, whether one is irrelevant to or dependent on another). Our main purpose is to learn a Bayesian network model from data in order to analyze the relationships among bibliometric indices. The induced Bayesian network is then used to discover probabilistic conditional (in)dependencies among the indices and also for probabilistic reasoning. We also run a case study of 14 well-known bibliometric indices on computer science and artificial intelligence journals.
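To make the idea of conditional (in)dependence in a Bayesian network concrete, the following is a minimal sketch and not the model learned in the paper. It assumes three hypothetical binarized indices A, B and C (1 = "high", 0 = "low") arranged in a chain A -> B -> C, with made-up probability tables. It checks by brute-force enumeration that C depends on A marginally but becomes independent of A once B is known.

```python
from itertools import product

# Hypothetical chain A -> B -> C over binarized indices.
# All probabilities below are invented for illustration only.
p_a = {1: 0.4, 0: 0.6}
p_b_given_a = {1: {1: 0.8, 0: 0.2}, 0: {1: 0.3, 0: 0.7}}  # p_b_given_a[a][b]
p_c_given_b = {1: {1: 0.9, 0: 0.1}, 0: {1: 0.2, 0: 0.8}}  # p_c_given_b[b][c]


def joint(a, b, c):
    """Joint probability from the chain factorization P(A) P(B|A) P(C|B)."""
    return p_a[a] * p_b_given_a[a][b] * p_c_given_b[b][c]


def prob(**fixed):
    """Probability of a partial assignment, summing the joint over the rest."""
    total = 0.0
    for a, b, c in product((0, 1), repeat=3):
        assignment = {"a": a, "b": b, "c": c}
        if all(assignment[name] == value for name, value in fixed.items()):
            total += joint(a, b, c)
    return total


def conditional(target, given):
    """P(target | given), both passed as dicts of variable name -> value."""
    return prob(**target, **given) / prob(**given)


# Marginally, knowing A changes our belief about C ...
p_c = prob(c=1)
p_c_if_a = conditional({"c": 1}, {"a": 1})

# ... but once B is observed, A carries no further information about C.
p_c_if_b = conditional({"c": 1}, {"b": 1})
p_c_if_b_a1 = conditional({"c": 1}, {"b": 1, "a": 1})
p_c_if_b_a0 = conditional({"c": 1}, {"b": 1, "a": 0})

print(p_c, p_c_if_a)
print(p_c_if_b, p_c_if_b_a1, p_c_if_b_a0)
```

The same `conditional` helper also illustrates probabilistic reasoning in the sense used above: it answers queries about one index given evidence on others. Enumeration is only practical for a handful of variables; larger networks would need a dedicated inference algorithm.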
