Updating the SCImago journal and country rank classification: A new approach using Ward's clustering and alternative combination of citation measures

This study introduces a new proposal to refine the classification of the SCImago Journal and Country Rank (SJR) platform by using clustering techniques and an alternative combination of citation measures from an initial 18,891 SJR journal network. Thus, a journal–journal matrix including simultaneously fractionalized values of direct citation, cocitation, and coupling was symmetrized by cosine similarity and later transformed into distances before performing clustering. The results provided a new cluster‐based subject structure comprising 290 clusters that emerge by executing Ward's clustering in two phases and using a mixed labeling procedure based on tf‐idf scores of the original SJR category tags and significant words extracted from journal titles. In total, 13,716 SJR journals were classified using this new cluster‐based scheme. Although more than 5,000 journals were omitted in the classification process, the method produced a consistent classification with a balanced structure of coherent and well‐defined clusters, a moderated multiassignment of journals, and a softer concentration of journals over clusters than in the original SJR categories. New subject disciplines such as “nanoscience and nanotechnology” or “social work” were also detected, providing evidence of good performance of our approach in refining the journal classification and updating the subject classification structure.

[1]  Zaida Chinchilla-Rodríguez,et al.  Coverage analysis of Scopus: A journal metric approach , 2007, Scientometrics.

[2]  Olle Persson,et al.  Identifying research themes with weighted direct citation links , 2010, J. Informetrics.

[3]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[4]  Harold Borko,et al.  Measuring the reliability of subject classification by men and machines , 1964 .

[5]  B. Everitt,et al.  Cluster Analysis Ed. 5 , 2011 .

[6]  S. Dumais Latent Semantic Analysis. , 2005 .

[7]  Francis Narin,et al.  Clustering of scientific journals , 1973, J. Am. Soc. Inf. Sci..

[8]  Bart De Moor,et al.  Hybrid clustering for validation and improvement of subject-classification schemes , 2009, Inf. Process. Manag..

[9]  Henry G. Small,et al.  Clustering the science citation index using co-citations. II. Mapping science , 1985, Scientometrics.

[10]  Wolfgang Glänzel,et al.  Using ‘core documents’ for detecting and labelling new emerging topics , 2011, Scientometrics.

[11]  Kevin W. Boyack,et al.  Toward a consensus map of science , 2009, J. Assoc. Inf. Sci. Technol..

[12]  Eri Yagi,et al.  Derek J. de S. Price (1922–83) Historian of science and herald of scientometrics , 1996 .

[13]  Wolfgang Glänzel,et al.  Combining full-text analysis and bibliometric indicators , 2004 .

[14]  Henry Small,et al.  A SYSTEM FOR AUTOMATIC CLASSIFICATION OF SCIENTIFIC LITERATURE , 2013 .

[15]  Julie Bichteler,et al.  Document retrieval by means of an automatic classification algorithm for citations , 1974, Inf. Storage Retr..

[16]  R. M. Cormack,et al.  A Review of Classification , 1971 .

[17]  K. Boyack,et al.  Is there a Convergent Structure of Science? A Comparison of Maps using the ISI and Scopus Databases , 2007 .

[18]  Zaida Chinchilla-Rodríguez,et al.  Visualizing the marrow of science , 2007 .

[19]  Vladimir Batagelj,et al.  Optimizing SCImago Journal & Country Rank classification by community detection , 2014, J. Informetrics.

[20]  Samuel Schiminovich Automatic classification and retrieval of documents by means of a bibliographic pattern discovery algorithm , 1971, Inf. Storage Retr..

[21]  Wolfgang Glänzel,et al.  A new classification scheme of science fields and subfields designed for scientometric evaluation purposes , 2004, Scientometrics.

[22]  C.-M. Chen,et al.  Classification of scientific networks using aggregated journal-journal citation relations in the Journal Citation Reports , 2008, J. Assoc. Inf. Sci. Technol..

[23]  Félix de Moya Anegón,et al.  Visualizing the structure of science , 2007 .

[24]  Wolfgang Glänzel,et al.  Journal cross-citation analysis for validation and improvement of journal-based subject classification in bibliometric research , 2010, Scientometrics.

[25]  Michel Zitt,et al.  Indicators in a research institute: A multi-level classification of scientific journals , 1999, Scientometrics.

[26]  Harold Borko,et al.  Automatic Document Classification Part II . Additional Experiments , 1964, JACM.

[27]  Ronald Rousseau,et al.  A classification of author co-citations: Definitions and search strategies , 2004, J. Assoc. Inf. Sci. Technol..

[28]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[29]  R. Perrucci,et al.  From Little Science to Big Science , 2017 .

[30]  Wolfgang Glänzel,et al.  Combining full-text analysis and bibliometric indicators. A pilot study , 2005, Scientometrics.

[31]  Harold Borko,et al.  Automatic Document Classification , 1963, JACM.

[32]  W. Bruce Croft,et al.  Document clustering: An evaluation of some experiments with the cranfield 1400 collection , 1975, Inf. Process. Manag..

[33]  Henry G. Small,et al.  Clustering thescience citation index® using co-citations - I. A comparison of methods , 1985, Scientometrics.

[34]  W. Bruce Croft A model of cluster searching bases on classification , 1980, Inf. Syst..

[35]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[36]  Félix de Moya Anegón,et al.  Visualizing the marrow of science , 2007, J. Assoc. Inf. Sci. Technol..

[37]  E. Garfield,et al.  The geography of science: disciplinary and national mappings , 1985 .

[38]  Alexander I. Pudovkin,et al.  Algorithmic procedure for finding semantically related journals , 2002, J. Assoc. Inf. Sci. Technol..

[39]  Wolfgang Glänzel,et al.  Improving SCImago Journal & Country Rank (SJR) subject classification through reference analysis , 2011, Scientometrics.

[40]  Samuel Schiminovich,et al.  A clustering experiment: First step towards a computer-generated classification scheme , 1968, Inf. Storage Retr..

[41]  C. J. van Rijsbergen Further experiments with hierarchic clustering in document retrieval , 1974, Inf. Storage Retr..

[42]  Loet Leydesdorff,et al.  Dynamic and evolutionary updates of classificatory schemes in scientific journal structures , 2002, J. Assoc. Inf. Sci. Technol..

[43]  Peter Willett,et al.  Recent trends in hierarchic document clustering: A critical review , 1988, Inf. Process. Manag..

[44]  Ismael Rafols,et al.  A global map of science based on the ISI subject categories , 2009, J. Assoc. Inf. Sci. Technol..

[45]  Henry G. Small,et al.  Clustering thescience citation index® using co-citations , 1985, Scientometrics.

[46]  Zaida Chinchilla-Rodríguez,et al.  A new technique for building maps of large scientific domains based on the cocitation of classes and categories , 2004, Scientometrics.

[47]  Isabel Gómez,et al.  Coping with the problem of subject classification diversity , 2005, Scientometrics.

[48]  Loet Leydesdorff,et al.  Journal maps on the basis of Scopus data: A comparison with the Journal Citation Reports of the ISI , 2009, J. Assoc. Inf. Sci. Technol..

[49]  B. Everitt,et al.  Cluster Analysis: Everitt/Cluster Analysis , 2011 .

[50]  Tamara Krajna,et al.  SCImago Journal & Country Rank , 2008 .

[51]  Ismael Rafols,et al.  Content-based and algorithmic classifications of journals: Perspectives on the dynamics of scientific communication and indexer effects , 2008, J. Assoc. Inf. Sci. Technol..

[52]  Kevin W. Boyack,et al.  Mapping the backbone of science , 2004, Scientometrics.

[53]  GlänzelWolfgang,et al.  Hybrid clustering for validation and improvement of subject-classification schemes , 2009 .