Bibliographic coupling and hierarchical clustering for the validation and improvement of subject-classification schemes

An attempt is made to cluster journals from the complete Web of Science database by using bibliographic coupling similarities. Since the sparseness of the underlying similarity matrix proved inappropriate for this exercise, second-order similarities have been used. Only 0.12 % out of 8282 journals had to be removed from the classification as being singletons. The quality at three hierarchical levels with 6, 14 and 24 clusters substantiated the applicability of this method. Cluster labelling was made on the basis of the about 70 subfields of the Leuven–Budapest subject-classification scheme that also allowed the comparison with the existing two-level journal classification system developed in Leuven. The further comparison with the 22 field classification system of the Essential Science Indicators does, however, reveal larger deviations.

[1]  Wolfgang Glänzel,et al.  A new classification scheme of science fields and subfields designed for scientometric evaluation purposes , 2004, Scientometrics.

[2]  Loet Leydesdorff,et al.  Clusters and Maps of Science Journals Based on Bi-connected Graphs in the Journal Citation Reports , 2009, ArXiv.

[3]  Per Ahlgren,et al.  Document-document similarity approaches and science mapping: Experimental comparison of five approaches , 2009, J. Informetrics.

[4]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.

[5]  GlänzelWolfgang,et al.  Bibliographic coupling and hierarchical clustering for the validation and improvement of subject-classification schemes , 2015 .

[6]  Francis Narin,et al.  Interrelationships of scientific journals , 1972, J. Am. Soc. Inf. Sci..

[7]  Wolfgang Glänzel,et al.  A new methodological approach to bibliographic coupling and its application to the national, regional and institutional level , 2005, Scientometrics.

[8]  Michel Zitt,et al.  Indicators in a research institute: A multi-level classification of scientific journals , 1999, Scientometrics.

[9]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[10]  Bo Jarneving The combined application of bibliographic coupling and the complete link cluster method in bibliometric science mapping , 2005 .

[11]  Frizo A. L. Janssens,et al.  Clustering of scientific fields by integrating text mining and bibliometrics , 2007 .

[12]  Wolfgang Glänzel,et al.  Journal cross-citation analysis for validation and improvement of journal-based subject classification in bibliometric research , 2010, Scientometrics.

[13]  Edgar Schiebel,et al.  Do second-order similarities provide added-value in a hybrid approach? , 2013, Scientometrics.

[14]  Bart De Moor,et al.  A hybrid mapping of information science , 2008, Scientometrics.

[15]  Subir K Sen,et al.  A mathematical extension of the idea of bibliographic coupling and its applications , 1983 .

[16]  Wolfgang Glänzel,et al.  Using ‘core documents’ for the representation of clusters and topics , 2011, Scientometrics.

[17]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[18]  Bart De Moor,et al.  Hybrid clustering for validation and improvement of subject-classification schemes , 2009, Inf. Process. Manag..