Citation-based clustering of publications using CitNetExplorer and VOSviewer

Clustering scientific publications in an important problem in bibliometric research. We demonstrate how two software tools, CitNetExplorer and VOSviewer, can be used to cluster publications and to analyze the resulting clustering solutions. CitNetExplorer is used to cluster a large set of publications in the field of astronomy and astrophysics. The publications are clustered based on direct citation relations. CitNetExplorer and VOSviewer are used together to analyze the resulting clustering solutions. Both tools use visualizations to support the analysis of the clustering solutions, with CitNetExplorer focusing on the analysis at the level of individual publications and VOSviewer focusing on the analysis at an aggregate level. The demonstration provided in this paper shows how a clustering of publications can be created and analyzed using freely available software tools. Using the approach presented in this paper, bibliometricians are able to carry out sophisticated cluster analyses without the need to have a deep knowledge of clustering techniques and without requiring advanced computer skills.

[1]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  Wolfgang Glänzel,et al.  Same data—different results? Towards a comparative approach to the identification of thematic structures in science , 2017, Scientometrics.

[3]  Kevin W. Boyack,et al.  Comparison of topic extraction approaches and their results , 2017, Scientometrics.

[4]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[5]  Kevin W. Boyack,et al.  Which Type of Citation Analysis Generates the Most Accurate Taxonomy of Scientific and Technical Knowledge? , 2015, J. Assoc. Inf. Sci. Technol..

[6]  Bart De Moor,et al.  A hybrid mapping of information science , 2008, Scientometrics.

[7]  Ludo Waltman,et al.  Software survey: VOSviewer, a computer program for bibliometric mapping , 2009, Scientometrics.

[8]  Ludvig Bohlin,et al.  Community detection and visualization of networks with the map equation framework , 2014 .

[9]  Kevin W. Boyack,et al.  Clustering More than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approaches , 2011, PloS one.

[10]  Ludo Waltman,et al.  A smart local moving algorithm for large-scale modularity-based community detection , 2013, The European Physical Journal B.

[11]  魏屹东,et al.  Scientometrics , 2018, Encyclopedia of Big Data.

[12]  Wolfgang Glänzel,et al.  Same data—different results? Towards a comparative approach to the identification of thematic structures in science , 2017, Scientometrics.

[13]  H. Small,et al.  Identifying emerging topics in science and technology , 2014 .

[14]  Ed C. M. Noyons,et al.  A unified approach to mapping and clustering of bibliometric networks , 2010, J. Informetrics.

[15]  Ludo Waltman,et al.  A new methodology for constructing a publication-level classification system of science , 2012, J. Assoc. Inf. Sci. Technol..

[16]  Wolfgang Glänzel,et al.  Same data—different results? Towards a comparative approach to the identification of thematic structures in science : Introduction to the special issue , 2017 .

[17]  Kevin W. Boyack,et al.  Including cited non-source items in a large-scale map of science: What difference does it make? , 2014, J. Informetrics.

[18]  Kevin W. Boyack,et al.  Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? , 2010 .

[19]  Abhay Ashtekar,et al.  Multipole moments of isolated horizons , 2004 .

[20]  Bart De Moor,et al.  Towards mapping library and information science , 2006, Inf. Process. Manag..

[21]  Ludo Waltman,et al.  Visualizing Bibliometric Networks , 2014 .

[22]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[23]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[24]  Dietmar Wolfram,et al.  Measuring Scholarly Impact: Methods and Practice , 2014 .

[25]  V A Traag,et al.  Narrow scope for resolution-limit-free community detection. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  H. Ritter,et al.  Catalogue of Cataclysmic Binaries, Low-Mass X-Ray Binaries and Related Objects , 1984, astro-ph/0301444.

[27]  Ludo Waltman,et al.  CitNetExplorer: A new software tool for analyzing and visualizing citation networks , 2014, J. Informetrics.

[28]  Ludo Waltman,et al.  Text mining and visualization using VOSviewer , 2011, ArXiv.

[29]  Ludo Waltman,et al.  Clustering Scientific Publications Based on Citation Relations: A Systematic Comparison of Different Methods , 2015, PloS one.

[30]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[31]  Kevin W. Boyack,et al.  Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? , 2010, J. Assoc. Inf. Sci. Technol..