Measuring the movement of a research paradigm

A research paradigm is a dynamical system of scientific works, including their perceived values by peer scientists, and governed by intrinsic intellectual values and associated citation endurance and decay. Identifying an emerging research paradigm and monitoring changes in an existing paradigm have been a challenging task due to the scale and complexity involved. In this article, we describe an exploratory data analysis method for identifying a research paradigm based on clustering scientific articles by their citation half life and betweenness centrality as well as citation frequencies. The Expectation Maximization algorithm is used to cluster articles based on these attributes. It is hypothesized that the resultant clusters correspond to dynamic groupings of articles manifested by a research paradigm. The method is tested with three example datasets: Social Network Analysis (1992-2004), Mass Extinction (1981-2004), and Terrorism (1989-2004). All these subject domains have known emergent paradigms identified independently. The resultant clusters are interpreted and assessed with reference to clusters identified by co-citation links. The consistency and discrepancy between the EM clusters and the link-based co-citation clusters are also discussed.

[1]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[2]  Henry Small Visualizing science by citation mapping , 1999 .

[3]  Anthony F. J. van Raan,et al.  On Growth, Ageing, and Fractal Differentiation of Science , 2000, Scientometrics.

[4]  Ulrik Brandes,et al.  Visual unrolling of network evolution and the analysis of dynamic discourse , 2002, IEEE Symposium on Information Visualization, 2002. INFOVIS 2002..

[5]  Stephen G. Kobourov,et al.  Exploring the computing literature using temporal graph visualization , 2004, IS&T/SPIE Electronic Imaging.

[6]  Gary G. Yen,et al.  Time line visualization of research fronts , 2003, J. Assoc. Inf. Sci. Technol..

[7]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[8]  B. C. Griffith,et al.  The Structure of Scientific Literatures II: Toward a Macro- and Microstructure for Science , 1974 .

[9]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[10]  Kevin W. Boyack,et al.  Domain visualization using VxInsight® for science and technology management , 2002, J. Assoc. Inf. Sci. Technol..

[11]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[12]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[13]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[14]  Chaomei Chen,et al.  Detecting and mapping thematic changes in transient networks , 2004, Proceedings. Eighth International Conference on Information Visualisation, 2004. IV 2004..

[15]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[16]  Ray J. Paul,et al.  Visualizing a Knowledge Domain's Intellectual Structure , 2001, Computer.

[17]  Chaomei Chen,et al.  The centrality of pivotal points in the evolution of scientific networks , 2005, IUI.

[18]  Chaomei Chen,et al.  Searching for intellectual turning points: Progressive knowledge domain visualization , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis: II: Dynamical aspects , 1991, J. Am. Soc. Inf. Sci..

[20]  D J PRICE,et al.  NETWORKS OF SCIENTIFIC PAPERS. , 1965, Science.

[21]  Roger W. Schvaneveldt,et al.  Pathfinder associative networks: studies in knowledge organization , 1990 .

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[23]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[24]  S. Redner Citation Statistics From More Than a Century of Physical Review , 2004, physics/0407137.

[25]  Henk F. Moed,et al.  Mapping of science by combined co-citation and word analysis, I. Structural aspects , 1991, J. Am. Soc. Inf. Sci..

[26]  Chaomei Chen,et al.  The rising landscape: A visual exploration of superstring revolutions in physics , 2003, J. Assoc. Inf. Sci. Technol..

[27]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[28]  Chaomei Chen,et al.  Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries , 1999, Inf. Process. Manag..

[29]  Leonard M. Freeman,et al.  A set of measures of centrality based upon betweenness , 1977 .

[30]  Ulrik Brandes,et al.  Visualization of Bibliographic Networks with a Reshaped Landscape Metaphor , 2002, VisSym.

[31]  T. Kuhn,et al.  The Structure of Scientific Revolutions. , 1964 .

[32]  Eugene Garfield,et al.  Citation indexing - its theory and application in science, technology, and humanities , 1979 .

[33]  Anthony E. Cawkell,et al.  Mapping Scientific Frontiers: The Quest for Knowledge Visualization , 2003, J. Documentation.