Measuring relatedness between communities in a citation network

As academic disciplines are segmented and specialized, it becomes more difficult to capture relevant research areas precisely by common retrieval strategies using either keywords or journal categories. This paper proposes a method of measuring the relatedness among sets of academic papers in order to detect unrelated communities which are not related to target topic. A citation network, extracted by given keywords, is divided into communities based on the density of links. We measured and compared four measures of relatedness between two communities in a citation network for three large-scale citation datasets. We used both link and semantic similarities. The topological distance from the center in a citation network is a more efficient measure for removing the unrelated communities than the other three measures: the ratio of the number of intercluster links over the all links, the ratio of the number of common terms over all terms, cosine similarity of tf-idf vectors. © 2011 Wiley Periodicals, Inc.

[1]  Gert Sabidussi,et al.  The centrality index of a graph , 1966 .

[2]  Brij Mohan Gupta,et al.  Networks of scientific papers: A comparative analysis of co-citation, bibliographic coupling and direct citation , 1977 .

[3]  R B Haynes,et al.  Bridges between health care research evidence and clinical practice. , 1995, Journal of the American Medical Informatics Association : JAMIA.

[4]  T. D. Wilson,et al.  Information behaviour: an interdisciplinary perspective , 1997, Inf. Process. Manag..

[5]  F. W. Lancaster,et al.  Types and Levels of Collaboration in Interdisciplinary Research in the Sciences , 1997, J. Am. Soc. Inf. Sci..

[6]  Ronald N. Kostoff,et al.  Database tomography for information retrieval , 1997, J. Inf. Sci..

[7]  S. Gopalakrishnan,et al.  A review of innovation research in economics, sociology and technology management , 1997 .

[8]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[9]  Hideki Mima,et al.  The C-value / Example-based approach to the automatic recognition of multi-word terms for cross-language terminology , 1998 .

[10]  Chaomei Chen,et al.  Visualising Semantic Spaces and Author Co-Citation Networks in Digital Libraries , 1999, Inf. Process. Manag..

[11]  Carole L. Palmer,et al.  Structures and Strategies of Interdisciplinary Science , 1999, J. Am. Soc. Inf. Sci..

[12]  Henry Small Visualizing science by citation mapping , 1999 .

[13]  Hideki Mima,et al.  Automatic recognition of multi-word terms:. the C-value/NC-value method , 2000, International Journal on Digital Libraries.

[14]  Kevin W. Boyack,et al.  Domain visualization using VxInsight® for science and technology management , 2002, J. Assoc. Inf. Sci. Technol..

[15]  Martin Suter,et al.  Small World , 2002 .

[16]  Timothy Cribbin,et al.  Visualizing and tracking the growth of competing paradigms: Two case studies , 2002, J. Assoc. Inf. Sci. Technol..

[17]  Chaomei Chen,et al.  Visualizing knowledge domains , 2005, Annu. Rev. Inf. Sci. Technol..

[18]  Uri Oron,et al.  Living nanovesicles--chemical and physical survival strategies of primordial biosystems. , 2003, Journal of proteome research.

[19]  Henry G. Small,et al.  Paradigms, citations, and maps of science: A personal history , 2003, J. Assoc. Inf. Sci. Technol..

[20]  M. Roco Nanotechnology: convergence with modern biology and medicine. , 2003, Current opinion in biotechnology.

[21]  Bruce Hendrickson,et al.  Knowledge Mining With VxInsight: Discovery Through Interaction , 1998, Journal of Intelligent Information Systems.

[22]  Loet Leydesdorff,et al.  Clusters and Maps of Science Journals Based on Bi-connected Graphs in the Journal Citation Reports , 2009, ArXiv.

[23]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Meera Sharma,et al.  Morphological and immunological characteristics of nanobacteria from human renal stones of a north Indian population , 2004, Urological Research.

[25]  Byung Hong Kim,et al.  Enrichment of microbial community generating electricity using a fuel-cell-type electrochemical cell , 2004, Applied Microbiology and Biotechnology.

[26]  Albert H. Davis,et al.  Observation of large magnetic field effects in organic light- emitting diodes , 2004, SPIE Optics + Photonics.

[27]  Blake Ives,et al.  Review: IT-Dependent Strategic Initiatives and Sustained Competitive Advantage: A Review and Synthesis of the Literature , 2005, MIS Q..

[28]  Kevin W. Boyack,et al.  Mapping the backbone of science , 2004, Scientometrics.

[29]  O. Mermer,et al.  Large magnetoresistance at room-temperature in small-molecular-weight organic semiconductor sandwich devices , 2005, cond-mat/0501124.

[30]  Yuya Kajikawa,et al.  Filling the gap between researchers studying different materials and different methods: a proposal for structured keywords , 2006, J. Inf. Sci..

[31]  Kevin W. Boyack,et al.  Identifying a better measure of relatedness for mapping science , 2006 .

[32]  Yuya Kajikawa,et al.  Creating an academic landscape of sustainability science: an analysis of the citation network , 2007 .

[33]  Yoshiyuki Takeda,et al.  Detecting emerging research fronts based on topological measures in citation networks of scientific publications , 2008 .

[34]  Kevin W. Boyack,et al.  Toward a consensus map of science , 2009, J. Assoc. Inf. Sci. Technol..

[35]  Y. Kajikawa,et al.  Citation network analysis of organic LEDs , 2009 .

[36]  Ismael Rafols,et al.  A global map of science based on the ISI subject categories , 2009, J. Assoc. Inf. Sci. Technol..

[37]  M. Shao,et al.  Magnetic‐Field Effects in Organic Semiconducting Materials and Devices , 2009 .

[38]  Naoki Shibata,et al.  Comparative study on methods of detecting research fronts using different types of citation , 2009, J. Assoc. Inf. Sci. Technol..

[39]  Yoshiyuki Takeda,et al.  Tracking modularity in citation networks , 2010, Scientometrics.

[40]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.