How Related is Author Topical Similarity to Other Author Relatedness Measures?

Using a dataset of 26,228 Psychology document surrogates from Elsevier databases, we compare author relatedness measure outcomes for 125 authors based on topic modelling to more traditional approaches that rely on direct citation, co-citation and collaboration. Outcomes for the author topical similarity measure are compared to existing co-authorships in the dataset using UCINET/NetDraw. We demonstrate how author topical similarity outcomes provide a similar, but more complete, picture of author relationships than the co-authorship network. Nonparametric correlation analysis results of author topical similarity, co-authorship, citation, and co-citation were also compared for thirty author pairs of differing author topical similarity values. There is a significant correlation between author topical similarity and co-authorship and direct citation-based measures for high similarity author pairs, but not with co-citation measures. The author topical similarity measure, therefore, may serve as a reasonable predictor of collaboration or direct citation for authors with high topical similarity. The measure may also identify potential collaborators based on high author pair similarity values, where there is a lack of existing collaboration, and serve as a complement to author relatedness based on co-citation analysis. Conference Topic Methods and techniques

[1]  Chaomei Chen,et al.  Visualizing knowledge domains , 2005, Annu. Rev. Inf. Sci. Technol..

[2]  Andreas Strotmann,et al.  Evolution of research activities and intellectual influences in information science 1996-2005: Introducing author bibliographic-coupling analysis , 2008, J. Assoc. Inf. Sci. Technol..

[3]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[4]  Jie Tang,et al.  ArnetMiner: extraction and mining of academic social networks , 2008, KDD.

[5]  Dietmar Wolfram,et al.  Exploring Author Similarity Using Citing Discipline Analysis , 2016 .

[6]  Andreas Strotmann,et al.  The knowledge base and research front of information science 2006–2010: An author cocitation and bibliographic coupling analysis , 2014, J. Assoc. Inf. Sci. Technol..

[7]  Kun Lu,et al.  Measuring author research relatedness: A comparison of word-based, topic-based, and author cocitation approaches , 2012, J. Assoc. Inf. Sci. Technol..

[8]  John D. Lafferty,et al.  Correlated Topic Models , 2005, NIPS.

[9]  Ruoming Jin,et al.  A Topic Modeling Approach and Its Integration into the Random Walk Framework for Academic Search , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[10]  Erjia Yan,et al.  Predicting and recommending collaborations: An author-, institution-, and country-level analysis , 2014, J. Informetrics.

[11]  Thomas L. Griffiths,et al.  Learning author-topic models from text corpora , 2010, TOIS.

[12]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[13]  ZhaoDangzhi,et al.  The knowledge base and research front of information science 2006-2010 , 2014 .

[14]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[15]  Quentin Pleple,et al.  Interactive Topic Modeling , 2013 .

[16]  Katherine W. McCain,et al.  Visualizing a discipline: an author co-citation analysis of information science, 1972–1995 , 1998 .

[17]  Min Song,et al.  Content-based author co-citation analysis , 2014, J. Informetrics.

[18]  J. Law,et al.  Mapping acidification research: A test of the co-word method , 1992, Scientometrics.

[19]  Kevin W. Boyack,et al.  Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? , 2010 .

[20]  W. Glänzel,et al.  Analysing Scientific Networks Through Co-Authorship , 2004 .