Combining Co-citation and Metadata for Recommending More Related Papers

Co-citation is one of the preeminent approaches for finding related research articles. The originally proposed technique relied solely on bibliographic information, but the state of the art in this approach relies primarily on full-text analysis of articles. The limited availability of full text restricts the applicability of these approaches. This research therefore used the metadata and bibliographic information of research articles, which are openly available. It explored the hypothesis that traditional co-citation might perform better when combined with metadata relatedness. Similarity scores for different metadata fields (such as title, author and keyword) were calculated and combined with the traditional co-citation relevancy score. The proposed approach was rigorously tested on a benchmark dataset of 1240 articles from diverse fields of science. The experimental results show an improvement of 25 percent when co-citation was combined with the metadata relevancy score. Further, an interactive visualization was created to display the documents resulting from the co-citation plus metadata analysis.
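The combination step can be illustrated with a minimal sketch. The abstract does not state the similarity measure or the combination rule, so everything below is an assumption made for illustration: Jaccard overlap per metadata field, an unweighted mean over fields, and a weighted sum with a hypothetical weight `alpha`. The function names and the record layout are likewise hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def cocitation_counts(citing_to_cited):
    """Count, for each unordered pair of papers, how many papers cite both."""
    counts = defaultdict(int)
    for cited in citing_to_cited.values():
        # Sort so that each pair is always stored as (smaller, larger).
        for a, b in combinations(sorted(set(cited)), 2):
            counts[(a, b)] += 1
    return counts


def jaccard(a, b):
    """Jaccard overlap of two token collections (assumed measure)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def metadata_similarity(p, q, fields=("title", "authors", "keywords")):
    """Unweighted mean of per-field Jaccard scores (assumed aggregation)."""
    return sum(jaccard(p[f], q[f]) for f in fields) / len(fields)


def combined_score(cocit, max_cocit, meta_sim, alpha=0.5):
    """Weighted sum of normalized co-citation count and metadata similarity.

    alpha is a hypothetical mixing weight, not a value from the paper.
    """
    cocit_norm = cocit / max_cocit if max_cocit else 0.0
    return alpha * cocit_norm + (1 - alpha) * meta_sim
```

Under these assumptions, candidate papers for a given article would be ranked by `combined_score`, so that a pair with few co-citations can still rank highly when its titles, authors and keywords overlap strongly.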
