Global citation recommendation using knowledge graphs

Scholarly search engines, reference management tools, and academic social networks help researchers organize their scientific libraries, and they often recommend publications that might be of interest. Because the volume of publications is growing exponentially, effective citation recommendation is of great importance to researchers: it reduces the time and effort spent on retrieving, understanding, and selecting research papers. In this context, we address the problem of citation recommendation, i.e., the task of recommending citations for a new paper. Current research investigates this task in different settings, including cases where rich user metadata is available (e.g., user profile, publications, citations). This work focuses on a setting where the user provides only the abstract of a new paper as input. Our approach expands the semantic features of the given abstract using knowledge graphs and combines them with other features (e.g., indegree, recency) to fit a learning-to-rank model, which is then used to generate the citation recommendations. Evaluating on real data, we show that the expanded semantic features improve the quality of the recommendations as measured by nDCG@10.
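For readers unfamiliar with the evaluation metric, the following is a minimal sketch of how nDCG@10 can be computed for a single ranked list of recommendations. It assumes binary relevance (1 if a recommended paper is actually cited by the new paper, 0 otherwise) and the common logarithmic discount; the abstract does not state which gain formulation was used, and the function names here are illustrative only.

```python
import math

def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k positions.

    relevances: relevance scores in ranked order (best-ranked first).
    The item at 0-based position i is discounted by log2(i + 2).
    """
    return sum(rel / math.log2(i + 2) for i, rel in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k=10):
    """DCG of the given ranking divided by the DCG of the ideal ordering.

    Assumption: `relevances` covers all candidate papers, so sorting it
    in descending order yields the ideal ranking. Returns 0.0 when no
    candidate is relevant.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0

# Hypothetical ranking: the 1st and 4th recommendations are true citations.
score = ndcg_at_k([1, 0, 0, 1, 0])
```

A ranking that places every truly cited paper at the top scores 1.0, and the score decreases as relevant papers are pushed further down the list.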
