An Analysis of Citation Recommender Systems: Beyond the Obvious

As science advances, the academic community has published millions of research papers. Researchers devote time and effort to search relevant manuscripts when writing a paper or simply to keep up with current research. In this paper, we consider the problem of citation recommendation by extending a set of known-to-be-relevant references. Our analysis shows the degrees of cited papers in the subgraph induced by the citations of a paper, called projection graph, follow a power law distribution. Existing popular methods are only good at finding the long tail papers, the ones that are highly connected to others. In other words, the majority of cited papers are loosely connected in the projection graph but they are not going to be found by existing methods. To address this problem, we propose to combine author, venue and keyword information to interpret the citation behavior behind those loosely connected papers. Results show that different methods are finding cited papers with widely different properties. We suggest multiple recommended lists by different algorithms could satisfy various users for a real citation recommendation system.

[1]  Jian Pei,et al.  Citation recommendation without author supervision , 2011, WSDM '11.

[2]  Sean M. McNee,et al.  On the recommending of citations for research papers , 2002, CSCW '02.

[3]  John Riedl,et al.  Automatically building research reading lists , 2010, RecSys '10.

[4]  Wenyi Huang,et al.  RefSeer: A citation recommendation system , 2014, IEEE/ACM Joint Conference on Digital Libraries.

[5]  Ümit V. Çatalyürek,et al.  Fast Recommendation on Bibliographic Networks , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[6]  Yang Song,et al.  An Overview of Microsoft Academic Service (MAS) and Applications , 2015, WWW.

[7]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens+ , 2004, JCDL.

[8]  Ümit V. Çatalyürek,et al.  TheAdvisor: a webservice for academic recommendation , 2013, JCDL '13.

[9]  Min-Yen Kan,et al.  Exploiting potential citation papers in scholarly paper recommendation , 2013, JCDL '13.

[10]  Nicholas A. Cumpsty Some Lessons Learned , 2010 .

[11]  Yizhou Sun,et al.  Meta-Path-Based Ranking with Pseudo Relevance Feedback on Heterogeneous Graph for Citation Recommendation , 2014, CIKM.

[12]  Jiawei Han,et al.  Citation Prediction in Heterogeneous Bibliographic Networks , 2012, SDM.

[13]  Kevin W. Boyack,et al.  Co-citation analysis, bibliographic coupling, and direct citation: Which citation approach represents the research front most accurately? , 2010, J. Assoc. Inf. Sci. Technol..

[14]  Cornelia Caragea,et al.  Can't see the forest for the trees?: a citation recommendation system , 2013, JCDL '13.

[15]  Ahmed A. Rafea,et al.  KP-Miner: Participation in SemEval-2 , 2010, *SEMEVAL.

[16]  W. Bruce Croft,et al.  Recommending citations for academic papers , 2007, SIGIR.

[17]  Ümit V. Çatalyürek,et al.  Towards a personalized, scalable, and exploratory academic recommendation service , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[18]  Wenyi Huang,et al.  Recommending citations: translating papers into references , 2012, CIKM.

[19]  Bela Gipp,et al.  Research-paper recommender systems: a literature survey , 2015, International Journal on Digital Libraries.

[20]  Carlos Guestrin,et al.  Beyond keyword search: discovering relevant scientific literature , 2011, KDD.

[21]  Ümit V. Çatalyürek,et al.  Direction Awareness in Citation Recommendation , 2012 .

[22]  Hongfei Yan,et al.  Recommending citations with translation model , 2011, CIKM '11.

[23]  Michael Ley,et al.  DBLP - Some Lessons Learned , 2009, Proc. VLDB Endow..

[24]  Ümit V. Çatalyürek,et al.  Diversified recommendation on graphs: pitfalls, measures, and algorithms , 2013, WWW.

[25]  Yizhou Sun,et al.  Full-text based context-rich heterogeneous network mining approach for citation recommendation , 2014, IEEE/ACM Joint Conference on Digital Libraries.

[26]  Chong Wang,et al.  Collaborative topic modeling for recommending scientific articles , 2011, KDD.

[27]  Min-Yen Kan,et al.  Scholarly paper recommendation via user's recent research interests , 2010, JCDL '10.

[28]  Jiawei Han,et al.  ClusCite: effective citation recommendation by information network-based clustering , 2014, KDD.

[29]  Daniel Kifer,et al.  Context-aware citation recommendation , 2010, WWW '10.

[30]  Theodoros Lappas,et al.  SOFIA SEARCH: a tool for automating related-work search , 2012, SIGMOD Conference.

[31]  Marco Gori,et al.  Recommender Systems : A Random-Walk Based Approach , 2006 .

[32]  Jure Leskovec,et al.  Citing for high impact , 2010, JCDL '10.

[33]  Sean M. McNee,et al.  Enhancing digital libraries with TechLens , 2004, Proceedings of the 2004 Joint ACM/IEEE Conference on Digital Libraries, 2004..