Can't see the forest for the trees?: a citation recommendation system

Scientists face growing challenges from the ever-increasing volume of information produced worldwide over recent decades. When writing a paper, an author searches for the most relevant citations: those that initiated or laid the foundation of a particular topic and are therefore likely to explain the reasoning or algorithms employed. This search is usually carried out by submitting specific keywords to literature search engines such as Google Scholar and CiteSeer. However, finding relevant citations is distinct from retrieving articles that are merely topically similar to an author's proposal. In this paper, we address the problem of citation recommendation using a singular value decomposition approach. The models are trained and evaluated on the CiteSeer digital library. Our experimental results show that the proposed approach compares favorably with collaborative filtering methods on the citation recommendation task.
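To make the general idea concrete, the following is a minimal sketch of how a truncated singular value decomposition can score candidate citations. It is not the paper's actual model: the toy matrix, the function name `recommend_citations`, and the parameters `rank` and `top_n` are all assumptions introduced for illustration. Rows are taken to be citing papers and columns to be candidate cited papers.

```python
import numpy as np

def recommend_citations(matrix, paper, rank=2, top_n=3):
    """Rank candidate citations for one citing paper (hypothetical helper).

    matrix[i, j] is 1.0 if citing paper i cites candidate j, else 0.0.
    """
    # Thin SVD: matrix = u @ diag(s) @ vt
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    k = min(rank, len(s))
    # Low-rank reconstruction smooths the matrix, giving nonzero
    # scores to citations the paper has not made yet.
    approx = (u[:, :k] * s[:k]) @ vt[:k, :]
    scores = approx[paper].copy()
    # Never recommend something the paper already cites.
    scores[matrix[paper] > 0] = -np.inf
    order = np.argsort(-scores)
    return [int(j) for j in order[:top_n] if np.isfinite(scores[j])]

# Toy data (assumed, not from CiteSeer): 4 citing papers, 4 candidates.
citations = np.array([
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
], dtype=float)

suggested = recommend_citations(citations, paper=0, rank=2, top_n=2)
print(suggested)
```

In this sketch the choice of `rank` controls how much the reconstruction generalizes: a small rank groups papers with similar citing behavior, while a rank equal to the full matrix rank reproduces the input and yields no useful new scores.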
