Citation Classification And Its Applications

AbstractCitation analysis has been used to study various aspects of scholarly communication. In general, these studies have not differentiated among the multiple reasons for citations. However, authors cite other works for a number of reasons including demonstrating knowledge of the field, establishing the placement of the citing work in the field, comparing and criticizing other works, and paying homage to seminal work by pioneers in the field. In this paper, we present a number of applications in which distinguishing among authors' motivations for citations might be useful and present a machine learning approach to automatically classifying citations according to these motivations. Our approach to citation classification makes use of the structure and the argumentative nature of the scientific papers. We present the results of experiments we ran on papers in the computer science field. The results are encouraging and give us hope that we can use our citation classifier in analyzing large corpora of scientific papers.

[1]  E. Garfield When to Cite , 1996, The Library Quarterly.

[2]  W. Shadish,et al.  Author Judgements about Works They Cite: Three Studies from Psychology Journals , 1995 .

[3]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[4]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[5]  C. Lee Giles,et al.  Digital Libraries and Autonomous Citation Indexing , 1999, Computer.

[6]  G. Gilbert Referencing as Persuasion , 1977 .

[7]  Claire Cardie,et al.  Noun Phrase Coreference as Clustering , 1999, EMNLP.

[8]  Marc Moens,et al.  Articles Summarizing Scientific Articles: Experiments with Relevance and Rhetorical Status , 2002, CL.

[9]  Leo Pavičić,et al.  Citation Context Versus the Frequency Counts of Citation Histories , 1998, J. Am. Soc. Inf. Sci..

[10]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[11]  Jonathan Furner,et al.  Scholarly communication and bibliometrics , 2005, Annu. Rev. Inf. Sci. Technol..

[12]  Janyce Wiebe,et al.  Tracking Point of View in Narrative , 1994, Comput. Linguistics.

[13]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[14]  H. D. White Citation Analysis and Discourse Analysis Revisited. , 2004 .

[15]  Stephen S. Murray,et al.  The bibliometric properties of article readership information , 2005, J. Assoc. Inf. Sci. Technol..

[16]  A. Leopold,et al.  Games Scientists Play , 1973 .

[17]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[18]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..

[19]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[20]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .