Purpose and Polarity of Citation: Towards NLP-based Bibliometrics

Bibliometric measures are commonly used to estimate the popularity and the impact of published research. Existing bibliometric measures provide “quantitative” indicators of how good a published paper is. This does not necessarily reflect the “quality” of the work presented in the paper. For example, when hindex is computed for a researcher, all incoming citations are treated equally, ignoring the fact that some of these citations might be negative. In this paper, we propose using NLP to add a “qualitative” aspect to biblometrics. We analyze the text that accompanies citations in scientific articles (which we term citation context). We propose supervised methods for identifying citation text and analyzing it to determine the purpose (i.e. author intention) and the polarity (i.e. author sentiment) of citation.

[1]  Dragomir R. Radev,et al.  Generating Extractive Summaries of Scientific Paradigms , 2013, J. Artif. Intell. Res..

[2]  Simone Teufel,et al.  Detection of Implicit Citations for Sentiment Detection , 2012, ACL 2012.

[3]  Dragomir R. Radev,et al.  Using Citations to Generate surveys of Scientific Paradigms , 2009, NAACL.

[4]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[5]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[6]  Dragomir R. Radev,et al.  The ACL anthology network corpus , 2009, Language Resources and Evaluation.

[7]  E. Garfield,et al.  Can Citation Indexing Be Automated ? , 1964 .

[8]  Roser Morante,et al.  *SEM 2012 Shared Task: Resolving the Scope and Focus of Negation , 2012, *SEMEVAL.

[9]  Eugene Garfield,et al.  THE USE OF CITATION DATA IN WRITING THE HISTORY OF SCIENCE , 1964 .

[10]  Dragomir R. Radev,et al.  Scientific Paper Summarization Using Citation Summary Networks , 2008, COLING.

[11]  Michael Alan Whidby,et al.  Citation handling: Processing citation texts in scientific documents , 2012 .

[12]  Susan Bonzi,et al.  Characteristics of a Literature as Predictors of Relatedness Between Cited and Citing Works , 2007, J. Am. Soc. Inf. Sci..

[13]  G. Thompson,et al.  Evaluation in the Reporting Verbs Used in Academic Papers. , 1991 .

[14]  I. Spiegel-Rosing Science Studies: Bibliometric and Content Analysis , 1977 .

[15]  H. D. White Citation Analysis and Discourse Analysis Revisited. , 2004 .

[16]  Melvin Weinatoek Citation Indexes , .

[17]  Eugene Garfield,et al.  Citation indexing - its theory and application in science, technology, and humanities , 1979 .

[18]  J. Ziman,et al.  Public knowledge. An essay concerning the social dimension of science , 1970, Medical History.

[19]  Jacob Cohen,et al.  Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit. , 1968 .

[20]  Awais Athar,et al.  Sentiment Analysis of Citations using Sentence Structure-Based Features , 2011, ACL.

[21]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[22]  Manabu Okumura,et al.  Towards Multi-paper Summarization Using Reference Information , 1999, IJCAI.

[23]  Dragomir R. Radev,et al.  Reference Scope Identification in Citing Sentences , 2012, NAACL.

[24]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[25]  Jorge E. Hirsch,et al.  An index to quantify an individual’s scientific research output that takes into account the effect of multiple coauthorship , 2009, Scientometrics.

[26]  Simone Teufel,et al.  Automatic classification of citation function , 2006, EMNLP.

[27]  M. H. MacRoberts,et al.  The Negational Reference: or the Art of Dissembling , 1984 .

[28]  Daryl E. Chubin,et al.  Content Analysis of References: Adjunct or Alternative to Citation Counting? , 1975 .

[29]  M. Moravcsik,et al.  Some Results on the Function and Quality of Citations , 1975 .

[30]  Simone Teufel,et al.  Context-Enhanced Citation Sentiment Detection , 2012, NAACL.

[31]  Dragomir R. Radev,et al.  Coherent Citation-Based Summarization of Scientific Papers , 2011, ACL.

[32]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[33]  Noriko Kando,et al.  Classification of research papers using citation links and citation types: Towards automatic review article generation. , 2011 .

[34]  Tefko Saracevic,et al.  Information science: What is it? , 1968 .

[35]  Dragomir R. Radev,et al.  Identifying Non-Explicit Citing Sentences for Citation-Based Summarization. , 2010, ACL.

[36]  Eugene Garfield,et al.  Is citation analysis a legitimate evaluation tool? , 2005, Scientometrics.

[37]  L. Egghe,et al.  Theory and practise of the g-index , 2006, Scientometrics.

[38]  Douglas Biber,et al.  Variation across speech and writing: Methodology , 1988 .

[39]  Dragomir R. Radev,et al.  Citation Summarization Through Keyphrase Extraction , 2010, COLING.

[40]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .