Automatic Extraction of Citation Contexts for Research Paper Summarization: A Coreference-chain based Approach

This paper proposes a new method based on coreference-chains for extracting citations from research papers. To evaluate our method we created a corpus of citations comprised of citing papers for 4 cited papers. We analyze some phenomena of citations that are present in our corpus, and then evaluate our method against a cue-phrase-based technique. Our method demonstrates higher precision by 7--10%.

[1]  William C. Mann,et al.  Rhetorical Structure Theory: A Framework for the Analysis of Texts , 1987 .

[2]  Thorsten Joachims,et al.  Making large-scale support vector machine learning practical , 1999 .

[3]  Jian-Yun Nie Towards a Unified Approach to CLIR and Multilingual IR ( Position paper ) , 2002 .

[4]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[5]  Kentaro Inui,et al.  Multiple Purpose Annotation using SLAT — Segment and Link-based Annotation Tool — , 2008 .

[6]  Melvin Weinatoek Citation Indexes , .

[7]  Claire Gardent,et al.  Improving Machine Learning Approaches to Coreference Resolution , 2002, ACL.

[8]  Stephen E. Robertson,et al.  Comparing citation contexts for information retrieval , 2008, CIKM '08.

[9]  Eugene Garfield,et al.  THE USE OF CITATION DATA IN WRITING THE HISTORY OF SCIENCE , 1964 .

[10]  Noriko Kando,et al.  Classification of research papers using citation links and citation types: Towards automatic review article generation. , 2011 .

[11]  Ying Zhang,et al.  Mining translations of OOV terms from the web through cross-lingual query expansion , 2005, SIGIR '05.

[12]  Simone Teufel,et al.  How to Find Better Index Terms Through Citations , 2006 .

[13]  Manabu Okumura,et al.  Bilingual PRESRI - Integration of Multiple Research Paper Databases , 2004, RIAO.

[14]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[15]  James V. Candy,et al.  Adaptive and Learning Systems for Signal Processing, Communications, and Control , 2006 .

[16]  Weifeng Liu,et al.  Adaptive and Learning Systems for Signal Processing, Communication, and Control , 2010 .

[17]  Dain Kaplan,et al.  Sighting Citation Sites — A Collective-Intelligence Approach for Automatic Summarization of Research Papers using C-Sites — , 2008 .

[18]  Serge Sharo Creating General-Purpose Corpora Using Automated Search Engine Queries , 2006 .

[19]  John O'Connor,et al.  Citing statements: Computer recognition and use to improve retrieval , 1982, Inf. Process. Manag..

[20]  Douglas E. Appelt,et al.  The (Non)Utility of Predicate-Argument Frequencies for Pronoun Interpretation , 2004, NAACL.

[21]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[22]  Shannon Bradshaw,et al.  Reference Directed Indexing: Redeeming Relevance for Subject Search in Citation Indexes , 2003, ECDL.

[23]  Simone Teufel,et al.  Automatic classification of citation function , 2006, EMNLP.

[24]  Dragomir R. Radev,et al.  Scientific Paper Summarization Using Citation Summary Networks , 2008, COLING.

[25]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[26]  Daniel Marcu,et al.  The rhetorical parsing of unrestricted texts: a surface-based approach , 2000, CL.