Citation Block Determination Using Textual Coherence

This work has been funded in part by the Microsoft 2008 WEBSCALE grant, as well as by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Interior National Business Center (DoI/NBC) contract number D11PC20153. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of IARPA, DoI/NBC, or the U.S. Government.

[1]  J. Bateman,et al.  Coherence relations: Towards a general specification , 1997 .

[2]  J. Laurie Snell,et al.  Markov Random Fields and Their Applications , 1980 .

[3]  Dragomir R. Radev,et al.  Identifying Non-Explicit Citing Sentences for Citation-Based Summarization. , 2010, ACL.

[4]  Patrick Pantel,et al.  Inducing Ontological Co-occurrence Vectors , 2005, ACL.

[5]  P. Shrestha,et al.  Corpus-Based methods for Short Text Similarity , 2011, JEPTALNRECITAL.

[6]  Soo-Min Kim,et al.  Automatically Assessing Review Helpfulness , 2006, EMNLP.

[7]  Mirella Lapata,et al.  Modeling Local Coherence: An Entity-Based Approach , 2005, ACL.

[8]  Weifeng Liu,et al.  Adaptive and Learning Systems for Signal Processing, Communication, and Control , 2010 .

[9]  Marti A. Hearst TextTiling: A Quantitative Approach to Discourse , 1993 .

[10]  Zoubin Ghahramani,et al.  The infinite HMM for unsupervised PoS tagging , 2009, EMNLP.

[11]  M. M. Kessler Bibliographic coupling between scientific papers , 1963 .

[12]  Henry G. Small,et al.  Co-citation in the scientific literature: A new measure of the relationship between two documents , 1973, J. Am. Soc. Inf. Sci..

[13]  Andrew McCallum,et al.  FACTORIE: Probabilistic Programming via Imperatively Defined Factor Graphs , 2009, NIPS.

[14]  Daniel Marcu,et al.  The rhetorical parsing of unrestricted texts: a surface-based approach , 2000, CL.

[15]  Jochen Hollmann An Evaluation of Documen tP refetching in a Distributed Digital Library , 2003 .

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Scott Weinstein,et al.  Centering: A Framework for Modeling the Local Coherence of Discourse , 1995, CL.

[18]  Yannick Versley,et al.  BART: A Modular Toolkit for Coreference Resolution , 2008, ACL.

[19]  John O'Connor,et al.  Citing statements: Computer recognition and use to improve retrieval , 1982, Inf. Process. Manag..

[20]  Simone Teufel,et al.  Resolving Coreferent and Associative Noun Phrases in Scientific Text , 2014, EACL.

[21]  Simone Teufel,et al.  Detection of Implicit Citations for Sentiment Detection , 2012, ACL 2012.

[22]  Chris Mellish,et al.  Beyond Elaboration: The Interaction of Relations and Focus in Coherent Text , 2000 .

[23]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[24]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[25]  Melvin Weinatoek Citation Indexes , .

[26]  Dan Roth,et al.  Understanding the Value of Features for Coreference Resolution , 2008, EMNLP.

[27]  Renata Vieira,et al.  A Corpus-based Investigation of Definite Description Use , 1997, CL.

[28]  Dain Kaplan,et al.  Sighting Citation Sites — A Collective-Intelligence Approach for Automatic Summarization of Research Papers using C-Sites — , 2008 .

[29]  Marti A. Hearst,et al.  Citances: Citation Sentences for Semantic Analysis of Bioscience Text , 2004 .

[30]  J. Ziman Information, Communication, Knowledge , 1969, Nature.

[31]  Dragomir R. Radev,et al.  Scientific Paper Summarization Using Citation Summary Networks , 2008, COLING.

[32]  Mirella Lapata,et al.  Modeling Local Coherence: An Entity-Based Approach , 2005, ACL.

[33]  Michael Halliday,et al.  Cohesion in English , 1976 .

[34]  C. Lee Giles,et al.  CiteSeer: an automatic citation indexing system , 1998, DL '98.

[35]  Richárd Farkas,et al.  Data-driven Multilingual Coreference Resolution using Resolver Stacking , 2012, EMNLP-CoNLL Shared Task.

[36]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[37]  Matthew Hurst,et al.  A Language Model Approach to Keyphrase Extraction , 2003, ACL 2003.

[38]  J. E. Hirsch,et al.  An index to quantify an individual's scientific research output , 2005, Proc. Natl. Acad. Sci. USA.

[39]  Simone Teufel,et al.  How to Find Better Index Terms Through Citations , 2006 .

[40]  Manabu Okumura,et al.  Towards Multi-paper Summarization Using Reference Information , 1999, IJCAI.

[41]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[42]  Hwee Tou Ng,et al.  A PDTB-styled end-to-end discourse parser , 2012, Natural Language Engineering.

[43]  Mirella Lapata,et al.  Automatic Evaluation of Text Coherence: Models and Representations , 2005, IJCAI.

[44]  Carlo Strapparava,et al.  Corpus-based and Knowledge-based Measures of Text Semantic Similarity , 2006, AAAI.

[45]  Jean Carletta,et al.  An annotation scheme for discourse-level argumentation in research articles , 1999, EACL.

[46]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[47]  H. D. White Citation Analysis and Discourse Analysis Revisited. , 2004 .

[48]  Noah A. Smith,et al.  Rich Source-Side Context for Statistical Machine Translation , 2008, WMT@ACL.

[49]  Jerry R. Hobbs Coherence and Coreference , 1979, Cogn. Sci..

[50]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[51]  John O'Connor Biomedical citing statements: Computer recognition and use to aid full-text retrieval , 1983, Inf. Process. Manag..

[52]  Xiaorong Huang,et al.  Planning Argumentative Texts , 1994, COLING.

[53]  Dragomir R. Radev,et al.  Introduction to the Special Issue on Summarization , 2002, CL.

[54]  William C. Mann,et al.  Rhetorical Structure Theory: A Framework for the Analysis of Texts , 1987 .

[55]  Dain Kaplan,et al.  Automatic Extraction of Citation Contexts for Research Paper Summarization: A Coreference-chain based Approach , 2009 .

[56]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[57]  Dragomir R. Radev,et al.  Blind men and elephants: What do citation summaries tell us about a research article? , 2008 .

[58]  Eugene Garfield,et al.  THE USE OF CITATION DATA IN WRITING THE HISTORY OF SCIENCE , 1964 .

[59]  Andy Lauriston,et al.  Criteria for Measuring Term Recognition , 1995, EACL.

[60]  Simone Teufel,et al.  Towards Domain-Independent Argumentative Zoning: Evidence from Chemistry and Computational Linguistics , 2009, EMNLP.

[61]  James V. Candy,et al.  Adaptive and Learning Systems for Signal Processing, Communications, and Control , 2006 .

[62]  Awais Athar,et al.  Sentiment Analysis of Citations using Sentence Structure-Based Features , 2011, ACL.

[63]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[64]  Graeme Hirst,et al.  Computing Word-Pair Antonymy , 2008, EMNLP.

[65]  Shannon Bradshaw,et al.  Reference Directed Indexing: Redeeming Relevance for Subject Search in Citation Indexes , 2003, ECDL.

[66]  Paul Deane,et al.  A Nonparametric Method for Extraction of Candidate Phrasal Terms , 2005, ACL.

[67]  Simone Teufel,et al.  Automatic classification of citation function , 2006, EMNLP.

[68]  Dekang Lin,et al.  Creating Robust Supervised Classifiers via Web-Scale N-Gram Data , 2010, ACL.

[69]  E GARFIELD,et al.  Citation indexes for science; a new dimension in documentation through association of ideas. , 2006, Science.

[70]  Ying Zhang,et al.  Automatic Acquisition of Chinese-English Parallel Corpus from the Web , 2006, ECIR.

[71]  Noriko Kando,et al.  Classification of research papers using citation links and citation types: Towards automatic review article generation. , 2011 .