Scientific Paper Extractive Summarization Enhanced by Citation Graphs

In a citation graph, adjacent paper nodes share related scientific terms and topics. The graph thus conveys unique structural information about document-level relatedness that can be used in the paper summarization task to look beyond intra-document information. In this work, we focus on leveraging citation graphs to improve scientific paper extractive summarization under different settings. We first propose a Multi-granularity Unsupervised Summarization model (MUS) as a simple and low-cost solution to the task. MUS fine-tunes a pre-trained encoder model on the citation graph through link prediction tasks. Then, the abstract sentences are extracted from the corresponding paper by considering multi-granularity information. Preliminary results demonstrate that the citation graph is helpful even in a simple unsupervised framework. Motivated by this, we next propose a Graph-based Supervised Summarization model (GSS) to achieve more accurate results on the task when large-scale labeled data are available. Apart from employing link prediction as an auxiliary task, GSS introduces a gated sentence encoder and a graph information fusion module that use the graph information to refine the sentence representations. Experiments on a public benchmark dataset show that MUS and GSS bring substantial improvements over the prior state-of-the-art model.
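The link prediction objective mentioned in the abstract can be illustrated with a minimal sketch. Everything here is an assumption for illustration, not the paper's implementation: paper embeddings are plain lists of floats, a pair is scored by the sigmoid of a dot product, and the loss is binary cross-entropy over cited pairs (label 1) and sampled non-cited pairs (label 0). The names `link_prediction_loss`, `paper_a`, `paper_b`, and `paper_c` are hypothetical.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic function; adequate for the small toy scores used here."""
    return 1.0 / (1.0 + math.exp(-x))


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def link_prediction_loss(embeddings, positive_edges, negative_edges):
    """Mean binary cross-entropy over paper pairs.

    positive_edges: pairs joined by a citation (target 1).
    negative_edges: sampled pairs with no citation (target 0).
    Assumed scoring function: sigmoid(dot(e_i, e_j)).
    """
    eps = 1e-12  # guards against log(0)
    total = 0.0
    for i, j in positive_edges:
        p = sigmoid(dot(embeddings[i], embeddings[j]))
        total += -math.log(p + eps)
    for i, j in negative_edges:
        p = sigmoid(dot(embeddings[i], embeddings[j]))
        total += -math.log(1.0 - p + eps)
    return total / (len(positive_edges) + len(negative_edges))


# Toy embeddings: paper_a and paper_b point in similar directions,
# paper_c points away from both.
embeddings = {
    "paper_a": [1.0, 0.5],
    "paper_b": [0.9, 0.6],
    "paper_c": [-1.0, 0.2],
}

# Embeddings that agree with the citation graph give a lower loss ...
loss_consistent = link_prediction_loss(
    embeddings, [("paper_a", "paper_b")], [("paper_a", "paper_c")]
)
# ... than embeddings that contradict it.
loss_inconsistent = link_prediction_loss(
    embeddings, [("paper_a", "paper_c")], [("paper_a", "paper_b")]
)
print(loss_consistent < loss_inconsistent)
```

In a real setup the embeddings would come from the pre-trained encoder and the loss would be minimized by gradient descent, so that papers connected in the citation graph end up with similar representations.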

Xiangliang Zhang | Xin Gao | Xiuying Chen | Mingzhe Li | Shen Gao | Rui Yan
