Scientific Paper Extractive Summarization Enhanced by Citation Graphs

In a citation graph, adjacent paper nodes share related scientific terms and topics. The graph thus conveys structural information about document-level relatedness that can be used in paper summarization to look beyond intra-document information. In this work, we focus on leveraging citation graphs to improve scientific paper extractive summarization under different settings. We first propose a Multi-granularity Unsupervised Summarization model (MUS) as a simple and low-cost solution to the task. MUS fine-tunes a pre-trained encoder model on the citation graph through link prediction tasks. Then, the abstract sentences are extracted from the corresponding paper by considering multi-granularity information. Preliminary results demonstrate that the citation graph is helpful even in a simple unsupervised framework. Motivated by this, we next propose a Graph-based Supervised Summarization model (GSS) to achieve more accurate results on the task when large-scale labeled data are available. Apart from employing link prediction as an auxiliary task, GSS introduces a gated sentence encoder and a graph information fusion module that use the graph information to refine the sentence representations. Experiments on a public benchmark dataset show that MUS and GSS bring substantial improvements over the prior state-of-the-art model.
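
The abstract does not spell out how MUS scores sentences, so the following is only an illustrative sketch of one way an unsupervised extractor might combine two granularities: similarity of each sentence to its own document, and similarity to the document's citation-graph neighbors. The function names, the mean-pooled document vector, the weight `alpha`, and the toy embeddings are all assumptions made for illustration; in practice the vectors would come from an encoder fine-tuned with link prediction.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]

def rank_sentences(sent_vecs, neighbor_doc_vecs, alpha=0.5, k=2):
    """Return indices (in document order) of the k highest-scoring sentences.

    Hypothetical scoring, not the paper's formula:
      score = alpha * sim(sentence, own document)
            + (1 - alpha) * sim(sentence, mean of cited/citing documents)
    """
    doc_vec = mean_vector(sent_vecs)  # assumed document representation
    # Fall back to the document itself when the paper has no graph neighbors.
    graph_vec = mean_vector(neighbor_doc_vecs) if neighbor_doc_vecs else doc_vec
    scores = [
        alpha * cosine(s, doc_vec) + (1 - alpha) * cosine(s, graph_vec)
        for s in sent_vecs
    ]
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return sorted(top)  # restore original sentence order for the summary

# Toy 2-dimensional embeddings standing in for encoder outputs.
sentences = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
neighbors = [[1.0, 0.2]]
selected = rank_sentences(sentences, neighbors, alpha=0.5, k=2)
```

Setting `alpha=1.0` in this sketch ignores the graph entirely, which gives a simple baseline against which the contribution of the neighbor term can be compared.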
