Discourse-Aware Neural Extractive Text Summarization

Recently, BERT has been adopted for document encoding in state-of-the-art text summarization models. However, sentence-level extractive models often include redundant or uninformative phrases in the extracted summaries. In addition, BERT is pre-trained on sentence pairs rather than full documents, so it does not capture long-range dependencies across a document well. To address these issues, we present DiscoBERT, a discourse-aware neural summarization model. DiscoBERT extracts sub-sentential discourse units (instead of sentences) as candidates for extractive selection, operating at a finer granularity. To capture long-range dependencies among discourse units, structural discourse graphs are constructed from RST trees and coreference mentions and encoded with Graph Convolutional Networks. Experiments show that DiscoBERT outperforms state-of-the-art BERT-base models by a significant margin on popular summarization benchmarks.
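To make the graph-encoding step concrete, below is a minimal sketch of a graph convolution over discourse units: EDU encodings are node features, edges come from the RST tree and coreference links, and one GCN layer mixes each unit's representation with its neighbors'. All names, shapes, and edges here are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def gcn_layer(H, A, W):
    """One GCN layer: H' = ReLU(D^-1/2 (A + I) D^-1/2 H W)."""
    A_hat = A + np.eye(A.shape[0])             # add self-loops
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))     # symmetric normalization
    return np.maximum(0.0, D_inv_sqrt @ A_hat @ D_inv_sqrt @ H @ W)

rng = np.random.default_rng(0)
num_edus, dim = 5, 8
H = rng.normal(size=(num_edus, dim))           # EDU encodings (e.g. from BERT)

# Toy discourse graph: edges 0-1 and 1-2 from an RST tree,
# edge 0-4 from a coreference mention (hypothetical example).
A = np.zeros((num_edus, num_edus))
for i, j in [(0, 1), (1, 2), (0, 4)]:
    A[i, j] = A[j, i] = 1.0

W = rng.normal(size=(dim, dim))                # learnable layer weights
H_out = gcn_layer(H, A, W)                     # each EDU now sees its neighbors
```

In the full model, the updated EDU representations would feed a scoring head that selects which discourse units to extract; stacking more layers widens each unit's receptive field over the graph.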
