论文信息 - End-to-End Segmentation-based News Summarization - 字舞流文

End-to-End Segmentation-based News Summarization

In this paper, we bring a new way of digesting news content by introducing the task of segmenting a news article into multiple sections and generating the corresponding summary to each section. We make two contributions towards this new task. First, we create and make available a dataset, SEGNEWS, consisting of 27k news articles with sections and aligned heading-style section summaries. Second, we propose a novel segmentation-based language generation model adapted from pre-trained language models that can jointly segment a document and produce the summary for each section. Experimental results on SEGNEWS demonstrate that our model can outperform several state-of-the-art sequence-tosequence generation models for this new task.

Michael Zeng | Chenguang Zhu | Yang Liu | Yang Liu | Michael Zeng | Chenguang Zhu

[1] Rolf Ploetzner,et al. What contributes to the split-attention effect? The role of text segmentation, picture labelling, and spatial proximity , 2010 .

[2] Dan Klein,et al. Learning-Based Single-Document Summarization with Compression and Anaphoricity Constraints , 2016, ACL.

[3] Jing Li,et al. SegBot: A Generic Neural Text Segmentation Model with Pointer Network , 2018, IJCAI.

[4] Florian Boudin,et al. TopicRank: Graph-Based Topic Ranking for Keyphrase Extraction , 2013, IJCNLP.

[5] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[6] Ilya Sutskever,et al. Language Models are Unsupervised Multitask Learners , 2019 .

[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[8] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[9] Marti A. Hearst. Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[10] Jason Weston,et al. A Neural Attention Model for Abstractive Sentence Summarization , 2015, EMNLP.

[11] Mirella Lapata,et al. Don’t Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization , 2018, EMNLP.

[12] 悠太菊池,et al. 大規模要約資源としてのNew York Times Annotated Corpus , 2015 .

[13] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[14] Mirella Lapata,et al. Single Document Summarization as Tree Induction , 2019, NAACL.

[15] Johanna D. Moore,et al. Automatic Segmentation of Multiparty Dialogue , 2006, EACL.

[16] M. Maybury,et al. Automatic Summarization , 2002, Computational Linguistics.

[17] Bowen Zhou,et al. Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond , 2016, CoNLL.

[18] Omer Levy,et al. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension , 2019, ACL.

[19] Bowen Zhou,et al. SummaRuNNer: A Recurrent Neural Network Based Sequence Model for Extractive Summarization of Documents , 2016, AAAI.

[20] Freddy Y. Y. Choi. Advances in domain independent linear text segmentation , 2000, ANLP.

[21] Danushka Bollegala,et al. A Sequential Model for Discourse Segmentation , 2010, CICLing.

[22] Mirella Lapata,et al. Text Summarization with Pretrained Encoders , 2019, EMNLP.

[23] Yao Zhao,et al. PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization , 2020, ICML.

[24] 知秀柴田. 5分で分かる!? 有名論文ナナメ読み：Jacob Devlin et al. : BERT : Pre-training of Deep Bidirectional Transformers for Language Understanding , 2020 .

[25] George Kurian,et al. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation , 2016, ArXiv.

[26] S. Reeves,et al. Discourse Analysis , 2018, The Study of Language.

[27] Xu Tan,et al. MASS: Masked Sequence to Sequence Pre-training for Language Generation , 2019, ICML.

[28] Chris Biemann,et al. TopicTiling: A Text Segmentation Algorithm based on LDA , 2012, ACL 2012.

[29] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[30] Chris D. Paice,et al. Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[31] Marti A. Hearst. Multi-Paragraph Segmentation Expository Text , 1994, ACL.

[32] Mirella Lapata,et al. Neural Summarization by Extracting Sentences and Words , 2016, ACL.

[33] Jianfeng Gao,et al. UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training , 2020, ICML.

[34] John D. Lafferty,et al. Statistical Models for Text Segmentation , 1999, Machine Learning.

[35] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[36] Marti A. Hearst,et al. A Critique and Improvement of an Evaluation Metric for Text Segmentation , 2002, CL.

[37] Xueqi Cheng,et al. Outline Generation: Understanding the Inherent Content Structure of Documents , 2019, SIGIR.

[38] Eric Fosler-Lussier,et al. Discourse Segmentation of Multi-Party Conversation , 2003, ACL.

[39] Dragomir R. Radev,et al. Introduction to the Special Issue on Summarization , 2002, CL.