Make Lead Bias in Your Favor: A Simple and Effective Method for News Summarization

Lead bias is a common phenomenon in news summarization, where the early parts of an article often contain the most salient information. While many algorithms exploit this fact in summary generation, it hinders a model from learning to discriminate and extract important information. We propose that lead bias can be leveraged in our favor in a simple and effective way: pretraining abstractive news summarization models on a large-scale unlabeled corpus by predicting the leading sentences from the rest of an article. With careful data cleaning and filtering, our transformer-based pretrained model achieves remarkable results on various news summarization tasks without any finetuning. With further finetuning, our model outperforms many competitive baseline models. Human evaluations further confirm the effectiveness of our method.
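As a rough illustration of the pretraining setup described above, the sketch below constructs (source, target) pairs from raw news articles: the leading sentences act as a pseudo-summary target and the remainder of the article as the model input. The naive sentence splitter, the choice of LEAD_K, and the word-overlap filter are hypothetical stand-ins for the paper's actual data cleaning and filtering pipeline.

```python
import re

# Illustrative settings; these are assumptions, not the paper's exact values.
LEAD_K = 3          # number of leading sentences used as the pseudo-summary target
MIN_OVERLAP = 0.3   # drop pairs where the lead shares too few words with the body
MAX_OVERLAP = 0.9   # drop pairs where the lead is near-verbatim repeated in the body

def split_sentences(article: str) -> list[str]:
    # Naive sentence splitter; a real pipeline would use a proper tokenizer.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", article) if s.strip()]

def make_pretraining_pair(article: str):
    """Return a (source, target) pair for lead-bias pretraining, or None if filtered out."""
    sents = split_sentences(article)
    if len(sents) <= LEAD_K:
        return None  # too short to form a meaningful (source, target) pair

    target = " ".join(sents[:LEAD_K])   # leading sentences serve as the summary target
    source = " ".join(sents[LEAD_K:])   # rest of the article is the input

    # Simple word-overlap filter as a stand-in for "careful data cleaning and filtering".
    target_words = set(target.lower().split())
    source_words = set(source.lower().split())
    overlap = len(target_words & source_words) / max(len(target_words), 1)
    if not (MIN_OVERLAP <= overlap <= MAX_OVERLAP):
        return None
    return source, target
```

In this sketch, the resulting pairs would be fed to a standard sequence-to-sequence transformer, so the same model can later be finetuned on labeled summarization data.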
