A Determinantal Point Process Based Novel Sampling Method for Abstractive Text Summarization

In recent years, abstractive text summarization (ATS) research has made considerable progress, largely attributable to two key improvements in end-to-end training: deep neural modeling and likelihood-estimation-based sampling. While modeling has converged on a few de facto, highly capable base models built on the encoder-decoder architecture, novel sampling ideas, such as random-masking classification and generative prediction via unsupervised learning, have also been explored. These ideas aim to improve prior knowledge, particularly language-modeling knowledge for downstream tasks, and have led to notable performance gains in ATS. Several challenges remain, however, such as undesirable word repetition. In this paper, we propose a novel determinantal point process (DPP) based sampling method to address this issue; it can be easily integrated into existing ATS models. Our experiments and subsequent analysis show that models trained with our sampling method reduce undesirable word repetition and improve word coverage while achieving competitive ROUGE scores.
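To make the diversity intuition behind a DPP concrete (the abstract does not specify the authors' procedure, so this is only a generic illustration): a DPP scores a subset of candidates by the determinant of a kernel built from per-item quality and pairwise similarity, so selecting near-duplicate items is penalized. The sketch below shows minimal greedy MAP selection for a DPP L-ensemble in NumPy; the function name, the quality and embedding inputs, and the greedy strategy are illustrative assumptions, not the paper's actual sampling method.

```python
import numpy as np

def dpp_greedy_map(embeddings, quality, k):
    """Greedy MAP inference for a DPP L-ensemble over candidate items.

    L[i, j] = quality[i] * sim(i, j) * quality[j], where sim is the cosine
    similarity of the candidate embeddings; det(L_S) rewards subsets that
    are both high-quality and mutually diverse.
    """
    feats = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    L = np.outer(quality, quality) * (feats @ feats.T)

    selected, remaining = [], list(range(len(quality)))
    for _ in range(k):
        best, best_score = None, -np.inf
        for i in remaining:
            idx = selected + [i]
            # Log-determinant of the enlarged subset; the current subset's
            # log det is the same for every i, so this argmax equals the
            # marginal-gain argmax.
            sign, logdet = np.linalg.slogdet(L[np.ix_(idx, idx)])
            score = logdet if sign > 0 else -np.inf
            if score > best_score:
                best, best_score = i, score
        if best is None:  # kernel became singular; no candidate adds diversity
            break
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy usage: pick 3 diverse, high-quality candidates out of 10 random ones.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(10, 16))
quality = rng.uniform(0.5, 1.0, size=10)
print(dpp_greedy_map(embeddings, quality, k=3))
```

In a summarization setting, the candidates could be tokens, n-grams, or sentences, with quality and similarity derived from the model; how the proposed method actually couples the DPP with training-time sampling is described in the paper itself.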
