Unsupervised query-focused multi-document summarization based on transfer learning from sentence embedding models, BM25 model, and maximal marginal relevance criterion

Extractive query-focused multi-document summarization (QF-MDS) is the task of automatically generating an informative summary from a collection of documents that answers a given query. The representation of sentences and queries is fundamental to the effectiveness of many QF-MDS methods. Transfer learning using pre-trained word embedding models has shown promising performance in many applications. However, most of these representations do not consider word order or the semantic relationships between words in a sentence, and thus they do not capture the meaning of a full sentence. To address this issue, we propose to leverage transfer learning from pre-trained sentence embedding models to represent documents’ sentences and users’ queries as embedding vectors that capture the semantic and syntactic relationships between their constituents (words, phrases). Furthermore, the BM25 score and a semantic similarity function are linearly combined to retrieve a subset of sentences based on their relevance to the query. Finally, the maximal marginal relevance criterion is applied to re-rank the selected sentences, maintaining query relevance while minimizing redundancy. The proposed method is unsupervised, simple, and efficient, and requires no labeled text summarization training data. Experiments are conducted on three standard datasets from the DUC evaluation campaign (DUC’2005–2007). Overall, the results show that our method outperforms several state-of-the-art systems and achieves results comparable to those of the best-performing systems, including supervised deep learning-based methods.
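The pipeline summarized above (sentence representation, a linear combination of BM25 and semantic similarity, then maximal marginal relevance re-ranking) can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several pieces are assumptions made only for illustration: `embed` is a bag-of-words stand-in for a pre-trained sentence embedding model, the mixing weight `alpha`, the MMR trade-off `lam`, the BM25 parameters `k1` and `b`, and the min-max normalization of BM25 scores are hypothetical choices that the abstract does not specify.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer; a real system would use a proper one.
    return text.lower().split()


def bm25_scores(query, sentences, k1=1.2, b=0.75):
    """Okapi BM25 score of every sentence against the query."""
    docs = [tokenize(s) for s in sentences]
    n = len(docs)
    avg_len = sum(len(d) for d in docs) / n or 1.0
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in set(tokenize(query)):
            if tf[term] == 0:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            denom = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scores.append(score)
    return scores


def embed(text):
    # ASSUMPTION: bag-of-words counts stand in for a pre-trained
    # sentence encoder, which would return a dense vector instead.
    return Counter(tokenize(text))


def cosine(u, v):
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def min_max(values):
    lo, hi = min(values), max(values)
    return [(v - lo) / (hi - lo) if hi > lo else 0.0 for v in values]


def relevance(query, sentences, alpha=0.5):
    """Linear combination of normalized BM25 and semantic similarity."""
    lexical = min_max(bm25_scores(query, sentences))
    q_vec = embed(query)
    semantic = [cosine(q_vec, embed(s)) for s in sentences]
    return [alpha * l + (1 - alpha) * s for l, s in zip(lexical, semantic)]


def mmr_select(query, sentences, k=2, lam=0.7, alpha=0.5):
    """Greedy MMR: trade query relevance against redundancy."""
    rel = relevance(query, sentences, alpha)
    vecs = [embed(s) for s in sentences]
    selected, candidates = [], list(range(len(sentences)))
    while candidates and len(selected) < k:
        def score(i):
            redundancy = max(
                (cosine(vecs[i], vecs[j]) for j in selected), default=0.0
            )
            return lam * rel[i] - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return [sentences[i] for i in selected]


sentences = [
    "solar panels convert sunlight into electricity",
    "solar panels convert sunlight into electricity",  # repeated across documents
    "wind turbines generate electricity from moving air",
    "the committee met on tuesday",
]
query = "how do solar panels generate electricity"
summary = mmr_select(query, sentences, k=2, lam=0.5)
```

With `lam=0.5`, the repeated sentence is penalized for its redundancy with the first pick, so the second slot goes to the wind-turbine sentence; with `lam=1.0` the redundancy term vanishes and the duplicate would be selected instead. Swapping `embed` for a real sentence encoder only requires replacing `cosine` with a dense-vector version.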
