A Hierarchical Network for Abstractive Meeting Summarization with Cross-Domain Pretraining

With the abundance of automatically generated meeting transcripts, meeting summarization is of great interest to both participants and other parties. Traditional methods of summarizing meetings rely on complex multi-step pipelines that make joint optimization intractable. Meanwhile, a number of deep neural models have been proposed for text summarization and dialogue systems, but the semantic structure and style of meeting transcripts differ markedly from those of articles and conversations. In this paper, we propose a novel abstractive summarization network adapted to the meeting scenario. We design a hierarchical structure to accommodate long meeting transcripts and a role vector to capture the differences among speakers. Furthermore, because meeting summary data are scarce, we pretrain the model on large-scale news summarization data. Empirical results show that our model outperforms previous approaches on both automatic metrics and human evaluation; for example, on the ICSI dataset, the ROUGE-1 score increases from 34.66% to 46.28%.
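
To make the architecture concrete, below is a minimal sketch of the two-level encoder the abstract describes: a word-level encoder reads the tokens of each turn, a learned role embedding is added to the resulting turn vector, and a turn-level encoder attends across turns. This is an illustrative assumption written in PyTorch, not the authors' released code; the class name, the mean-pooling choice, and the hyperparameters (`d_model`, `nhead`, `num_layers`) are placeholders.

```python
import torch
import torch.nn as nn

class HierarchicalMeetingEncoder(nn.Module):
    """Two-level encoder: tokens within a turn, then turns across the meeting."""

    def __init__(self, vocab_size, num_roles, d_model=512, nhead=8, num_layers=2):
        super().__init__()
        self.token_emb = nn.Embedding(vocab_size, d_model)
        # One learned vector per speaker role (e.g. project manager, designer).
        self.role_emb = nn.Embedding(num_roles, d_model)
        # Separate layer instances so the two levels do not share weights.
        self.word_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True), num_layers)
        self.turn_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead, batch_first=True), num_layers)

    def forward(self, token_ids, role_ids):
        # token_ids: (num_turns, turn_len) token ids for one meeting
        # role_ids:  (num_turns,) speaker-role id of each turn
        word_states = self.word_encoder(self.token_emb(token_ids))
        # Mean-pool each turn to one vector (an assumed pooling choice),
        # then add the role embedding so identical words spoken by
        # different roles yield different turn representations.
        turn_vecs = word_states.mean(dim=1) + self.role_emb(role_ids)
        # Turn-level attention scales with the number of turns, not tokens,
        # which is how the hierarchy accommodates long transcripts.
        return self.turn_encoder(turn_vecs.unsqueeze(0))

# Toy usage: a 3-turn meeting, 10-token turns, 4 speaker roles.
enc = HierarchicalMeetingEncoder(vocab_size=1000, num_roles=4)
out = enc(torch.randint(0, 1000, (3, 10)), torch.tensor([0, 2, 1]))
print(out.shape)  # torch.Size([1, 3, 512])
```

The key design point the sketch preserves is that the transcript is never flattened into one long token sequence: self-attention runs within turns and then across turn vectors, and the role embedding injects speaker identity at the turn level.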
