A Hybrid Word-Character Approach to Abstractive Summarization

Automatic abstractive text summarization is an important and challenging research topic in natural language processing. Among widely used languages, Chinese has the special property that a single character carries rich information, comparable to that of a word. Existing Chinese text summarization methods adopt either purely character-based or purely word-based representations, and thus fail to fully exploit the information carried by both. To accurately capture the essence of articles, we propose a hybrid word-character approach (HWC) that preserves the advantages of both word-based and character-based representations. We evaluate the proposed HWC approach by applying it to two existing methods, and find that it achieves state-of-the-art performance with a margin of 24 ROUGE points on the widely used LCSTS dataset. In addition, we identify an issue in the LCSTS dataset and offer a script that removes overlapping pairs (a summary and its short text) to produce a clean dataset for the community. The proposed HWC approach also achieves the best performance on the new, clean LCSTS dataset.
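The abstract does not spell out how word and character units are combined, so the following is only a minimal sketch of one common hybrid scheme: segment the text into words, keep in-vocabulary words as single units, and back off to individual characters for everything else, so nothing falls out of vocabulary at the character level. The jieba segmenter, the hybrid_tokenize function, and the word_vocab parameter are illustrative assumptions, not the paper's actual HWC formulation.

```python
# Illustrative sketch only: one plausible hybrid word/character tokenization
# for Chinese text, NOT necessarily the HWC method proposed in the paper.
import jieba  # widely used Chinese word segmenter; assumed dependency


def hybrid_tokenize(text, word_vocab):
    """Segment text into words, keeping known words whole and
    splitting unknown words into individual characters."""
    tokens = []
    for word in jieba.cut(text):
        if word in word_vocab:
            tokens.append(word)        # keep in-vocabulary word as one unit
        else:
            tokens.extend(list(word))  # back off to single characters
    return tokens
```

Under this scheme the word vocabulary can stay small (frequent words only), while rare names and numbers are still representable as character sequences.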

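As a rough illustration of the cleaning step described above, the sketch below drops evaluation pairs that also occur in the training split. The JSON-lines layout, the file names, and the "text"/"summary" field names are assumptions for illustration; this is not the authors' released script.

```python
# Minimal sketch of removing overlapping (short text, summary) pairs from an
# evaluation split when they also appear in the training split. File format
# and field names are assumed for illustration.
import json


def load_pairs(path):
    """Read one JSON object per line, yielding (text, summary) tuples."""
    with open(path, encoding="utf-8") as f:
        for line in f:
            record = json.loads(line)
            yield record["text"], record["summary"]


def remove_overlaps(train_path, eval_path, out_path):
    """Write only those eval pairs that never occur in the training set."""
    train_pairs = set(load_pairs(train_path))
    kept = dropped = 0
    with open(out_path, "w", encoding="utf-8") as out:
        for text, summary in load_pairs(eval_path):
            if (text, summary) in train_pairs:
                dropped += 1  # overlapping pair: exclude from clean split
                continue
            out.write(json.dumps({"text": text, "summary": summary},
                                 ensure_ascii=False) + "\n")
            kept += 1
    print(f"kept {kept} pairs, dropped {dropped} overlapping pairs")


if __name__ == "__main__":
    remove_overlaps("lcsts_train.jsonl", "lcsts_test.jsonl",
                    "lcsts_test_clean.jsonl")
```

This sketch uses exact string matching; catching near-duplicate pairs (e.g., differing only in whitespace or punctuation) would require normalizing the text before comparison.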