WIRE: An Automated Report Generation System using Topical and Temporal Summarization

The demand for a tool for summarizing emerging topics is increasing in modern life since the tool can deliver well-organized information to its users. Even though there are already a number of successful search systems, the system which automatically summarizes and organizes the content of emerging topics is still in its infancy. To fulfill such demand, we introduce an automated report generation system that generates a well-summarized human-readable report for emerging topics. In this report generation system, emerging topics are automatically discovered by a topic model and news articles are indexed by the discovered topics. Then, a topical summary and a timeline summary for each topic is generated by a topical multi-document summarizer and a timeline summarizer respectively. In order to enhance the apprehensibility of the users, the proposed report system provides two report modes. One is Today's Briefing which summarizes five discovered topics of every day, and the other is Full Report which shows a long-term view of each topic with a detailed topical summary and an important event timeline.

[1]  Seong-Bae Park,et al.  Abstractive Sentence Compression with Event Attention , 2019 .

[2]  Mona Attariyan,et al.  Parameter-Efficient Transfer Learning for NLP , 2019, ICML.

[3]  Seong-Bae Park,et al.  WiseReporter: A Korean Report Generation System , 2017, IJCNLP.

[4]  Zhoujun Li,et al.  Exploiting Timelines to Enhance Multi-document Summarization , 2014, ACL.

[5]  Hal Daumé,et al.  Incorporating Lexical Priors into Topic Models , 2012, EACL.

[6]  Mirella Lapata,et al.  Data-to-Text Generation with Content Selection and Planning , 2018, AAAI.

[7]  Alexander J. Smola,et al.  Online Inference for the Infinite Topic-Cluster Model: Storylines from Streaming Text , 2011, AISTATS.

[8]  Fei Liu,et al.  Adapting the Neural Encoder-Decoder Framework from Single to Multi-Document Summarization , 2018, EMNLP.

[9]  Eric P. Xing,et al.  Timeline: A Dynamic Hierarchical Dirichlet Process Model for Recovering Birth/Death and Evolution of Topics in Text Stream , 2010, UAI.

[10]  Mirella Lapata,et al.  Text Summarization with Pretrained Encoders , 2019, EMNLP.

[11]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12]  Barbara Poblete,et al.  On-line relevant anomaly detection in the Twitter stream: an efficient bursty keyword detection model , 2013, ODD '13.

[13]  Hyeyoung Park,et al.  Image Recommendation for Automatic Report Generation using Semantic Similarity , 2019, 2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC).