论文信息 - Design and analysis of microblog-based summarization system

Design and analysis of microblog-based summarization system

A daily summary or digest from microblogs allows social media users to stay up to date on what happened today on their favorite topic. Summarizing microblogs is a non-trivial task. This paper presents a summarization system built over the Twitter stream to summarize the topic for a given duration. Tweet ranking is the primary task of designing a microblog-based summarization system. After ranking tweets, the selection of relevant tweets is the crucial task for any summarization system due to the massive volume of tweets in the Twitter stream. In addition, the summarization system should include novel tweets in the summary or digest. The measure of relevance is typically the similarity score obtained from different text similarity algorithms. These measure the similarity between user information needs and each tweet. The more similar, the higher the score. So we need to choose a threshold that can minimize false-positive judgments for this task. In this paper, we proposed novel threshold estimation methods to find optimal values for these thresholds and evaluate them against thresholds determined via grid search. These methods estimate the thresholds with reasonable accuracy, according to the results. Previous research has empirically and heuristically set these thresholds, and our work suggests a method that exploits statistical features of the ranking list to estimate these thresholds. We used language models to rank the tweets and to select relevant tweets. For any language model, the selection of the smoothing technique and its parameters are critical. The results are also compared with the standard probabilistic ranking algorithm BM25. Learning to rank strategies is also implemented, which shows substantial improvement in some of the result metrics. Experiments were performed on standard benchmarks like the TREC Microblog 2015, TREC RTS 2016, and TREC RTS 2017 datasets. Different variants of normal discounted cumulative gain, the standard official evaluation metric of TREC, nDCG-1, nDCG-0, and nDCG-p are used in this study. We also performed a comprehensive failure analysis on our experiments and identified key issues for improvement that can be addressed in the future.

[1] Niloy Ganguly,et al. Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach , 2015, CIKM.

[2] Hui Fang,et al. Silent Day Detection on Microblog Data , 2018, NLDB.

[3] Dragomir R. Radev,et al. Introduction to the Special Issue on Summarization , 2002, CL.

[4] Charles L. A. Clarke,et al. Simple Dynamic Emission Strategies for Microblog Filtering , 2016, SIGIR.

[5] Stanley F. Chen,et al. An empirical study of smoothing techniques for language modeling , 1999 .

[6] Min Yang,et al. MARES: multitask learning algorithm for Web-scale real-time event summarization , 2018, World Wide Web.

[7] Charles L. A. Clarke,et al. An Exploration of Evaluation Metrics for Mobile Push Notifications , 2016, SIGIR.

[8] Michalis Vazirgiannis,et al. An Optimization Approach for Sub-event Detection and Summarization in Twitter , 2018, ECIR.

[9] W. Bruce Croft,et al. Query performance prediction in web search environments , 2007, SIGIR.

[10] Jimmy J. Lin,et al. Online In-Situ Interleaved Evaluation of Real-Time Push Notification Systems , 2017, SIGIR.

[11] Ziyu Lu,et al. Neural Network based Reinforcement Learning for Real-time Pushing on Text Stream , 2017, SIGIR.

[12] Xianzhi Wang,et al. Deep learning for misinformation detection on online social networks: a survey and new perspectives , 2020, Social Network Analysis and Mining.

[13] Hywel T. P. Williams,et al. Good and bad events: combining network-based event detection with sentiment analysis , 2020, Social Network Analysis and Mining.

[14] João Magalhães,et al. Analysis of Subtopic Discovery Algorithms for Real-time Information Summarization , 2018, WWW.

[15] Ahmad Al-Rubaie,et al. National happiness index monitoring using Twitter for bilanguages , 2021, Soc. Netw. Anal. Min..

[16] Tie-Yan Liu. Learning to Rank for Information Retrieval , 2009, Found. Trends Inf. Retr..

[17] Candice L. Lanius,et al. Use of bot and content flags to limit the spread of misinformation among social networks: a behavior and attitude survey , 2021, Social Network Analysis and Mining.

[18] Thomas Mandl,et al. The effect of named entities on effectiveness in cross-language information retrieval evaluation , 2005, SAC '05.

[19] Mohand Boughanem,et al. Optimization framework model for retrospective tweet summarization , 2018, SAC.