A New Approach for Twitter Event Summarization Based on Sentence Identification and Partial Textual Entailment

The growing use of Twitter to propagate information about real-time events has made the platform increasingly popular relative to other online communication media. This trend creates a need to understand real-time events quickly and precisely by summarizing all the relevant tweets. In this paper, we propose a two-phase summarization approach that produces an abstract summary of any Twitter event. The approach first extracts key sentences from the whole set of event-relevant tweets and eliminates as much redundant information as possible by exploring the Partial Textual Entailment (PTE) relation between sentences. Next, it generates an abstract summary from the least redundant key sentences. We conduct experiments to evaluate the performance of the proposed approach and report that it outperforms both the baseline approach and a state-of-the-art event summarization approach.
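
The redundancy-elimination step of the first phase can be illustrated with a minimal sketch. The abstract does not say how PTE is computed, so the sketch substitutes a simple token-coverage score as a crude stand-in for partial entailment; it is not the method evaluated in the paper. The function names (`tokens`, `coverage`, `drop_redundant`) and the 0.8 threshold are hypothetical choices made only for illustration.

```python
import re


def tokens(sentence):
    """Lowercased word tokens; hashtags and mentions stay as single tokens."""
    return set(re.findall(r"[#@]?\w+", sentence.lower()))


def coverage(hypothesis, text):
    """Fraction of the hypothesis tokens that also occur in the text.

    ASSUMPTION: token overlap is only a rough proxy for partial textual
    entailment, used here because the abstract gives no PTE formula.
    """
    h = tokens(hypothesis)
    if not h:
        return 1.0
    return len(h & tokens(text)) / len(h)


def drop_redundant(sentences, threshold=0.8):
    """Greedy filter: keep a sentence unless a kept sentence largely covers it."""
    kept = []
    # Visit longer sentences first so they absorb their shorter paraphrases.
    for s in sorted(sentences, key=lambda x: len(tokens(x)), reverse=True):
        if all(coverage(s, k) < threshold for k in kept):
            kept.append(s)
    return kept


if __name__ == "__main__":
    key_sentences = [
        "Team A beat Team B 2-1 in the final",
        "Team A beat Team B",
        "Fans celebrate in the streets",
    ]
    for sentence in drop_redundant(key_sentences):
        print(sentence)
```

In this sketch the second sentence is dropped because every one of its tokens already appears in the first, while the third sentence is kept because it shares only two of its five tokens with it. A real PTE recognizer would replace `coverage` with a model that judges whether the meaning of one sentence is partially inferable from another.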
