Classification and Summarization for Informative Tweets

Microblogging websites like twitter, facebook, etc. has become a substantive platform for the people to publicize their feelings, requirements, etc. It allows users to post short messages for their online audience. These messages are the fusion of blogging and minute messaging, consisting of images, videos, or voice notes. We have primarily focused on information provided by microblogging sites for achieving real-time informational data. Microblogging websites are widely used around the globe by people for portraying what has been happening around their normal living. So, data through these sites eventually helps us getting non-manipulated data directly from the user. In this paper, a disaster dataset (Fani Cyclone dataset) is considered, which consists of the tweets related to a Cyclone named "Fani". The tweets are pre-processed and then classified into two categories – informative and non-informative. We have been able to achieve a classification accuracy of 74:268% when pre-processed data is being considered. As we are dealing with disaster dataset, so in the end, we have summarized the informative tweets for the concerned authorities, which would help them to have an overview of the data.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  J. Steinberger,et al.  Using Latent Semantic Analysis in Text Summarization and Summary Evaluation , 2004 .

[3]  Younghwan Namkoong,et al.  Ordinal Classification of Imbalanced Data with Application in Emergency and Disaster Information Services , 2016, IEEE Intelligent Systems.

[4]  Iadh Ounis,et al.  Real-Time Detection, Tracking, and Monitoring of Automatically Discovered Events in Social Media , 2014, ACL.

[5]  Muhammad Imran,et al.  Summarizing Situational Tweets in Crisis Scenario , 2016, HT.

[6]  Kathleen McKeown,et al.  Summarizing Disasters Over Time , 2014 .

[7]  Leysia Palen,et al.  Identifying and Categorizing Disaster-Related Tweets , 2016, SocialNLP@EMNLP.

[8]  Chen Huang,et al.  Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake , 2011, CSCW.

[9]  Pablo Gamallo,et al.  Citius: A Naive-Bayes Strategy for Sentiment Analysis on English Tweets , 2014, *SEMEVAL.

[10]  Anirban Sen,et al.  Extracting situational awareness from microblogs during disaster events , 2015, 2015 7th International Conference on Communication Systems and Networks (COMSNETS).

[11]  Beverly Estephany Parilla-Ferrer,et al.  SVM-Based Domain Adaptation Machine Learned Models for the Automatic Classification of Disaster-Related Tweets , 2022 .

[12]  Gerry Stahl,et al.  Developing Summarization Skills through the Use of LSA-Based Feedback , 2000, Interact. Learn. Environ..

[13]  Niloy Ganguly,et al.  Extracting and Summarizing Situational Information from the Twitter Social Media during Disasters , 2018, ACM Trans. Web.

[14]  Andrei Olariu Efficient Online Summarization of Microblogging Streams , 2014, EACL.

[15]  Furu Wei,et al.  Improving Multi-Document Summarization via Text Classification , 2016, AAAI.

[16]  Niloy Ganguly,et al.  Extracting Situational Information from Microblogs during Disaster Events: a Classification-Summarization Approach , 2015, CIKM.

[17]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[18]  Sarah Vieweg Twitter communications in mass emergency: contributions to situational awareness , 2012, CSCW.

[19]  Qunying Huang,et al.  Geographic Situational Awareness: Mining Tweets for Disaster Preparedness, Emergency Response, Impact, and Recovery , 2015, ISPRS Int. J. Geo Inf..