Word associations in media posts related to disasters — A statistical analysis

The paper aims to analyze the frequency of the posts in case of earthquakes and of the word associations included in such Social Media (SM) posts. Since important posts are shared by users in SM, the purpose was to identify the variation of a number of posts having unique content that occurred over a period of time in Social Media for a particular topic. The present study uses messages generated by the Twitter platform, which had been posted before and after the occurrence of the earthquakes in the areas with important seismic activity, such as Vrancea (24th September 2016), Ussita (30th October 2016), New Zealand (13th November 2016) and Papua (23rd January 2017). For the analysis of the contents of the tweets, the A-priori algorithm was used to extract words associations from these posts, keywords that draw attention to the analyzed earthquake situation.

[1]  Brian Lott,et al.  Survey of Keyword Extraction Techniques , 2012 .

[2]  Clara Pizzuti,et al.  GA-Net: A Genetic Algorithm for Community Detection in Social Networks , 2008, PPSN.

[3]  Heikki Mannila,et al.  Efficient Algorithms for Discovering Association Rules , 1994, KDD Workshop.

[4]  Horia-Nicolai Teodorescu Social signals and the ENR index - noise of searches on SNs with keyword-based logic conditions , 2015, 2015 International Symposium on Signals, Circuits and Systems (ISSCS).

[5]  Anders Hald,et al.  Statistical Theory with Engineering Applications , 1952 .

[6]  Shrikanth S. Narayanan,et al.  A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle , 2012, ACL.

[7]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[8]  Cecilia Bolea,et al.  Language Processes and Related Statistics in the Posts Associated to Disasters on Social Networks , 2016, Int. J. Comput. Commun. Control.

[9]  Gend Lal Prajapati,et al.  The role of Apriori algorithm for finding the association rules in Data mining , 2014, 2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT).

[10]  Kyuseok Shim,et al.  Mining optimized association rules with categorical and numeric attributes , 1998, Proceedings 14th International Conference on Data Engineering.

[11]  Jiawei Han,et al.  Mining Multiple-Level Association Rules in Large Databases , 1999, IEEE Trans. Knowl. Data Eng..

[12]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[13]  I. Evans The selection of class intervals , 1977 .

[14]  John E. Freund,et al.  Probability and statistics for engineers , 1965 .

[15]  Horia-Nicolai Teodorescu Emergency-Related, Social Network Time Series: Description and Analysis , 2016 .

[16]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[17]  Mironela Pirnau,et al.  Analysis of requirements for SN monitoring applications in disasters — a case study , 2016, 2016 8th International Conference on Electronics, Computers and Artificial Intelligence (ECAI).

[18]  Das Amrita,et al.  Mining Association Rules between Sets of Items in Large Databases , 2013 .

[19]  Gerd Stumme,et al.  Mining Minimal Non-redundant Association Rules Using Frequent Closed Itemsets , 2000, Computational Logic.

[20]  Satoshi Kurihara,et al.  Information sharing on Twitter during the 2011 catastrophic earthquake , 2013, WWW.

[21]  Christopher Cheong,et al.  Social Media Data Mining: A Social Network Analysis Of Tweets During The 2010-2011 Australian Floods , 2011, PACIS.

[22]  Jun-Lin Lin,et al.  Mining association rules: anti-skew algorithms , 1998, Proceedings 14th International Conference on Data Engineering.

[23]  Jesús S. Aguilar-Ruiz,et al.  Knowledge discovery from data streams , 2009, Intell. Data Anal..

[24]  Horia-Nicolai Teodorescu,et al.  Using Analytics and Social Media for Monitoring and Mitigation of Social Disasters , 2015 .

[25]  Herbert A. Sturges,et al.  The Choice of a Class Interval , 1926 .