Actionable pattern discovery for Sentiment Analysis on Twitter Data in clustered environment

Actionable Patterns are desired knowledge to be mined from large datasets. Action Rules are vital data mining method for gaining actionable knowledge from the datasets. They recommend actions which users can undertake to their advantage, or to accomplish their goal. Meta actions are the sub-actions to the Action Rules, which intends to change the attribute value of an object, under consideration, to attain the desirable value. The essence of this paper is to propose a new optimized and more promising system, in terms of speed and efficiency, for generating meta-actions by implementing Specific Action Rule discovery based on Grabbing strategy (SARGS) algorithm, and to apply that for Sentiment Analysis on Twitter data. We perform a comparative analysis of meta-actions generating algorithmic implementation in Apache Spark driven system, conventional Hadoop driven system and Single node machine using the Twitter social networking data and evaluate the results. We implement corpus based Sentimental Analysis of social networking data, and test the total time taken by the systems and their sub components for the data processing. Results show faster computational time for Spark system compared to Hadoop MapReduce and Single node machine for the meta-action generation methods.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  Angelina A. Tzacheva,et al.  Rule Based Systems in a Distributed Environment: Survey , 2017 .

[3]  Lei Yu,et al.  A Hadoop MapReduce Performance Prediction Method , 2013, 2013 IEEE 10th International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing.

[4]  Jerzy W. Grzymala-Busse,et al.  An Empirical Comparison of Rule Sets Induced by LERS and Probabilistic Rough Classification , 2010, RSCTC.

[5]  Zbigniew W. Ras,et al.  Discovering Extended Action-Rules (System DEAR) , 2003, IIS.

[6]  Zbigniew W. Ras,et al.  Action Rules Discovery, a New Simplified Strategy , 2006, ISMIS.

[7]  Angelina A. Tzacheva,et al.  Support confidence and utility of action rules triggered by meta-actions , 2016, 2016 IEEE International Conference on Knowledge Engineering and Applications (ICKEA).

[8]  Zbigniew W. Ras,et al.  Association Action Rules , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[9]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[10]  Michael J. Franklin,et al.  Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.

[11]  Bernard Dousset,et al.  Multi-criterion Real Time Tweet Summarization Based upon Adaptive Threshold , 2016, 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[12]  Carlo Curino,et al.  Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.

[13]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[14]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[15]  Angelina A. Tzacheva,et al.  Action Rules for Sentiment Analysis on Twitter Data Using Spark , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[16]  Zbigniew W. Ras,et al.  Extracting Rules from Incomplete Decision Systems: System ERID , 2006, Foundations and Novel Approaches in Data Mining.

[17]  Dong Zhou,et al.  Inferring Your Expertise from Twitter: Integrating Sentiment and Topic Relatedness , 2016, 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[18]  Zbigniew W. Ras,et al.  Action-Rules: How to Increase Profit of a Company , 2000, PKDD.

[19]  Angelina A. Tzacheva,et al.  MR - Random Forest Algorithm for Distributed Action Rules Discovery , 2016 .

[20]  Felipe Bravo-Marquez,et al.  From Opinion Lexicons to Sentiment Classification of Tweets and Vice Versa: A Transfer Learning Approach , 2016, 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI).

[21]  Kewen Wang,et al.  Performance Prediction for Apache Spark Platform , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[23]  Zbigniew W. Ras,et al.  Action Rule Extraction from a Decision Table: ARED , 2008, ISMIS.

[24]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[25]  Angelina A. Tzacheva,et al.  Action rules for sentiment analysis using Twitter , 2020 .