Sentimental causal rule discovery from Twitter

Social media, especially Twitter is now one of the most popular platforms where people can freely express their opinion. However, it is difficult to extract important summary information from many millions of tweets sent every hour. In this work we propose a new concept, sentimental causal rules, and techniques for extracting sentimental causal rules from textual data sources such as Twitter which combine sentiment analysis and causal rule discovery. Sentiment analysis refers to the task of extracting public sentiment from textual data. The value in sentiment analysis lies in its ability to reflect popularly voiced perceptions that are stated in natural language. Causal rules on the other hand indicate associations between different concepts in a context where one (or several concepts) cause(s) the other(s). We believe that sentimental causal rules are an effective summarization mechanism that combine causal relations among different aspects extracted from textual data as well as the sentiment embedded in these causal relationships. In order to show the effectiveness of sentimental causal rules, we have conducted experiments on Twitter data collected on the Kurdish political issue in Turkey which has been an ongoing heated public debate for many years. Our experiments on Twitter data show that sentimental causal rule discovery is an effective method to summarize information about important aspects of an issue in Twitter which may further be used by politicians for better policy making.

[1]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[2]  Cemal Yilmaz,et al.  Automatically identifying a software product's quality attributes through sentiment analysis of tweets , 2013, 2013 1st International Workshop on Natural Language Analysis in Software Engineering (NaturaLiSE).

[3]  Yong Shi,et al.  The Role of Text Pre-processing in Sentiment Analysis , 2013, ITQM.

[4]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[5]  Syin Chan,et al.  Extracting Causal Knowledge from a Medical Database Using Graphical Patterns , 2000, ACL.

[6]  Ismail Hakki Toroslu,et al.  Sentiment Analysis of Turkish Political News , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[7]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[8]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[9]  Hao Sheng,et al.  Sentiment Classification of Web Review Using Association Rules , 2013, HCI.

[10]  Yücel Saygin,et al.  Adaptation and Use of Subjectivity Lexicons for Domain Dependent Sentiment Classification , 2012, 2012 IEEE 12th International Conference on Data Mining Workshops.

[11]  ChengXiang Zhai,et al.  Mining causal topics in text data: iterative topic modeling with time series feedback , 2013, CIKM.

[12]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[13]  Yücel Saygin,et al.  SU-Sentilab : A Classification System for Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[14]  Wen-Chi Hou Quality of Association Rules by Chi-Squared Test , 2009, Encyclopedia of Data Warehousing and Mining.

[15]  Daniela Garcia,et al.  COATIS, an NLP System to Locate Expressions of Actions Connected by Causality Links , 1997, EKAW.

[16]  Bruno Ohana,et al.  Sentiment Classification of Reviews Using SentiWordNet , 2009 .

[17]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[18]  Rajeev Motwani,et al.  Scalable Techniques for Mining Causal Structures , 1998, Data Mining and Knowledge Discovery.

[19]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[20]  Luís Cavique,et al.  A scalable algorithm for the market basket analysis , 2007 .

[21]  Brian J. Taylor,et al.  Causal discovery in social media using quasi-experimental designs , 2010, SOMA '10.

[22]  Dan I. Moldovan,et al.  Text Mining for Causal Relations , 2002, FLAIRS.

[23]  Marek J. Druzdzel,et al.  A Hybrid Anytime Algorithm for the Construction of Causal Models From Sparse Data , 1999, UAI.

[24]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[25]  Gregory F. Cooper,et al.  A Simple Constraint-Based Algorithm for Efficiently Mining Observational Databases for Causal Relationships , 1997, Data Mining and Knowledge Discovery.