Autonomic Discovery of News Evolvement in Twitter

Recently the continuous increase in data sizes has resulted in many data processing challenges. This increase has compelled data users to find automatic means of looking into databases to bring out vital information. Retrieving information from ‘Big data’, (as it is often referred to) can be likened to finding ‘a needle in the haystack’. It is worthy of note that while big data has several computational challenges, it also serves as gateway to technological preparedness in making the world a global village. Social media sites (of which Twitter is one) are known to be big data collectors as well as an open source for information retrieval. Easy access to social media sites and the advancement of technology tools such as the computer and smart devices have made it convenient for different entities to store enormous data in real time. Twitter is known to be the most powerful and most popular microbloging tool in social media. It offers its users the opportunity of posting and receiving instantaneous information from the network. Traditional news media follow the activities on Twitter network in order to retrieve interesting tweets that can be used to enhance their news reports and news updates. Twitter users include hashtags symbols (#) as prefix to keywords used in tweets to describe its content and to enhance the readability of their tweets. This chapter uses the Apriori method for Association Rule Mining (ARM) and a novel methodology termed Rule Type Identification-Mapping (RTI-Mapping) which is inherited from Transaction-based Rule Change Mining TRCM (Adedoyin-Olowe et al., 2013) and Transaction-based Rule Change Mining-Rule Type Identification (TRCM-RTI) (Gomes et al., 2013) to map Association Rules (ARs) detected in tweets’ hashtags to evolving news reports and news updates of traditional news agents in real life. TRCM uses Association Rule Mining (ARM) to analyse tweets on the same topic over consecutive periods t and t + 1 using Rule Matching (RM) to detected changes in ARs such as emerging, unexpected, new and dead rules. This is obtained by setting user-defined Rule Matching Threshold (RMT) to match rules in tweets at time t with those in tweets at t + 1 in order to ascertain rules that fall into the different patterns. TRCM-RTI is a methodology built from TRCM, it identifies rule types of evolving ARs present in tweets’ hashtags at different time periods. This chapter adopts RTI-Mapping from methodologies in (Adedoyin-Olowe et al., 2013) and (Gomes et al., 2013) to map ARs with online evolving news of top traditional news agents in order to detect and track news and news updates of evolving events. This is an initial experiment of ARs mapping to evolving news. The mapping is done manually at this stage and the methodology is validated using four events and news topics as case studies. The experiments show substantial result on the selected news topics.

[1]  Bu-Sung Lee,et al.  Event Detection in Twitter , 2011, ICWSM.

[2]  Ana-Maria Popescu,et al.  Detecting controversial events from twitter , 2010, CIKM.

[3]  Mohamed Medhat Gaber,et al.  Rule Type Identification Using TRCM for Trend Analysis in Twitter , 2013, SGAI Conf..

[4]  Lon Safko,et al.  The Social Media Bible: Tactics, Tools, and Strategies for Business Success , 2009 .

[5]  H. Jaakkola,et al.  Will universities and university teachers become extinct in our bright online future? , 2013, 2013 36th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[6]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[7]  Miles Osborne,et al.  Streaming First Story Detection with application to Twitter , 2010, NAACL.

[8]  Andreas M. Kaplan,et al.  The fairyland of Second Life: Virtual social worlds and how to use them , 2009 .

[9]  Yutaka Matsuo,et al.  Semantic Twitter: Analyzing Tweets for Real-Time Event Notification , 2008, BlogTalk.

[10]  Mohamed Medhat Gaber,et al.  TRCM: A Methodology for Temporal Analysis of Evolving Concepts in Twitter , 2013, ICAISC.

[11]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[12]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[13]  Michael L. Brodie,et al.  The meaningful use of big data: four perspectives -- four challenges , 2012, SGMD.

[14]  C. Lynch Big data: How do your data grow? , 2008, Nature.

[15]  Jacek M. Zurada,et al.  Artificial Intelligence and Soft Computing, 10th International Conference, ICAISC 2010, Zakopane, Poland, June 13-17, 2010, Part I , 2010, International Conference on Artificial Intelligence and Soft Computing.

[16]  Duen-Ren Liu,et al.  Mining the change of event trends for decision support in environmental scanning , 2009, Expert Syst. Appl..

[17]  Deepayan Chakrabarti,et al.  Event Summarization Using Tweets , 2011, ICWSM.

[18]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[19]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009 .

[20]  Charu C. Aggarwal,et al.  An Introduction to Social Network Data Analytics , 2011, Social Network Data Analytics.

[21]  Soung Hie Kim,et al.  Mining the change of customer behavior in an internet shopping mall , 2001, Expert Syst. Appl..

[22]  T. Murata,et al.  Breaking News Detection and Tracking in Twitter , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[23]  Hila Becker,et al.  Identifying content for planned events across social media sites , 2012, WSDM '12.

[24]  Dave Evans,et al.  Social Media Marketing: The Next Generation of Business Engagement , 2010 .

[25]  Erik Qualman Socialnomics: How Social Media Transforms the Way We Live and Do Business , 2009 .

[26]  Andrew B. Whinston,et al.  Social Computing: An Overview , 2007, Commun. Assoc. Inf. Syst..

[27]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[28]  Mary Beth Rosson,et al.  How and why people Twitter: the role that micro-blogging plays in informal communication at work , 2009, GROUP.

[29]  A. Kaplan If you love something, let it go mobile: Mobile marketing and mobile social media 4x4 , 2012 .

[30]  Mario Cataldi,et al.  Emerging topic detection on Twitter based on temporal and social terms evaluation , 2010, MDMKDD '10.

[31]  Hila Becker,et al.  Beyond Trending Topics: Real-World Event Identification on Twitter , 2011, ICWSM.

[32]  Huan Liu,et al.  Community Detection and Mining in Social Media , 2010, Community Detection and Mining in Social Media.

[33]  Gautam Shroff,et al.  Catching the Long-Tail: Extracting Local News Events from Twitter , 2012, ICWSM.