Visualization of High-Level Associations from Twitter Data

The Data Mining and Knowledge Discovery (KDD) process focuses on extracting useful information from large datasets. To support analysts in making decisions, a relevant research effort has been devoted to visualizing the extracted data mining models effectively. A particular attention has been paid to the discovery of strong association rules from textual data coming from social networks, which represent potentially relevant correlations among document terms. However, state-of-the-art rule visualization tools do not allow experts to visualize data correlations at different abstraction levels. Hence, the effectiveness of the proposed approaches is limited, especially when dealing with fairly sparse data. This chapter presents Twitter Generalized Rule Visualizer (TGRV), a novel text mining and visualization tool. It aims at supporting analysts in looking into the results of the generalized association rule mining process from textual data coming from Twitter supplied with WordNet taxonomies. Taxonomies are used for aggregating document terms into higher-level concepts. Generalized rules represent high-level associations among document terms. By exploiting taxonomy-based models, experts may look into the discovered data correlations from different perspectives and figure out interesting knowledge. Changing the perspective from which data correlations are visualized is shown to improve the readability and the usability of the generated rule-based model. The experimental results show the applicability and the usefulness of the proposed visualization tool on real textual data coming from Twitter. The visualized data correlations are shown to be valuable for advanced analysis, such as topic trend and user behavior analysis.

[1]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[2]  Ke Wang,et al.  Visually Aided Exploration of Interesting Association Rules , 1999, PAKDD.

[3]  Masaru Kitsuregawa,et al.  FP-tax: tree structure based generalized association rule mining , 2004, DMKD '04.

[4]  Marc Cheong,et al.  Integrating web-based intelligence retrieval and decision-making from the twitter trends knowledge base , 2009, CIKM-SWSM.

[5]  Yiming Ma,et al.  Analyzing the interestingness of association rules from the temporal dimension , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[6]  Thanaruk Theeramunkong,et al.  A new method for finding generalized frequent itemsets in generalized association rule mining , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[7]  Rüdiger Wirth,et al.  A New Algorithm for Faster Mining of Generalized Association Rules , 1998, PKDD.

[8]  Herman Chernoff,et al.  The Use of Faces to Represent Points in k- Dimensional Space Graphically , 1973 .

[9]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[10]  Kevin J. O'Sullivan,et al.  Strategic Intellectual Capital Management in Multinational Organizations: Sustainability and Successful Implications , 2009 .

[11]  Raymond Y. K. Lau,et al.  Toward a Fuzzy Domain Ontology Extraction Method for Adaptive e-Learning , 2009, IEEE Transactions on Knowledge and Data Engineering.

[12]  R. V. Kulkarni,et al.  REVIEW OF LITERATURE ON DATA MINING , 2012 .

[13]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[14]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[15]  Jae-Gil Lee,et al.  MoveMine: Mining moving object data for discovery of animal movement patterns , 2011, TIST.

[16]  Luca Cagliero,et al.  Support driven opportunistic aggregation for generalized itemset extraction , 2010, 2010 5th IEEE International Conference Intelligent Systems.

[17]  Virgílio A. F. Almeida,et al.  Characterizing user navigation and interactions in online social networks , 2012, Inf. Sci..

[18]  Jiawei Han,et al.  Mining Multiple-Level Association Rules in Large Databases , 1999, IEEE Trans. Knowl. Data Eng..

[19]  Sharma Chakravarthy,et al.  Visualization of association rules over relational DBMSs , 2003, SAC '03.

[20]  Luca Cagliero Discovering Temporal Change Patterns in the Presence of Taxonomies , 2013, IEEE Transactions on Knowledge and Data Engineering.

[21]  Luca Cagliero,et al.  Context-Aware User and Service Profiling by Means of Generalized Association Rules , 2009, KES.

[22]  Rui Li,et al.  Exploring social tagging graph for web object classification , 2009, KDD.

[23]  Elena Baralis,et al.  Data mining techniques for effective and scalable traffic analysis , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..

[24]  Vipin Kumar,et al.  Discovery of Web Robot Sessions Based on their Navigational Patterns , 2004, Data Mining and Knowledge Discovery.

[25]  Lucy T. Nowell,et al.  ThemeRiver: Visualizing Thematic Changes in Large Document Collections , 2002, IEEE Trans. Vis. Comput. Graph..

[26]  Pak Chung Wong,et al.  Visualizing association rules for text mining , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[27]  Engelbert Mephu Nguifo,et al.  Emulating a cooperative behavior in a generic association rule visualization tool , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[28]  Nick Koudas,et al.  TwitterMonitor: trend detection over the twitter stream , 2010, SIGMOD Conference.

[29]  Luca Cagliero,et al.  Generalized association rule mining with constraints , 2012, Inf. Sci..

[30]  Hector Garcia-Molina,et al.  Social tag prediction , 2008, SIGIR '08.

[31]  Songqing Chen,et al.  Analyzing patterns of user content generation in online social networks , 2009, KDD.

[32]  Qing Li,et al.  An Effective News Recommendation in Social Media Based on Users' Preference , 2008, 2008 International Workshop on Education Technology and Training & 2008 International Workshop on Geoscience and Remote Sensing.

[33]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[34]  Leslie Gadman,et al.  Multinational Intellect: The Synergistic Power of Cross Cultural Knowledge Networks , 2011 .

[35]  Michael H. Böhlen,et al.  Visual Data Mining: An Introduction and Overview , 2008, Visual Data Mining.

[36]  Daniel A. Keim,et al.  Information Visualization and Visual Data Mining , 2002, IEEE Trans. Vis. Comput. Graph..

[37]  Simon Fong,et al.  Visualizing e-Government portal and its performance in WEBVS , 2010, 2010 Fifth International Conference on Digital Information Management (ICDIM).

[38]  Daniel A. Keim,et al.  Designing Pixel-Oriented Visualization Techniques: Theory and Applications , 2000, IEEE Trans. Vis. Comput. Graph..

[39]  Xin Li,et al.  Tag-based social interest discovery , 2008, WWW.

[40]  Timo Ropinski,et al.  Survey of glyph-based visualization techniques for spatial multivariate medical data , 2011, Comput. Graph..

[41]  Pourang Irani,et al.  WiFIsViz: Effective Visualization of Frequent Itemsets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[42]  Fabrice Guillet,et al.  Exploratory Visualization for Association Rule Rummaging , 2003, KDD 2003.

[43]  Corrado Loglisci,et al.  Mining Generalized Association Rules on Biomedical Literature , 2005, IEA/AIE.