Latent Semantic Analysis for Mining Rules in Big Data Environment

This Finding valuable rules from a given data set and detecting events using the rules are recent popular research topics. The association rule reduction technique finds unnecessary associations rules and removes them for extracting meaningful relationship between data. The researches on enhancing the reduction rate of final association rules and efficient data structure minimizing the number of scans have been actively performed to reduce the execution time. The previous schemes sometimes fail to reduce the association rules while more reduction is possible since they do not consider the relationship between the data items. In this paper we propose Latent Semantic Analysis (LSA) reduction technique for mining valuable rules at high speed regardless of the number of items. The proposed scheme extracts the relationship such as inverse and equivalence between a set of items. Computer simulation reveals that it significantly increases credibility, support, processing time, reduction rate of the rules and rejection rate of the item, compared to the existing schemes.

[1]  Hee Yong Youn,et al.  Reduction of Association Rules for Big Data Sets in Socially-Aware Computing , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[2]  Patrick F. Reidy An Introduction to Latent Semantic Analysis , 2009 .

[3]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[4]  Inge S. Helland,et al.  Central Limit Theorems for Martingales with Discrete or Continuous Time , 1982 .

[5]  Fabrice Guillet,et al.  Knowledge-Based Interactive Postmining of Association Rules Using Ontologies , 2010, IEEE Transactions on Knowledge and Data Engineering.

[6]  Soon Myoung Chung,et al.  Mining association rules using inverted hashing and pruning , 2002, Inf. Process. Lett..

[7]  Xiao Wu,et al.  Adaptive association rule mining for web video event classification , 2013, 2013 IEEE 14th International Conference on Information Reuse & Integration (IRI).

[8]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[9]  Sang Hyuk Son,et al.  Using fuzzy logic for robust event detection in wireless sensor networks , 2012, Ad Hoc Networks.

[10]  E. Ilavarasan,et al.  Performance Evaluation of Semantic Approaches for Automatic Clustering of Similar Web Services , 2014, 2014 World Congress on Computing and Communication Technologies.

[11]  G. Pearlson,et al.  Semantic Clustering of Category Fluency in Schizophrenia Examined with Singular Value Decomposition , 2012, Journal of the International Neuropsychological Society.

[12]  Yanxi Liu Study on Application of Apriori Algorithm in Data Mining , 2010, 2010 Second International Conference on Computer Modeling and Simulation.

[13]  Edmon Begoli,et al.  Design Principles for Effective Knowledge Discovery from Big Data , 2012, 2012 Joint Working IEEE/IFIP Conference on Software Architecture and European Conference on Software Architecture.