An Improved Progressive Sampling based Approach for Association Rule Mining

Data Mining is the multistage process of extraction of useful information from the large database. Association rule mining is one of the important techniques of data mining in which relationships among the items present in the transactions are discovered. There are different algorithms are available in the field of data mining for association rule mining but most of them are time consuming hence the run time and memory overheads incurred is extremely high specially in the case of very large database. Sampling is one of the remarkable approach which can be used to speed up the process of association rule mining hence it is a approach to reduce the complexity of association rule mining technique to some extent but still consuming comparable time and memory. A progressive sampling based approach is a noval expert approach in the field of association rule mining to reduce the overheads of usual sampling based approaches. It is very effective in case of the large databases. In this paper, we have extended the Progressive sampling based approach presented by Umarani & Punithavalli,2009[22] and performed an extensive experimental analysis of the progressive samplingbased approach for the different Partitioned itemset 1/3,1/4,2/3,3/4 with the sample dataset also in addition the performance of this Improved Progressive Sampling Based Approach is evaluated with the Progressive sampling based approach by Umarani & Punithavalli,2009[22]. The experimental results illustrate the complexity of an algorithm in terms of run time as well as the memory utilization. Complete implementation has been done in Java Jdk 6.1. and MySQL5.0 on the Sample dataset CompPeriPurchase. General Terms Data Mining

[1]  Herbert Schildt Java the Complete Reference, Seventh Edition , 2006 .

[2]  J. van Leeuwen,et al.  Intelligent Data Engineering and Automated Learning , 2003, Lecture Notes in Computer Science.

[3]  Yogish Sabharwal,et al.  Analysis of sampling techniques for association rule mining , 2009, ICDT '09.

[4]  Yosef Hasan Jbara,et al.  An Improved Algorithm for Mining Association Rules in Large Databases , 2011 .

[5]  Jaideep Srivastava,et al.  Web Mining: Pattern Discovery from World Wide Web Transactions , 1996 .

[6]  Victor Maojo,et al.  A Survey of Data Mining Techniques , 2000, ISMDA.

[7]  Hannu Toivonen,et al.  Sampling Large Databases for Association Rules , 1996, VLDB.

[8]  Dimitris Kanellopoulos,et al.  Association Rules Mining: A Recent Overview , 2006 .

[9]  Gopal K Gupta,et al.  Introduction to Data Mining with Case Studies , 2011 .

[10]  Vikram Vaswani MySQL: The Complete Reference , 2004 .

[11]  Srinivasan Parthasarathy,et al.  Efficient progressive sampling for association rules , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[12]  M. Punithavalli,et al.  Developing Novel and Effective Approach for Association Rule Mining Using Progressive Sampling , 2009, 2009 Second International Conference on Computer and Electrical Engineering.

[13]  Tsau Y. Lin Sampling in association rule mining , 2004, SPIE Defense + Commercial Sensing.

[14]  Ferenc Bodon,et al.  A fast APRIORI implementation , 2003, FIMI.

[15]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[16]  Srinivasan Parthasarathy,et al.  Evaluation of sampling for data mining of association rules , 1997, Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications.

[17]  M. Punithavalli,et al.  A Novel Progressive Sampling based Approach for Effective Mining of Association Rules , 2010 .

[18]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[19]  M. Punithavalli,et al.  On developing an effectual progressive sampling-based approach for association rule discovery , 2010, 2010 2nd IEEE International Conference on Information Management and Engineering.

[20]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[21]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[22]  Basel A. Mahafzah,et al.  A new sampling technique for association rule mining , 2009, J. Inf. Sci..