Fast Algorithms for Discovering Sequential Patterns in Massive Datasets

Problem statement: Sequential pattern mining is a specific data mining task, applied particularly to retail data. The task is to discover all sequential patterns that meet a user-specified minimum support, where the support of a pattern is the number of data-sequences that contain the pattern. Approach: Several algorithms for finding sequential patterns already exist, such as AprioriAll and Generalized Sequential Patterns (GSP). We present two fast and efficient algorithms, AprioriAllSID and GSPSID, for mining sequential patterns that are fundamentally different from these known algorithms. Results: The proposed algorithms were implemented and their performance was compared experimentally with AprioriAll and GSP. We also combined AprioriAllSID with AprioriAll into a hybrid algorithm, called AprioriAll Hybrid. Conclusion: The implementation shows that the execution time needed to find sequential patterns depends on the total number of candidates generated at each level and on the time taken to scan the database. Our performance study shows that the proposed algorithms perform very well compared with the best existing algorithms.
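The support definition used in the problem statement can be illustrated with a minimal Python sketch. It is not the AprioriAllSID or GSPSID algorithm, whose details are not given here. It assumes a data-sequence is an ordered list of itemsets and that a pattern is contained in a data-sequence when each pattern element is a subset of a distinct, later-occurring element. The function names (`contains`, `support`, `frequent`) and the toy retail database `db` are hypothetical, and extensions such as time constraints or taxonomies are ignored.

```python
from typing import FrozenSet, List

Itemset = FrozenSet[str]
Sequence_ = List[Itemset]  # ordered list of itemsets (one customer's transactions)


def contains(data_sequence: Sequence_, pattern: Sequence_) -> bool:
    """Return True if `pattern` is a subsequence of `data_sequence`.

    Each pattern element must be a subset of some element of the
    data-sequence, and the matched elements must appear in order.
    Greedy earliest matching is sufficient for this check.
    """
    pos = 0  # index of the next pattern element still to be matched
    for element in data_sequence:
        if pos < len(pattern) and pattern[pos] <= element:
            pos += 1
    return pos == len(pattern)


def support(database: List[Sequence_], pattern: Sequence_) -> int:
    """Number of data-sequences in `database` that contain `pattern`."""
    return sum(1 for seq in database if contains(seq, pattern))


def frequent(database: List[Sequence_], candidates: List[Sequence_],
             min_support: int) -> List[Sequence_]:
    """Keep only the candidates whose support meets the minimum."""
    return [p for p in candidates if support(database, p) >= min_support]


# Hypothetical toy database: four customers, each with ordered purchases.
db: List[Sequence_] = [
    [frozenset({"bread"}), frozenset({"milk"}), frozenset({"butter"})],
    [frozenset({"bread", "milk"}), frozenset({"butter"})],
    [frozenset({"milk"}), frozenset({"bread"})],
    [frozenset({"bread"}), frozenset({"butter", "jam"})],
]

bread_then_butter = [frozenset({"bread"}), frozenset({"butter"})]
milk_then_bread = [frozenset({"milk"}), frozenset({"bread"})]

print(support(db, bread_then_butter))
print(support(db, milk_then_bread))
print(frequent(db, [bread_then_butter, milk_then_bread], min_support=2))
```

Each call to `support` scans the whole database once per candidate, which makes concrete why the execution time depends both on the number of candidates generated at each level and on the cost of scanning the database.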
