Comparative Study of Various Sequential Pattern Mining Algorithms

In Sequential pattern mining represents an important class of data mining problems with wide range of applications. It is one of the very challenging problems because it deals with the careful scanning of a combinatorially large number of possible subsequence patterns. Broadly sequential pattern ming algorithms can be classified into three types namely Apriori based approaches, Pattern growth algorithms and Early pruning algorithms. These algorithms have further classification and extensions. Detailed explanation of each algorithm along with its important features, pseudo code, advantages and disadvantages is given in the subsequent sections of the paper. At the end a comparative analysis of all the algorithms with their supporting features is given in the form of a table. This paper tries to enrich the knowledge and understanding of various approaches of sequential pattern mining. General Terms Sequential Pattern Mining, Subsequence detection, Candidate pruning.

[1]  Bharat Chaudhari,et al.  A Comparative Study of Sequential Pattern Mining Algorithms , 2013 .

[2]  Nizar R. Mabroukeh,et al.  A taxonomy of sequential pattern mining algorithms , 2010, CSUR.

[3]  Simon Fraser,et al.  DISCOVERING AND MINING USER WEB-PAGE TRAVERSAL PATTERNS , 2001 .

[4]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[5]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[6]  Murat Ali Bayir,et al.  Performance Comparison of Pattern Discovery Methods on Web Log Data , 2006, IEEE International Conference on Computer Systems and Applications, 2006..

[7]  Amor Lazzez,et al.  Sequential Mining: Patterns and Algorithms Analysis , 2013, ArXiv.

[8]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[9]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[10]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.