Integration of K-means algorithm and AprioriSome algorithm for fuzzy sequential pattern mining

Since Agrawal and Srikant proposed sequential pattern mining in 1995, there have been many scholars working to improve the efficiency and reduce the processing time of algorithms. This study intends to propose a fuzzy AprioriSome algorithm for fuzzy sequential patterns mining with integration with clustering technique, K-means algorithm. Two experiments performed using transaction data provided by a securities firm and foodmarket data from SQL sever 2000 demonstrate the strength of fuzzy AprioriSome sequential pattern mining in mining large quantity of transaction data.

[1]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[2]  Kyuseok Shim,et al.  Mining Sequential Patterns with Regular Expression Constraints , 2002, IEEE Trans. Knowl. Data Eng..

[3]  Tzung-Pei Hong,et al.  Mining fuzzy sequential patterns from quantitative data , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[4]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[5]  Man Hon Wong,et al.  Finding Fuzzy Sets for the Mining of Fuzzy Association Rules for Numerical Attributes , 1998 .

[6]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[7]  A. Gyenesei,et al.  Determining Fuzzy Sets for Quantitative Attributes in Data Mining Problems , 2000 .

[8]  Poonam Sharma,et al.  PrefixSpan: Mining Sequential Patterns by Prefix- Projected Pattern , 2011 .

[9]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[10]  T. Hong,et al.  Mining fuzzy sequential patterns from multiple-item transactions , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[11]  Yi-Chung Hu,et al.  Discovery of fuzzy sequential patterns for fuzzy partitions in quantitative attributes , 2001, Proceedings ACS/IEEE International Conference on Computer Systems and Applications.

[12]  Umeshwar Dayal,et al.  PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth , 2001, ICDE 2001.

[13]  Reda Alhajj,et al.  Multi-objective genetic algorithm based approach for optimizing fuzzy sequential patterns , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[14]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.