Survey on high utility oriented sequential pattern mining

Main purpose of using different data mining techniques is to find novel, potentially useful patterns which can be useful in real world applications to derive best knowledge and using it into useful way. For identifying the usefulness of patterns, many constraints has been proposed by many researchers are like time-interval between itemsets, utility parameters (price, profit, quantity etc.), weight of an itemset etc. Among these, time-interval between items is useful to learn about at what time next item will be purchased and utility parameters are useful to mine patterns which are more profitable in real world from huge transaction database. Here, we miss hybridization of these constraints so that more useful patterns can be derived. Hybridization of these constraints can give more useful pattern by combining advantages of each constraint. Organization of this survey is given as follows: In first section we introduced basic terms like Data mining, Frequent pattern mining, Sequential pattern mining, Time-interval Sequential Pattern Mining and utility mining. In second section we summarizes some important previous research work related to Sequential pattern mining, Time-interval based sequential pattern mining and utility mining. In last section we concluded the survey work by suggesting some future directions for discovering some strong sequential rules that can infer from extracted patterns.

[1]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[2]  Longbing Cao,et al.  USpan: an efficient algorithm for mining high utility sequential patterns , 2012, KDD.

[3]  Qiang Yang,et al.  Mining high utility itemsets , 2003, Third IEEE International Conference on Data Mining.

[4]  Yue-Shi Lee,et al.  Mining non-redundant time-gap sequential patterns , 2013, Applied Intelligence.

[5]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[6]  Chin-Chen Chang,et al.  A Fast Algorithm for Mining Share-Frequent Itemsets , 2005, APWeb.

[7]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[8]  Benjamin C. M. Fung,et al.  Direct Discovery of High Utility Itemsets without Candidate Generation , 2012, 2012 IEEE 12th International Conference on Data Mining.

[9]  Howard J. Hamilton,et al.  Extracting Share Frequent Itemsets with Infrequent Subsets , 2003, Data Mining and Knowledge Discovery.

[10]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[11]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[12]  Tzung-Pei Hong,et al.  Mining high average-utility itemsets , 2009, 2009 IEEE International Conference on Systems, Man and Cybernetics.

[13]  Ying Liu,et al.  A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets , 2005, PAKDD.

[14]  Chichang Jou,et al.  A data mining approach to discovering reliable sequential patterns , 2013, J. Syst. Softw..

[15]  S. Jayanthi,et al.  A Fast Algorithm for Mining High Utility Itemsets , 2009, 2009 IEEE International Advance Computing Conference.

[16]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[17]  Tony Cheng-Kui Huang Knowledge gathering of fuzzy multi-time-interval sequential patterns , 2010, Inf. Sci..

[18]  A. Choudhary,et al.  A fast high utility itemsets mining algorithm , 2005, UBDM '05.

[19]  Yen-Liang Chen,et al.  Discovering fuzzy time-interval sequential patterns in sequence databases , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[20]  Bin Luo,et al.  Mining long high utility itemsets in transaction databases , 2007 .

[21]  Howard J. Hamilton,et al.  Mining itemset utilities from transaction databases , 2006, Data Knowl. Eng..

[22]  Nick Cercone,et al.  Share Based Measures for Itemsets , 1997, PKDD.

[23]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[24]  Chin-Chen Chang,et al.  Direct Candidates Generation: A Novel Algorithm for Discovering Complete Share-Frequent Itemsets , 2005, FSKD.

[25]  Ming-Tat Ko,et al.  Discovering time-interval sequential patterns in sequence databases , 2003, Expert Syst. Appl..

[26]  Yen-Liang Chen,et al.  On mining multi-time-interval sequential patterns , 2009, Data Knowl. Eng..

[27]  Jianying Hu,et al.  High-utility pattern mining: A method for discovery of high-utility item sets , 2007, Pattern Recognit..

[28]  Umeshwar Dayal,et al.  FreeSpan: frequent pattern-projected sequential pattern mining , 2000, KDD '00.

[29]  Ming-Yen Lin,et al.  High utility pattern mining using the maximal itemset property and lexicographic tree structures , 2012, Inf. Sci..

[30]  Hayato Yamana,et al.  Generalized Sequential Pattern Mining with Item Intervals , 2006, J. Comput..

[31]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[32]  Chin-Chen Chang,et al.  Isolated items discarding strategy for discovering high utility itemsets , 2008, Data Knowl. Eng..