MapReduce-Based Frequent Pattern Mining Framework with Multiple Item Support

The analysis of big data mining for frequent patterns is become even more problematic. Many efficient itemset mining algorithms to set a multiple support values for each transaction which could seem feasible as real life applications. To solve problem of single support have been discovered in the past. Since, we know that parallel and distributed computing are valid approaches to deal with large datasets. In order to reduce the search space, we using MISFP-growth algorithm without the process of rebuilding and post pruning steps. Accordingly, in this paper we proposed a model to use of MapReduce framework for implement the parallelization under multi-sup values, thereby improving the overall performance of mining frequent patterns and rare items accurately and efficiently.

[1]  Heikki Mannila,et al.  Fast Discovery of Association Rules , 1996, Advances in Knowledge Discovery and Data Mining.

[2]  Yahya Slimani,et al.  MS-FP-Growth: A Multi-support Version of FP-Growth Algorithm , 2014 .

[3]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[4]  Dina M. Ibrahim,et al.  Improved FP-growth Algorithm with Multiple Minimum Supports Using Maximum Constraints , 2015 .

[5]  Yen-Liang Chen,et al.  Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism , 2004, Decision Support Systems.

[6]  Songfeng Lu,et al.  AN ENHANCED FREQUENT PATTERN GROWTH BASED ON MAP REDUCE FOR MINING ASSOCIATION RULES , 2016 .

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  Azura Che Soh,et al.  Design Stable Robust Intelligent Nonlinear Controller for 6- DOF Serial Links Robot Manipulator , 2014 .

[9]  M. Villari,et al.  Huge amount of heterogeneous sensed data needs the cloud , 2012, International Multi-Conference on Systems, Sygnals & Devices.

[10]  Wil Van Der Aalst,et al.  Editorial: “Business Process Intelligence: Connecting Data and Processes” , 2015 .

[11]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[12]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[13]  R. B. V. Subramanyam,et al.  Mining Interesting Infrequent Itemsets from Very Large Data based on MapReduce Framework , 2015 .

[14]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[15]  Neal Leavitt,et al.  Storage Challenge: Where Will All That Big Data Go? , 2013, Computer.