An improved Apriori algorithm based on a support weight matrix for data mining in transaction databases

Data mining is the process of automatically discovering hidden information or knowledge in large databases. This paper proposes an improved Apriori algorithm that reduces the number of database scans and reflects the differing importance of items and transactions, so that more valuable information can be extracted. The algorithm scans the transaction database to build a 0–1 transaction matrix, from which the weighted support and confidence are computed. Items and transactions are weighted to reflect their importance in the transaction database. Experimental results, both qualitative and quantitative, show that the improved algorithm shortens the running time and reduces the memory requirement and the number of I/O operations. Meanwhile, the support for rare items tends to increase while the support for other items decreases slightly, so hidden and valuable items can be extracted effectively.
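The abstract does not give the weighting formulas, so the following is only a minimal sketch of how weighted support and confidence might be computed from a 0–1 transaction matrix. The weighting scheme is an assumption made for illustration: each transaction's weight is taken as the mean weight of the items it contains, and an itemset's weighted support is the weight share of the transactions that contain it. All function names, item names, and weights are hypothetical and are not taken from the paper.

```python
def build_matrix(transactions, items):
    """Return a 0-1 matrix: one row per transaction, one column per item."""
    return [[1 if item in t else 0 for item in items] for t in transactions]


def transaction_weights(matrix, item_weights):
    """Assumed scheme: a transaction's weight is the mean weight of its items."""
    weights = []
    for row in matrix:
        count = sum(row)
        total = sum(w for bit, w in zip(row, item_weights) if bit)
        weights.append(total / count if count else 0.0)
    return weights


def weighted_support(matrix, t_weights, cols):
    """Weight share of the transactions that contain every column in cols."""
    covered = sum(
        w for row, w in zip(matrix, t_weights) if all(row[c] for c in cols)
    )
    total = sum(t_weights)
    return covered / total if total else 0.0


def weighted_confidence(matrix, t_weights, antecedent, consequent):
    """Weighted support of the whole rule divided by that of its antecedent."""
    base = weighted_support(matrix, t_weights, antecedent)
    both = weighted_support(matrix, t_weights, antecedent + consequent)
    return both / base if base else 0.0


# Hypothetical data: "caviar" is rare but is given a higher item weight.
items = ["bread", "milk", "caviar"]
transactions = [{"bread", "milk"}, {"bread"}, {"milk", "caviar"}, {"bread", "milk"}]
item_weights = [1.0, 1.0, 3.0]

# The matrix is built in a single pass over the transactions; every later
# support count reads the matrix instead of rescanning the database.
matrix = build_matrix(transactions, items)
t_weights = transaction_weights(matrix, item_weights)

print(weighted_support(matrix, t_weights, [2]))  # caviar
print(weighted_support(matrix, t_weights, [0]))  # bread
print(weighted_confidence(matrix, t_weights, [2], [1]))  # caviar -> milk
```

Under these assumed weights, the rare item's weighted support (0.4) is higher than its unweighted support (1 of 4 transactions, 0.25), while the common item's weighted support (0.6) is lower than its unweighted support (0.75). This matches the qualitative behaviour the abstract describes, though the paper's actual weights may be defined differently.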
